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Preface 


This updated edition is intended for a one- or two-term introductory course in discrete 
mathematics, based on my experience in teaching this course over many years and re- 
quests from users of previous editions. Formal mathematics prerequisites are minimal; 
calculus is not required. There are no computer science prerequisites. The book includes 
examples, exercises, figures, tables, sections on problem-solving, sections containing 
problem-solving tips, section reviews, notes, chapter reviews, self-tests, and computer 
exercises to help the reader master introductory discrete mathematics. In addition, an 
Instructor’s Guide and website are available. 

In the early 1980s there were few textbooks appropriate for an introductory course 
in discrete mathematics. However, there was a need for a course that extended students’ 
mathematical maturity and ability to deal with abstraction, which also included use- 
ful topics such as combinatorics, algorithms, and graphs. The original edition of this 
book (1984) addressed this need and significantly influenced the development of dis- 
crete mathematics courses. Subsequently, discrete mathematics courses were endorsed 
by many groups for several different audiences, including mathematics and computer 
science majors. A panel of the Mathematical Association of America (MAA) endorsed 
a year-long course in discrete mathematics. The Educational Activities Board of the 
Institute of Electrical and Electronics Engineers (IEEE) recommended a freshman dis- 
crete mathematics course. The Association for Computing Machinery (ACM) and IEEE 
accreditation guidelines mandated a discrete mathematics course. This edition, like its 
predecessors, includes topics such as algorithms, combinatorics, sets, functions, and 
mathematical induction endorsed by these groups. It also addresses understanding and 
constructing proofs and, generally, expanding mathematical maturity. 


New to This Edition 


The changes in this book, the eighth edition, result from comments and requests from 
numerous users and reviewers of previous editions of the book. This edition includes the 
following changes from the seventh edition: 


= The web icons in the seventh edition have been replaced by short URLs, making 

it possible to quickly access the appropriate web page, for example, by using a 
hand-held device. 

m= The exercises in the chapter self-tests no longer identify the relevant sections mak- 

ing the self-test more like a real exam. (The hints to these exercises do identify the 
relevant sections.) 
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= Examples that are worked problems clearly identify where the solution begins and 
ends. 


m= The number of exercises in the first three chapters (Sets and Logic; Proofs; and 
Functions, Sequences, and Relations) has been increased from approximately 1640 
worked examples and exercises in the seventh edition to over 1750 in the current 
edition. 


= Many comments have been added to clarify potentially tricky concepts (e.g., “sub- 
set” and “element of,’ collection of sets, logical equivalence of a sequence of 
propositions, logarithmic scale on a graph). 


= There are more examples illustrating diverse approaches to developing proofs and 
alternative ways to prove a particular result [see, e.g., Examples 2.2.4 and 2.2.8; 
Examples 6.1.3(c) and 6.1.12; Examples 6.7.7, 6.7.8, and 6.7.9; Examples 6.8.1 
and 6.8.2]. 


= A number of definitions have been revised to allow them to be more directly ap- 
plied in proofs [see, e.g., one-to-one function (Definition 3.1.22) and onto function 
(Definition 3.1.29)]. 


= Additional real-world examples (see descriptions in the following section) are in- 
cluded. 


= The altered definition of sequence (Definition 3.2.1) provides more generality and 
makes subsequent discussion smoother (e.g., the discussion of subsequences). 


m Exercises have been added (Exercises 40-49, Section 5.1) to give an example of 
an algebraic system in which prime factorization does not hold. 


= An application of the binomial theorem is used to prove Fermat’s little theorem 
(Exercises 40 and 41, Section 6.7). 


= There is now a randomized algorithm to search for a Hamiltonian cycle in a graph 
(Algorithm 8.3.10). 


= The Closest-Pair Problem (Section 13.1 in the seventh edition) has been integrated 
into Chapter 7 (Recurrence Relations) in the current edition. The algorithm to solve 
the closest-pair problem is based on merge sort, which is discussed and analyzed 
in Chapter 7. Chapter 13 in the seventh edition, which has now been removed, had 
only one additional section. 


= A number of recent books and articles have been added to the list of references, 
and several book references have been updated to current editions. 


m= The number of exercises has been increased to nearly 4500. (There were approx- 
imately 4200 in the seventh edition.) 


Contents and Structure 


Content Overview 


Chapter 1 Sets and Logic 

Coverage includes quantifiers and features practical examples such as using the Google 
search engine (Example 1.2.13). We cover translating between English and symbolic 
expressions as well as logic in programming languages. We also include a logic game 
(Example 1.6.15), which offers an alternative way to determine whether a quantified 
propositional function is true or false. 
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Chapter 2 Proofs 

Proof techniques discussed include direct proofs, counterexamples, proof by contradic- 
tion, proof by contrapositive, proof by cases, proofs of equivalence, existence proofs 
(constructive and nonconstructive), and mathematical induction. We present loop in- 
variants as a practical application of mathematical induction. We also include a brief, 
optional section on resolution proofs (a proof technique that can be automated). 


Chapter 3 Functions, Sequences, and Relations 

The chapter includes strings, sum and product notations, and motivating examples such 
as the Luhn algorithm for computing credit card check digits, which opens the chapter. 
Other examples include an introduction to hash functions (Example 3.1.15), pseudo- 
random number generators (Example 3.1.16). a real-world example of function compo- 
sition showing its use in making a price comparison (Example 3.1.45), an application of 
partial orders to task scheduling (Section 3.3), and relational databases (Section 3.6). 


Chapter 4 Algorithms 

The chapter features a thorough discussion of algorithms, recursive algorithms, and the 
analysis of algorithms. We present a number of examples of algorithms before getting 
into big-oh and related notations (Sections 4.1 and 4.2), thus providing a gentle introduc- 
tion and motivating the formalism that follows. We then continue with a full discussion 
of the “big oh,” omega, and theta notations for the growth of functions (Section 4.3). 
Having all of these notations available makes it possible to make precise statements 
about the growth of functions and the time and space required by algorithms. 

We use the algorithmic approach throughout the remainder of the book. We men- 
tion that many modern algorithms do not have all the properties of classical algorithms 
(e.g., many modern algorithms are not general, deterministic, or even finite). To illustrate 
the point, we give an example of a randomized algorithm (Example 4.2.4). Algorithms 
are written in a flexible form of pseudocode, which resembles currently popular lan- 
guages such as C, C++, and Java. (The book does not assume any computer science 
prerequisites; the description of the pseudocode used is given in Appendix C.) Among 
the algorithms presented are: 


Tiling (Section 4.4) 

Euclidean algorithm for finding the greatest common divisor (Section 5.3) 
RSA public-key encryption algorithm (Section 5.4) 

Generating combinations and permutations (Section 6.4) 

Merge sort (Section 7.3) 

Finding a closest pair of points (Section 7.4) 

Dijkstra’s shortest-path algorithm (Section 8.4) 

Backtracking algorithms (Section 9.3) 

Breadth-first and depth-first search (Section 9.3) 

Tree traversals (Section 9.6) 


Evaluating a game tree (Section 9.9) 


Finding a maximal flow in a network (Section 10.2) 


Chapter 5 Introduction to Number Theory 

The chapter includes classical results (e.g., divisibility, the infinitude of primes, funda- 
mental theorem of arithmetic), as well as algorithmic number theory (e.g., the Euclidean 
algorithm to find the greatest common divisor, exponentiation using repeated squaring, 
computing s and f such that gcd(a, b) = sa + tb, computing an inverse modulo an inte- 
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ger). The major application is the RSA public-key cryptosystem (Section 5.4). The calcu- 
lations required by the RSA public-key cryptosystem are performed using the algorithms 
previously developed in the chapter. 


Chapter 6 Counting Methods and the Pigeonhole Principle 

Coverage includes combinations, permutations, discrete probability (optional Sections 
6.5 and 6.6), and the Pigeonhole Principle. Applications include internet addressing 
(Section 6.1) and real-world pattern recognition problems in telemarketing (Example 
6.6.21) and virus detection (Example 6.6.22) using Bayes’ Theorem. 


Chapter 7 Recurrence Relations 
The chapter includes recurrence relations and their use in the analysis of algorithms. 


Chapter 8 Graph Theory 

Coverage includes graph models of parallel computers, the knight’s tour, Hamiltonian 
cycles, graph isomorphisms, and planar graphs. Theorem 8.4.3 gives a simple, short, el- 
egant proof of the correctness of Dijkstra’s algorithm. 


Chapter 9 Trees 
Coverage includes binary trees, tree traversals, minimal spanning trees, decision trees, 
the minimum time for sorting, and tree isomorphisms. 


Chapter 10 Network Models 
Coverage includes the maximal flow algorithm and matching. 


Chapter 11 Boolean Algebras and Combinatorial Circuits 
Coverage emphasizes the relation of Boolean algebras to combinatorial circuits. 


Chapter 12 Automata, Grammars, and Languages 

Our approach emphasizes modeling and applications. We discuss the SR flip-flop circuit 
in Example 12.1.11, and we describe fractals, including the von Koch snowflake, which 
can be described by special kinds of grammars (Example 12.3.19). 


Book frontmatter and endmatter 

Appendixes include coverage of matrices, basic algebra, and pseudocode. A reference 
section provides more than 160 references to additional sources of information. Front 
and back endpapers summarize the mathematical and algorithm notation used in the 
book. 


Features of Content Coverage 


= A strong emphasis on the interplay among the various topics. Examples of this 
include: 


e We closely tie mathematical induction to recursive algorithms (Section 4.4). 


e We use the Fibonacci sequence in the analysis of the Euclidean algorithm 
(Section 5.3). 


e Many exercises throughout the book require mathematical induction. 


e We show how to characterize the components of a graph by defining an 
equivalence relation on the set of vertices (see the discussion following 
Example 8.2.13). 


e We count the number of nonisomorphic n-vertex binary trees (Theorem 
9.8.12). 


= A strong emphasis on reading and doing proofs. We illustrate most proofs of 
theorems with annotated figures and/or motivate them by special Discussion sec- 
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tions. Separate sections (Problem-Solving Corners) show students how to attack 
and solve problems and how to do proofs. Special end-of-section Problem-Solving 
Tips highlight the main problem-solving techniques of the section. 


= A large number of applications, especially applications to computer science. 


Figures and tables illustrate concepts, show how algorithms work, elucidate 
proofs, and motivate the material. Several figures illustrate proofs of theorems. 
The captions of these figures provide additional explanation and insight into the 
proofs. 


Textbook Structure 


Each chapter is organized as follows: 


Chapter X Overview 

Section X.1 

Section X.1 Review Exercises 
Section X.1 Exercises 
Section X.2 

Section X.2 Review Exercises 
Section X.2 Exercises 


Chapter X Notes 

Chapter X Review 

Chapter X Self-Test 

Chapter X Computer Exercises 


In addition, most chapters have Problem-Solving Corners (see “Hallmark Features” 
for more information about this feature). 

Section review exercises review the key concepts, definitions, theorems, tech- 
niques, and so on of the section. All section review exercises have answers in the back 
of the book. Although intended for reviews of the sections, section review exercises can 
also be used for placement and pretesting. 

Chapter notes contain suggestions for further reading. Chapter reviews provide 
reference lists of the key concepts of the chapters. Chapter self-tests contain exer- 
cises based on material from throughout the chapter, with answers in the back of the 
book. 

Computer exercises include projects, implementation of some of the algorithms, 
and other programming related activities. Although there is no programming prerequisite 
for this book and no programming is introduced in the book, these exercises are provided 
for those readers who want to explore discrete mathematics concepts with a computer. 


Hallmark Features 


Exercises 


The book contains nearly 4500 exercises, approximately 150 of which are computer 
exercises. We use a star to label exercises felt to be more challenging than average. 
Exercise numbers in color (approximately one-third of the exercises) indicate that the 
exercise has a hint or solution in the back of the book. The solutions to most of the 
remaining exercises may be found in the Instructor’s Guide. A handful of exercises are 
clearly identified as requiring calculus. No calculus concepts are used in the main body 
of the book and, except for these marked exercises, no calculus is needed to solve the 
exercises. 
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Examples 


The book contains almost 650 worked examples. These examples show students how to 
tackle problems in discrete mathematics, demonstrate applications of the theory, clarify 
proofs, and help motivate the material. 


Problem-Solving Corners 


The Problem-Solving Corner sections help students attack and solve problems and show 
them how to do proofs. Written in an informal style, each is a self-contained section 
centered around a problem. The intent of these sections is to go beyond simply presenting 
a proof or a solution to the problem: we show alternative ways of attacking a problem, 
discuss what to look for in trying to obtain a solution to a problem, and present problem- 
solving and proof techniques. 

Each Problem-Solving Corner begins with a statement of a problem. We then dis- 
cuss ways to attack the problem, followed by techniques for finding a solution. After we 
present a solution, we show how to correctly write it up in a formal manner. Finally, we 
summarize the problem-solving techniques used in the section. Some sections include 
a Comments subsection, which discusses connections with other topics in mathematics 
and computer science, provides motivation for the problem, and lists references for fur- 
ther reading about the problem. Some Problem-Solving Corners conclude with a few 
exercises. 


Supplements and Technology 


Instructor's Solution Manual (downloadable) 


ISBN-10: 0-321-98309-2 | ISBN-13: 978-0-321-98309-1 

The Instructor’s Guide, written by the author, provides worked-out solutions for most 
exercises in the text. It is available for download to qualified instructors from the Pearson 
Instructor Resource Center www.pearsonhighered.com/irc. 


Web Support 
The short URLs in the margin of the text provide students with direct access to relevant 
NOTE: content at point-of-use, including: 
When you enter URLs that 
appear in the text, take care m= Expanded explanations of difficult material and links to other sites for additional 
to distinguish the following information about discrete mathematics topics. 
characters: = Computer programs (in C or C++). 
1 = lowercase | 
I = uppercase | The URL goo. g1/f03Crh provides access to all of the above resources plus an errata 
1 =one list for the text. 
O = uppercase O 
0 = zero 
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Chapter 1 


SETS AND LOGIC 


Chapter 1 begins with sets. A set is a collection of objects; order is not taken into 
account. Discrete mathematics is concerned with objects such as graphs (sets of ver- 
tices and edges) and Boolean algebras (sets with certain operations defined on them). 
In this chapter, we introduce set terminology and notation. In Chapter 2, we treat sets 
more formally after discussing proof and proof techniques. However, in Section 1.1, we 
provide a taste of the logic and proofs to come in the remainder of Chapter 1 and in 
Chapter 2. 

Logic is the study of reasoning; it is specifically concerned with whether reasoning 
is correct. Logic focuses on the relationship among statements as opposed to the content 
of any particular statement. Consider, for example, the following argument: 


All mathematicians wear sandals. 
Anyone who wears sandals is an algebraist. 
Therefore, all mathematicians are algebraists. 


Technically, logic is of no help in determining whether any of these statements is true; 
however, if the first two statements are true, logic assures us that the statement, 


All mathematicians are algebraists, 


is also true. 

Logic is essential in reading and developing proofs, which we explore in detail in 
Chapter 2. An understanding of logic can also be useful in clarifying ordinary writing. 
For example, at one time, the following ordinance was in effect in Naperville, Illinois: 
“Tt shall be unlawful for any person to keep more than three dogs and three cats upon 
his property within the city.” Was one of the citizens, who owned five dogs and no cats, 
in violation of the ordinance? Think about this question now; then analyze it (see Exer- 
cise 75, Section 1.2) after reading Section 1.2. 
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Sets 


The concept of set is basic to all of mathematics and mathematical applications. A set 
is simply a collection of objects. The objects are sometimes referred to as elements or 
members. If a set is finite and not too large, we can describe it by listing the elements in 
it. For example, the equation 


A= (1,2, 3,4} (1.1.1) 


describes a set A made up of the four elements 1, 2, 3, and 4. A set is determined by 
its elements and not by any particular order in which the elements might be listed. Thus 
the set A might just as well be specified as A = {1, 3, 4, 2}. The elements making up a 
set are assumed to be distinct, and although for some reason we may have duplicates in 
our list, only one occurrence of each element is in the set. For this reason we may also 
describe the set A defined in (1.1.1) as A = {1, 2, 2, 3, 4}. 

If a set is a large finite set or an infinite set, we can describe it by listing a property 
necessary for membership. For example, the equation 


B= {x | x is a positive, even integer} (1.1.2) 


describes the set B made up of all positive, even integers; that is, B consists of the integers 
2, 4, 6, and so on. The vertical bar “|” is read “such that.” Equation (1.1.2) would be 
read “B equals the set of all x such that x is a positive, even integer.’ Here the property 
necessary for membership is “is a positive, even integer.” Note that the property appears 
after the vertical bar. The notation in (1.1.2) is called set-builder notation. 

A set may contain any kind of elements whatsoever, and they need not be of the 
same “type.” For example, 


{4.5, Lady Gaga, zr, 14} 


is a perfectly fine set. It consists of four elements: the number 4.5, the person Lady Gaga, 
the number z(= 3.1415...), and the number 14. 
A set may contain elements that are themselves sets. For example, the set 


(3, {5, 1}, 12, {r, 4.5, 40, 16}, Henry Cavill} 


consists of five elements: the number 3, the set {5, 1}, the number 12, the set {z, 4.5, 
40, 16}, and the person Henry Cavill. 

Some sets of numbers that occur frequently in mathematics generally, and in dis- 
crete mathematics in particular, are shown in Figure 1.1.1. The symbol Z comes from 
the German word, Zahlen, for integer. Rational numbers are quotients of integers, thus 
Q for quotient. The set of real numbers R can be depicted as consisting of all points on 
a straight line extending indefinitely in either direction (see Figure 1.1.2).' 


Symbol Set Example of Members 
Z Integers —3,0, 2, 145 
Q Rational numbers | —1/3,0, 24/15 
R Real numbers —3, —1.766, 0, 4/15, /2, 2.666...,7 


Figure 1.1.1 Sets of numbers. 


|The real numbers can be constructed by starting with a more primitive notion such as “set” or “integer,” or 
they can be obtained by stating properties (axioms) they are assumed to obey. For our purposes, it suffices to 
think of the real numbers as points on a straight line. The construction of the real numbers and the axioms 
for the real numbers are beyond the scope of this book. 


Example 1.1.1 


Example 1.1.2 


Example 1.1.3 


—1.766 


Figure 1.1.2 The real number line. 


To denote the negative numbers that belong to one of Z, Q, or R, we use the 
superscript minus. For example, Z™ denotes the set of negative integers, namely —1, —2, 
—3,.... Similarly, to denote the positive numbers that belong to one of the three sets, 
we use the superscript plus. For example, Qt denotes the set of positive rational num- 
bers. To denote the nonnegative numbers that belong to one of the three sets, we use the 
superscript nonneg. For example, Z”°””"*s denotes the set of nonnegative integers, namely 
Ops 2 35 3:5.50 

If X is a finite set, we let |X| = number of elements in X. We call |X| the cardi- 
nality of X. There is also a notion of cardinality of infinite sets, although we will not 
discuss it in this book. For example, the cardinality of the integers, Z, is denoted Xo, read 
“aleph null.” Aleph is the first letter of the Hebrew alphabet. 


For the set A in (1.1.1), we have |A| = 4, and the cardinality of A is 4. The cardinality 
of the set {R, Z} is 2 since it contains two elements, namely the two sets RandZ. < 


Given a description of a set X such as (1.1.1) or (1.1.2) and an element x, we can 
determine whether or not x belongs to X. If the members of X are listed as in (1.1.1), 
we simply look to see whether or not x appears in the listing. In a description such as 
(1.1.2), we check to see whether the element x has the property listed. If x is in the set 
X, we write x € X, and if x is not in X, we write x ¢ X. For example, 3 € {1, 2, 3, 4}, but 
3 ¢ {x | x is a positive, even integer}. 

The set with no elements is called the empty (or null or void) set and is denoted 
@. Thus @ = {}. 

Two sets X and Y are equal and we write X = Y if X and Y have the same elements. 
To put it another way, X = Y if the following two conditions hold: 


= For every x, if x € X, thenx € Y, 
and 
@ For every x, ifx € Y, thenx € X. 


The first condition ensures that every element of X is an element of Y, and the second 
condition ensures that every element of Y is an element of X. 


IfA = {1,3, 2} and B = {2, 3,2, 1}, by inspection, A and B have the same elements. 
Therefore A = B. < 
Show that if A = {x | x* +x — 6 = 0} and B = {2, —3}, thenA = B. 


SOLUTION According to the criteria in the paragraph immediately preceding Example 
1.1.2, we must show that for every x, 


if x € A, then x € B, (1.1.3) 


and for every x, 


ifx € B,thenx eA. (1.1.4) 
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Example 1.1.4 


Example 1.1.5 


Example 1.1.6 


Example 1.1.7 


Example 1.1.8 


Example 1.1.9 


To verify condition (1.1.3), suppose that x € A. Then 
vP+x-6=0. 


Solving for x, we find that x = 2 or x = —3. In either case, x € B. Therefore, condition 
(1.1.3) holds. 
To verify condition (1.1.4), suppose that x ¢ B. Then x = 2 orx = —3. Ifx = 2, 
then 
VP+x-6=27742-6=0. 
Therefore, x € A. If x = —3, then 


xv +x—6 = (-3)*+ (-3) -6=0. 


Again, x € A. Therefore, condition (1.1.4) holds. We conclude that A = B. < 


For a set X to not be equal to a set Y (written X ~ Y), X and Y must not have the 
same elements: There must be at least one element in X that is not in Y or at least one 
element in Y that is not in X (or both). 


Let A = {1, 2,3} and B = {2,4}. Then A + B since there is at least one element in A 
(1 for example) that is not in B. [Another way to see that A ¥ B is to note that there is 
at least one element in B (namely 4) that is not in A.] < 


Suppose that X and Y are sets. If every element of X is an element of Y, we say 
that X is a subset of Y and write X C Y. In other words, X is a subset of Y if for every 
x, ifx € X, thenx € Y. 


If C = {1,3} andA = {1, 2, 3, 4}, by inspection, every element of C is an element of A. 
Therefore, C is a subset of A and we write C C A. < 


Let X = {x | x? +x — 2 = 0}. Show that X C Z. 


SOLUTION We must show that for every x, if x € X, thenx € Z.Ifx € X, then 


r+x—-2=0. Solving for x, we obtain x = 1 or x = —2. In either case, x € Z. 
Therefore, for every x, if x € X, then x € Z. We conclude that X is a subset of Z and we 
write X C Z. < 


The set of integers Z is a subset of the set of rational numbers Q. If n € Z, n can 
be expressed as a quotient of integers, for example, n = n/1. Therefore n € Q and 
ZCQ. < 


The set of rational numbers Q is a subset of the set of real numbers R. If x € Q, x cor- 
responds to a point on the number line (see Figure 1.1.2) sox € R. < 


For X to not be a subset of Y, there must be at least one member of X that is not 
inY. 


Let X = {x | 3x7 — x — 2 = 0}. Show that X is not a subset of Z. 


Example 1.1.10 


Example 1.1.11 
Example 1.1.12 


Example 1.1.13 


Example 1.1.14 
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SOLUTION Ifx € X, then 3x*—x—2 = 0. Solving for x, we obtain x = 1 orx = —2/3. 
Taking x = —2/3, we have x € X but x ¢ Z. Therefore, X is not a subset of Z. < 


Any set X is a subset of itself, since any element in X is in X. Also, the empty set 
is a subset of every set. If @ is not a subset of some set Y, according to the discussion 
preceding Example 1.1.9, there would have to be at least one member of © that is not in 
Y. But this cannot happen because the empty set, by definition, has no members. 

Notice the difference between the terms “subset” and “element of.” The set X is a 
subset of the set Y(X C Y), if every element of X is an element of Y; x is an element of 
X(x € X), if x is a member of the set X. 


Let X = {1,3,5, 7} and Y = {1, 2,3, 4,5, 6, 7}. Then X C Y since every element of X 
is an element of Y. But X ¢ Y, since the set X is not a member of Y. Also, 1 € X, but 
1 is not a subset of X. Notice the difference between the number 1| and the set {1}. The 
set {1} is a subset of X. < 


If X is a subset of Y and X does not equal Y, we say that X is a proper subset of 
Y and write X C Y. 


Let C = {1,3} and A = {1, 2,3, 4}. Then C is a proper subset of A since C is a subset 
of A but C does not equal A. We write C C A. < 


Example 1.1.7 showed that Z is a subset of Q. In fact, Z is a proper subset of Q because, 
for example, 1/2 € Q, but 1/2 ¢ Z. < 


Example 1.1.8 showed that Q is a subset of R. In fact, Q is a proper subset of R because, 
for example, we) € R, but af2 ¢ Q. (in Example 2.2.3, we will show that J2 is not the 
quotient of integers). < 


The set of all subsets (proper or not) of a set X, denoted P(X), is called the power 
set of X. 


If A = {a, b, c}, the members of P(A) are 
©, {a}, {b}, {c}, ta, D}, {a, c}, {b, c}, {a, b,c}. 
All but {a, b, c} are proper subsets of A. < 
In Example 1.1.14, |A] = 3 and |P(A)| = 2° = 8. In Section 2.4 (Theorem 2.4.6), 
we will give a formal proof that this result holds in general; that is, the power set of a 
set with n elements has 2” elements. 


Given two sets X and Y, there are various set operations involving X and Y that 
can produce a new set. The set 


XUY={x|xeXorxe Y} 


is called the union of X and Y. The union consists of all elements belonging to either X 
or Y (or both). 
The set 


XNY={x|xeXandxe Y} 
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Example 1.1.15 


Example 1.1.16 


Example 1.1.17 


Example 1.1.18 


Example 1.1.19 


is called the intersection of X and Y. The intersection consists of all elements belonging 
to both X and Y. 
The set 


X—Y={x|xeXandx ¢ Y} 


is called the difference (or relative complement). The difference X — Y consists of all 
elements in X that are not in Y. 


IfA = {1, 3,5} and B = {4, 5, 6}, then 


AUB = {1,3,4,5, 6} 


ANB= {5} 
A—B= {1,3} 
B-A= {4,6}. 
Notice that, in general, A —B 4 B-A. < 
Since Q CR, 
RUQ=R 
RNQ=Q 
Q-R=2¢. 


The set R — Q, called the set of irrational numbers, consists of all real numbers that 
are not rational. < 


We call a set S, whose elements are sets, a collection of sets or a family of sets. 
For example, if 


S = {{1, 2}, {1, 3}, (1, 7, 10}}, 
then S is a collection or family of sets. The set S consists of the sets 
{1, 2}, {1, 3}, {1, 7, 10}. 


Sets X and Y are disjoint if XO Y = @. A collection of sets S is said to be pairwise 
disjoint if, whenever X and Y are distinct sets in S, X and Y are disjoint. 


The sets {1, 4, 5} and {2, 6} are disjoint. The collection of sets S = {{1, 4, 5}, {2, 6}, {3}, 
{7, 8}} is pairwise disjoint. < 


Sometimes we are dealing with sets, all of which are subsets of a set U. This set 
U is called a universal set or a universe. The set U must be explicitly given or inferred 
from the context. Given a universal set U and a subset X of U, the set U — X is called 
the complement of X and is written X. 


LetA = {1, 3, 5}. If U, auniversal set, is specified as U = {1, 2, 3, 4, 5}, thenA = {2, 4}. 
If, on the other hand, a universal set is specified as U = {1, 3,5, 7, 9}, then A = {7,9}. 
The complement obviously depends on the universe in which we are working. < 


Let the universal set be Z. Then Z—, the complement of the set of negative integers, is 
zens the set of nonnegative integers. < 


Go Online 
For more on Venn 
diagrams, see 
goo.gl/F/7b35e 


1.1 @ Sets 7 


Venn diagrams provide pictorial views of sets. Ina Venn diagram, a rectangle de- 
picts a universal set (see Figure 1.1.3). Subsets of the universal set are drawn as circles. 
The inside of a circle represents the members of that set. In Figure 1.1.3 we see two sets 
A and B within the universal set U. Region | represents (A U B), the elements in neither 
A nor B. Region 2 represents A — B, the elements in A but not in B. Region 3 represents 
A 1 B, the elements in both A and B. Region 4 represents B — A, the elements in B but 


notin A. 
GOs) 


Figure 1.1.3 A Venn 
diagram. 


U 


Example 1.1.20 Particular regions in Venn diagrams are depicted by shading. The set A U B is shown in 


A B 


CO) 


Figure 1.1.4 A Venn 
diagram of A U B. 


Example 1.1.21 


Figure 1.1.4, and Figure 1.1.5 represents the set A — B. < 


U U 


A B 


op 


Figure 1.1.5 A Venn 
diagram of A — B. 


CALC PSYCH 


BN 
QD 


9 COMPSCI 


Figure 1.1.6 A Venn diagram 
of three sets CALC, PSYCH, 
and COMPSCI. The numbers 
show how many students belong 
to the particular region depicted. 


To represent three sets, we use three overlapping circles (see Figure 1.1.6). 


Among a group of 165 students, 8 are taking calculus, psychology, and computer science; 
33 are taking calculus and computer science; 20 are taking calculus and psychology; 
24 are taking psychology and computer science; 79 are taking calculus; 83 are taking 
psychology; and 63 are taking computer science. How many are taking none of the three 
subjects? 


SOLUTION Let CALC, PSYCH, and COMPSCI denote the sets of students taking 
calculus, psychology, and computer science, respectively. Let U denote the set of all 
165 students (see Figure 1.1.6). Since 8 students are taking calculus, psychology, and 
computer science, we write 8 in the region representing CALC 7 PSYCH NM COMPSCI. 
Of the 33 students taking calculus and computer science, 8 are also taking psychol- 
ogy; thus 25 are taking calculus and computer science but not psychology. We write 
25 in the region representing CALC N PSYCH M COMPSCI. Similarly, we write 12 in 
the region representing CALC ™ PSYCH 1 COMPSCI and 16 in the region repre- 
senting CALC M PSYCH M COMPSCI. Of the 79 students taking calculus, 45 have 
now been accounted for. This leaves 34 students taking only calculus. We write 34 in 
the region representing CALC M PSYCH N COMPSCI. Similarly, we write 47 in the 
region representing CALC 1 PSYCH M COMPSCI and 14 in the region representing 
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CALC mM PSYCH M COMPSCI. At this point, 156 students have been accounted for. 
This leaves 9 students taking none of the three subjects. 


Venn diagrams can also be used to visualize certain properties of sets. For exam- 
ple, by sketching both (A U B) and A 1 B (see Figure 1.1.7), we see that these sets are 
equal. A formal proof would show that for every x, if x € (A U B), then x € ANB, and if 
x € ANB, then x € (A UB). We state many useful properties of sets as Theorem 1.1.22. 


Theorem 1.1.22 Let U be a universal set and let A, B, and C be subsets of U. The following properties 


hold. 
U 
A B (a) Associative laws: 


C0) (AUB)UC=AU(BUQ), (ANB)NC=AN(BNO 
(b) Commutative laws: 


ANU) SIAN, ANE SIR INA 


The 
shaded region depicts (c) Distributive laws: 
both (A U B) and 
AB; thus these sets AN(BUC)=(ANB)U(ANC), AUBNC)=AVUB)NAUC 
are equal. (d) Identity laws: 


AUGSA, ANW=A 


(e) Complement laws: 


(f) Idempotent laws: 


(g) Bound laws: 


(h) Absorption laws: 


AUB) =A. AAU B) —A 


(i) Involution law: 


A=At 
Gj) 0/1 laws: 
Bev, C=e 
For a biography of 
De Morgan, see (k) De Morgan’s laws for sets: 
ero FIRS Te (AUB) =ANB, (NB) =AUB 


Proof The proofs are left as exercises (Exercises 46-56, Section 2.1) to be done after 
more discussion of logic and proof techniques. < 


We define the union of a collection of sets S to be those elements x belonging to 
at least one set X in S. Formally, 


US= {x |x eX for some X € S}. 


1A denotes the complement of the complement of A, that is, A= (A). 


Example 1.1.23 


Example 1.1.24 


Example 1.1.25 


Example 1.1.26 
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Similarly, we define the intersection of a collection of sets S to be those elements x 
belonging to every set X in S. Formally, 
AS = {x |x eX for all X € S}. 


Let S = {{1, 2}, {1, 3}, {1, 7, 10}}. Then US = {1, 2, 3, 7, 10} since each of the elements 
1, 2, 3, 7, 10 belongs to at least one set in S, and no other element belongs to any of the 


sets in S. Also NS = {1} since only the element | belong to every set in S. < 
If 
S = {Aj, Ao, »An}, 
we write 


and if 
S = {Aj, A2,...}, 
we write 
CO oe) 
Us=U4, (\s=fai 
i=l i=l 
For i > 1, define A; = {i,i + 1,...} and S = {A),Ao,...}. As examples, 


A, = {1,2,3,...} and Ay = {2, 3,4,...}. Then 


Us=( a= 1.2... (\s=(\a=e. : 
= i=l 


A partition of a set X divides X into nonoverlapping subsets. More formally, a 
collection S of nonempty subsets of X is said to be a partition of the set X if every 
element in X belongs to exactly one member of S. Notice that if S is a partition of X, S 
is pairwise disjoint and US = X. 


Since each element of X = {1,2,3,4,5,6,7,8} is in exactly one member of 
S = {{1, 4, 5}, {2, 6}, {3}, {7, 8}}, Sis a partition of X. < 


At the beginning of this section, we pointed out that a set is an unordered collection 
of elements; that is, a set is determined by its elements and not by any particular order 
in which the elements are listed. Sometimes, however, we do want to take order into 
account. An ordered pair of elements, written (a, b), is considered distinct from the or- 
dered pair (b, a), unless, of course, a = b. To put it another way, (a, b) = (c, d) precisely 
when a = c and b = d. If X and Y are sets, we let X x Y denote the set of all ordered 
pairs (x, y) where x € X and y € Y. We call X x Y the Cartesian product of X and Y. 


If X = {1, 2, 3} and Y = {a, b}, then 
Xx Y = {(1,a), (1,b), (2, a), (2, 6), (3,4), 3, b)} 
Yx X = {(a, 1), (b, 1), (a, 2), (b, 2), (a, 3), (b, 3)} 
Xx X = {(1, 1), (1, 2), (1, 3), (2, 1), (2, 2), (2, 3), 3, 1), G, 2), (3, 3)} 
Y x Y = {(a,a), (a,b), (b, a), (b, b)}. < 
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Example 1.1.27 


Example 1.1.28 


Example 1.1.29 


Example 1.1.26 shows that, in general, X x YAY x X. 

Notice that in Example 1.1.26, |X x Y| = |X| + |Y| (both are equal to 6). The reason 
is that there are 3 ways to choose an element of X for the first member of the ordered 
pair, there are 2 ways to choose an element of Y for the second member of the ordered 
pair, and 3 - 2 = 6 (see Figure 1.1.8). The preceding argument holds for arbitrary finite 
sets X and Y; it is always true that |X x Y| = |X| - |Y|. 


(1,4) (1,5) (2,4) (2,5) (3,4) 3,5) 
Figure 1.1.8 |X x Y| = |X| + |Y|, where X = {1, 2, 3} and Y = {a, b}. There 
are 3 ways to choose an element of X for the first member of the ordered pair 
(shown at the top of the diagram) and, for each of these choices, there are 
2 ways to choose an element of Y for the second member of the ordered pair 


(shown at the bottom of the diagram). Since there are 3 groups of 2, there are 
3 - 2 = 6 elements in X x Y (labeled at the bottom of the figure). 


A restaurant serves four appetizers, 
r=ribs, nm=nachos, s=shrimp, ff = fried cheese, 
and three entrees, 
c=chicken, b=beef, t= trout. 


If we let A = {r,n,s,f} and E = {c, b, t}, the Cartesian product A x E lists the 12 
possible dinners consisting of one appetizer and one entree. < 


Ordered lists need not be restricted to two elements. An n-tuple, written 
(a1, A2,..., Gy), takes order into account; that is, 


(a, @, seey Qn) _ (hi, bo, ten bn) 
precisely when 
a, = bi, ay = bo, ..., An = bn. 
The Cartesian product of sets X;, X2,..., Xn is defined to be the set of all n-tuples 
(x1, X2,...,X,) where x; € X; fori = 1,...,; it is denoted X,; x X2 x --- x Xp. 
If X = {1, 2}, Y = {a, b}, and Z = {a, 6}, then 
XxY¥xZ=({(l,a,q@), (a, B), 1,5, a), (1, b, B), (2,4, a), (2, a, B), 
(2, b, a), (2, b, B)}. < 
Notice that in Example 1.1.28, |X x Y x Z| = |X| - |Y| + |Z]. In general, 

|X) x XX: x X| = |X| “ |X2| ++ ([Xnl. 
We leave the proof of this last statement as an exercise (see Exercise 27, Section 2.4). 
If A is a set of appetizers, FE is a set of entrees, and D is a set of desserts, the Cartesian 


product A x E x D lists all possible dinners consisting of one appetizer, one entree, and 
one dessert. < 
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1.1 Problem-Solving Tips 


To verify that two sets A and B are equal, written A = B, show that for every x, if 
x € A, then x € B, and if x € B, thenx € A. 


To verify that two sets A and B are not equal, written A # B, find at least one 
element that is in A but not in B, or find at least one element that is in B but not 
in A. One or the other conditions suffices; you need not (and may not be able to) 
show both conditions. 


To verify that A is a subset of B, written A C B, show that for every x, if x € A, 
then x € B. Notice that if A is a subset of B, it is possible that A = B. 


= To verify that A is nota subset of B, find at least one element that is in A but not in B. 


= To verify that A is a proper subset of B, written A C B, verify that A is a subset of 


Bas described previously, and that A + B, that is, that there is at least one element 
that is in B but not in A. 


To visualize relationships among sets, use a Venn diagram. A Venn diagram can 
suggest whether a statement about sets is true or false. 


A set of elements is determined by its members; order is irrelevant. On the other 
hand, ordered pairs and n-tuples take order into account. 


1.1 Review Exercises 


11, 
2 
3. 


onan nun 


What is a set? 
What is set notation? 


Describe the sets Z, Q, R, Zt, Qt, Rt, Z~, Q°>, R™, Z2""’s, 
qrnnes and R”""“S, and give two examples of members of 
each set. 


. If X is a finite set, what is |X|? 

. How do we denote x is an element of the set X? 

. How do we denote x is not an element of the set X? 
. How do we denote the empty set? 


. Define set X is equal to set Y. How do we denote X is equal 


to Y? 


. Explain a method of verifying that sets X and Y are equal. 
. Explain a method of verifying that sets X and Y are not equal. 


. Define X is a subset of Y. How do we denote X is a subset 


of Y? 


. Explain a method of verifying that X is a subset of Y. 
. Explain a method of verifying that X is not a subset of Y. 


. Define X is a proper subset of Y. How do we denote X is a 


proper subset of Y? 


15. 


19, 


20. 


21. 
22. 
23. 


24. 
25. 
26. 
27. 


Explain a method of verifying that X is a proper subset 
of Y. 


. What is the power set of X? How is it denoted? 
. Define X union Y. How is the union of X and Y denoted? 


. If Sis a family of sets, how do we define the union of S? How 


is the union denoted? 


Define X intersect Y. How is the intersection of X and Y 
denoted? 


If S is a family of sets, how do we define the intersection of 
S? How is the intersection denoted? 


Define X and Y are disjoint sets. 
What is a pairwise disjoint family of sets? 


Define the difference of sets X and Y. How is the difference 
denoted? 


What is a universal set? 
What is the complement of the set X? How is it denoted? 
What is a Venn diagram? 


Draw a Venn diagram of three sets and identify the set repre- 
sented by each region. 


‘Exercise numbers in color indicate that a hint or solution appears at the back of the book in the section 


following the References. 
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28. State the associative laws for sets. 
29. State the commutative laws for sets. 
30. State the distributive laws for sets. 
31. State the identity laws for sets. 

32. State the complement laws for sets. 
33. State the idempotent laws for sets. 
34. State the bound laws for sets. 


35. State the absorption laws for sets. 


36. State the involution law for sets. 
37. State the 0/1 laws for sets. 

38. State De Morgan’s laws for sets. 
39. What is a partition of a set X? 


40. Define the Cartesian product of sets X and Y. How is this 
Cartesian product denoted? 


41. Define the Cartesian product of the sets X;, X2,..., Xn. How 
is this Cartesian product denoted? 


In Exercises 1-16, let the universe be the set U = {1, 2,3, ..., 10}. 
Let A = {1,4,7, 10}, B = {1, 2,3, 4, 5}, and C = {2, 4, 6, 8}. List 
the elements of each set. 


1. AUB 2. BNC 

3. A—B 4. B—A 
5.A 6. U-C 
7.U 8. AUD 

9. BANS 10. AUU 

11. BNU 12, AN(BUC) 
13. BN(C—A) 

14. (ANB) —C 

15. ANBUC 


16. (AUB) —(C—B) 


In Exercises 17-27, let the universe be the set Z*. Let X = 
{1, 2, 3,4, 5} and let Y be the set of positive, even integers. In set- 
builder notation, Y = {2n | n € Z*}. In Exercises 18-27, give a 
mathematical notation for the set by listing the elements if the set is 
finite, by using set-builder notation if the set is infinite, or by using 
a predefined set such as ©. 


17. Describe Y in words. 


18. X 19. Y 

20. XNY 21. XUY 
22. XNY 23. XUY 
24. XNY 25. XUY 
26. XNY 27. XUY 


28. What is the cardinality of 0? 

29. What is the cardinality of {@}? 

30. What is the cardinality of {a, b, a, c}? 

31. What is the cardinality of {{a}, {a, b}, {a, c}, a, b}? 

In Exercises 32-35, show, as in Examples 1.1.2 and 1.1.3, that 
A=B. 

32. A = (3,2, 1}, B= {1, 2, 3} 

33. C = {1, 2,3}, D= {2,3,4}, A = {2,3}, B=CND 


34, A= {1,2,3}, B={n|ne Z* andn* < 10} 
35. A={x|2x?-—4x4+4=1}, B= {1,3} 


In Exercises 36-39, show, as in Example 1.1.4, that A # B. 
36. A={I, 2,3}, B=@ 

37. A= {1,2}, B= {x| x — 2x7 —x+2=0} 

38. A = {1,3,5}, B={n|ne Zt andr? —1 <n} 

39. B= {1,2,3,4}, C= {2,4,6,8}, A=BNC 


In Exercises 40-43, determine whether each pair of sets is equal. 
40. {1, 2, 2, 3}, {1, 2, 3} 

41. {1, 1, 3}, (3,3, 1} 42. {x |x? 4+x=2}, {1,-l} 
43. {x |x € Rand 0 < x < 2}, {1, 2} 

In Exercises 44-47, show, as in Examples 1.1.5 and 1.1.6, that 
ACB. 

44. A= {1,2}, B= (3,2, 1} 

45. A= {1,2}, B= {x| 2° — 6x? + 1lx = 6} 

46. A= {1} x {1,2}, B= {1} x {1, 2, 3} 

47. A= {(2n|neZ*}, B={n|neZ*} 


In Exercises 48-51, show, as in Example 1.1.9, that A is not a sub- 
set of B. 

48. A = {1,2, 3}, B= {1, 2} 

49, A = {x |x? — 2x? —x+2=0}, B= {1,2} 


50. A={1, 2,3, 4}, C={5, 6,7, 8}, B={n|ne€ Aandn+m=8 
for some m € C} 


51. A={l,2,3}, B=@ 


In Exercises 52-59, draw a Venn diagram and shade the given set. 


52. ANB 53. A-B 
54. BU(B—A) 55. (AUB)—B 
56. BN (CUA) 57. (AUB)N(C—A) 


58. (CNA) — (B—A)) NC 
59. (B— C)U ((B—A) N(CUB)) 


60. A television commercial for a popular beverage showed the 
following Venn diagram 


Great Taste Less Filling 


What does the shaded area represent? 


Exercises 61-65 refer to a group of 191 students, of which 10 are 
taking French, business, and music; 36 are taking French and busi- 
ness; 20 are taking French and music; 18 are taking business and 
music; 65 are taking French; 76 are taking business; and 63 are 
taking music. 


61. How many are taking French and music but not business? 
62. How many are taking business and neither French nor music? 
63. How many are taking French or business (or both)? 


64. How many are taking music or French (or both) but not busi- 
ness? 


65. How many are taking none of the three subjects? 


66. A television poll of 151 persons found that 68 watched 
“Law and Disorder”; 61 watched “25”; 52 watched “The 
Tenors”; 16 watched both “Law and Disorder” and “25”; 
25 watched both “Law and Disorder” and “The Tenors”; 
19 watched both “25” and “The Tenors”; and 26 watched 
none of these shows. How many persons watched all three 
shows? 


67. In a group of students, each student is taking a mathemat- 
ics course or a computer science course or both. One-fifth of 
those taking a mathematics course are also taking a computer 
science course, and one-eighth of those taking a computer 
science course are also taking a mathematics course. Are 
more than one-third of the students taking a mathematics 
course? 


In Exercises 68-71, let X = {1,2} and Y = {a, b, c}. List the ele- 


ments in each set. 


68. X x Y 
70. Xx X 


69. Yx X 
71.YxyY 


In Exercises 72-75, let X = {1,2}, Y = {a}, and Z = {a, B}. List 
the elements of each set. 


72 Xx Y XZ. 
74. XxXxX 


73. Xx YxY 
75. Y¥xXxYxZ 


In Exercises 76-82, give a geometric description of each set in 
words. Consider the elements of the sets to be coordinates. For 
example, R x Z is the set {(x,n) |x € Rand n € Z}. Interpreting 
the ordered pairs (x, n) as coordinates in the plane, the graph of all 
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such ordered pairs is the set of all parallel horizontal lines spaced 
one unit apart, one of which passes through (0,0). 

76. RxR 

77. Z2xR 

78. R x Zronnes 

79. ZxZ 

80. Rx RxR 

81. RxRxZ 

82. Rx ZxZ 


In Exercises 83-86, list all partitions of the set. 


$3. {1} 84. {1, 2} 

85. {a, b, c} 86. {a, b, c, d} 

In Exercises 87-92, answer true or false. 

87. {x} C {x} 88. {x} € {x} 

89. {x} € {x, {x}} 90. {x} S {x, {x} 
91. {2} C P({1, 2}) 92. {2} € P({1, 2}) 


93. List the members of P ({a, b}). Which are proper subsets of 
{a, b}? 


94. List the members of P ({a, b, c, d}). Which are proper subsets 
of {a, b, c, d}? 


95. If X has 10 members, how many members does P(X) have? 
How many proper subsets does X have? 


96. If X has n members, how many proper subsets does X have? 


In Exercises 97-100, what relation must hold between sets A and 
B in order for the given condition to be true? 


97. ANB=A 
99. ANU=2 


98. AUB=A 
100. ANB=B 


The symmetric difference of two sets A and B is the set 


AAB=(AUB)—(ANB). 


101. IfA = {1, 2, 3} and B = {2, 3, 4, 5}, find A A B. 

102. Describe the symmetric difference of sets A and B in 
words. 

103. Given a universe U, describe AAA, A AA, UAA, and @ AA. 


104. Let C be acircle and let D be the set of all diameters of C. What 
is MD? (Here, by “diameter” we mean a line segment through 
the center of the circle with its endpoints on the circumference 
of the circle.) 


Let P denote the set of integers greater than |. For i > 2, define 
Xj = {ik | k € P}. 


Describe P — lls Xj. 


7A starred exercise indicates a problem of above-average difficulty. 
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1.2 Propositions 


Which of sentences (a)-(f) are either true or false (but not both)? 


(a) The only positive integers that divide! 7 are 1 and 7 itself. 

(b) Alfred Hitchcock won an Academy Award in 1940 for directing Rebecca. 
(c) For every positive integer n, there is a prime number" larger than n. 

(d) Earth is the only planet in the universe that contains life. 

(e) Buy two tickets to the “Unhinged Universe” rock concert for Friday. 

(f) x+4=6. 


Sentence (a), which is another way to say that 7 is prime, is true. 

Sentence (b) is false. Although Rebecca won the Academy Award for best picture 
in 1940, John Ford won the directing award for The Grapes of Wrath. It is a surprising 
fact that Alfred Hitchcock never won an Academy Award for directing. 

Sentence (c), which is another way to say that the number of primes is infinite, is 
true. 

Sentence (d) is either true or false (but not both), but no one knows which at this 
time. 

Sentence (e) is neither true nor false [sentence (e) is a command]. 

The truth of equation (f) depends on the value of the variable x. 

A sentence that is either true or false, but not both, is called a proposition. Sen- 
tences (a)—(d) are propositions, whereas sentences (e) and (f) are not propositions. A 
proposition is typically expressed as a declarative sentence (as opposed to a question, 
command, etc.). Propositions are the basic building blocks of any theory of logic. 

We will use variables, such as p, g, and r, to represent propositions, much as we 
use letters in algebra to represent numbers. We will also use the notation 


prl+tl=3 


to define p to be the proposition 1 + 1 = 3. 

In ordinary speech and writing, we combine propositions using connectives such 
as and and or. For example, the propositions “It is raining” and “It is cold” can be com- 
bined to form the single proposition “It is raining and it is cold.” The formal definitions 
of and and or follow. 


Definition 1.2.1 >  Letp and q be propositions. 
The conjunction of p and q, denoted p A q, is the proposition 


p and gq. 
The disjunction of p and q, denoted p Vv q, is the proposition 


p o gq. < 


Example 1.2.2  Ifp: It is raining, and gq: It is cold, then the conjunction of p and q is 


P Aq: Itis raining and it is cold. (1.2.1) 


The disjunction of p and q is 


t“Divides” means “divides evenly.” More formally, we say that a nonzero integer d divides an integer m if there 
is an integer g such that m = dq. We call g the quotient. We will explore the integers in detail in Chapter 5. 
¥An integer n > 1 is prime if the only positive integers that divide n are | and n itself. For example, 2, 3, and 
11 are prime numbers. 


Example 1.2.4 
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p\ q: Itis raining or it is cold. < 


The truth value of the conjunction p A g is determined by the truth values of p 
and q, and the definition is based upon the usual interpretation of “and.” Consider the 
proposition (1.2.1) of Example 1.2.2. If it is raining (i.e., p is true) and it is also cold 
(i.e., g is also true), then we would consider the proposition (1.2.1) to be true. However, 
if it is not raining (i.e., p is false) or it is not cold (i.e., g is false) or both, then we would 
consider the proposition (1.2.1) to be false. 

The truth values of propositions such as conjunctions and disjunctions can be de- 
scribed by truth tables. The truth table of a proposition P made up of the individual 
propositions pi, ..., Pn lists all possible combinations of truth values for pi, ..., Pn, 
T denoting true and F denoting false, and for each such combination lists the truth value 
of P. We use a truth table to formally define the truth value of p A q. 

A truth table for a proposition P made up of n propositions has r = 2” rows. Tra- 
ditionally, for the first proposition, the first r/2 rows list T and the last r/2 rows list F. 
The next proposition has r/4 T’s alternate with r/4 F’s. The next proposition has r/8 T’s 
alternate with r/8 F’s, and so on. For example, a proposition P made up of three propo- 
sitions pi, p2, and p3 has 8 = 2? rows. Proposition p; will list 4 = 8/2 T’s followed by 
4 F’s. Proposition py» will list 2 = 8/4 T’s followed by 2 F’s, followed by 2 T’s, followed 
by 2 F’s. Proposition p3 will have one T, followed by one F, followed by one T, and so 
on. The truth table without the truth values of P would be 


Pi P2 P3 P 

T T T 

T T F 

A F T Here is where the 

T F F truth values of P go. 
F T T 

F au F 

F F T 

F F F 


Notice that all possible combinations of truth values for pj, po, p3 are listed. 


Definition 1.2.3 > The truth value of the proposition p A q is defined by the 
truth table 


v 
Ss 


PA 


Moa4 
MamyA 
aA -m7H 


< 


Definition 1.2.3 states that the conjunction p A q is true provided that p and q are 
both true; p A q is false otherwise. 


If p: A decade is 10 years, and gq: A millennium is 100 years, then p is true, q is false (a 
millennium is 1000 years), and the conjunction, 


Pp Aq: A decade is 10 years and a millennium is 100 years, 


is false. < 
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Example 1.2.5 


Example 1.2.7 


Example 1.2.8 


Most programming languages define “and” exactly as in Definition 1.2.3. For example, 
in the Java programming language, (logical) “and” is denoted &&, and the expression 


x < 10 &&y>4 


is true precisely when the value of the variable x is less than 10 (.e.,x < 10 is true) and 
the value of the variable y is greater than 4 (i.e., y > 4 is also true). < 


The truth value of the disjunction p V q is also determined by the truth values of 
p and q, and the definition is based upon the “inclusive” interpretation of “or.’ Consider 
the proposition, 


p\ q: Itis raining or it is cold, (1.2.2) 


of Example 1.2.2. If it is raining (i.e., p is true) or it is cold (i.e., g is also true) or both, then 
we would consider the proposition (1.2.2) to be true (i.e., p V q is true). If it is not raining 
(i.e., p is false) and it is not cold (i.e., g is also false), then we would consider the propo- 
sition (1.2.2) to be false (i.e., p V q is false). The inclusive-or of propositions p and q is 
true if p or q, or both, is true, and false otherwise. There is also an exclusive-or (see Exer- 
cise 67) that defines p exor q to be true if p or q, but not both, is true, and false otherwise. 


Definition 1.2.6 > The truth value of the proposition pv q, called the inclusive- 
or of p and q, is defined by the truth table 


P 4)|PV4 


mMHaaa 


< 
If p: A millennium is 100 years, and g: A millennium is 1000 years, then p is false, g is 
true, and the disjunction, 

pV gq: Amillennium is 100 years or a millennium is 1000 years, 
is true. < 
Most programming languages define (inclusive) “or” exactly as in Definition 1.2.6. For 


example, in the Java programming language, (logical) “or” is denoted | |, and the ex- 
pression 


x< 10 || y>4 
is true precisely when the value of the variable x is less than 10 (i.e.,x < 10 is true) or 


the value of the variable y is greater than 4 (i.e., y > 4 is true) or both. < 


In ordinary language, propositions being combined (e.g., p and g combined to 
give the proposition p V q) are normally related; but in logic, these propositions are not 
required to refer to the same subject matter. For example, in logic, we permit propositions 
such as 


3 < 5 or Paris is the capital of England. 


Example 1.2.10 


Example 1.2.11 


Example 1.2.12 
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Logic is concerned with the form of propositions and the relation of propositions to each 
other and not with the subject matter itself. (The given proposition is true because 3 < 5 
is true.) 

The final operator on a proposition p that we discuss in this section is the negation 
of p. 


Definition 1.2.9 >» The negation of p, denoted —p, is the proposition 
not p. 


The truth value of the proposition —p is defined by the truth table 


Geos laa 
Iso [2 | 


< 
In English, we sometimes write —p as “It is not the case that p.” For example, if 
p: Paris is the capital of England, 
the negation of p could be written 
—p: Itis not the case that Paris is the capital of England, 
or more simply as 


=p: Paris is not the capital of England. 


If 
p: a was calculated to 1,000,000 decimal digits in 1954, 
the negation of p is the proposition 
=p: a was not calculated to 1,000,000 decimal digits in 1954. 


It was not until 1973 that 1,000,000 decimal digits of 2 were computed; so, p is false. 
(Since then over 12 trillion decimal digits of 7 have been computed.) Since p is false, 
—p is true. < 


Most programming languages define “not” exactly as in Definition 1.2.9. For example, 
in the Java programming language, “not” is denoted !, and the expression 


'(x < 10) 


is true precisely when the value of the variable x is not less than 10 (1.e., x is greater than 
or equal to 10). < 


In expressions involving some or all of the operators —, A, and V, in the absence of 
parentheses, we first evaluate —, then A, and then V. We call such a convention operator 
precedence. In algebra, operator precedence tells us to evaluate - and / before + and —. 


Given that proposition p is false, proposition g is true, and proposition r is false, deter- 
mine whether the proposition =p V g A ris true or false. 
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Example 1.2.13 


SOLUTION We first evaluate sp, which is true. We next evaluate g A r, which is false. 
Finally, we evaluate =p V qg A r, which is true. < 


Searching the Web A variety of Web search engines are available (e.g., Google, Yahoo!, 
Baidu) that allow the user to enter keywords that the search engine then tries to match 
with Web pages. For example, entering mathematics produces a (huge!) list of pages 
that contain the word “mathematics.” Some search engines allow the user to use and, or, 
and not operators to combine keywords (see Figure 1.2.1), thus allowing more complex 
searches. In the Google search engine, and is the default operator so that, for exam- 
ple, entering discrete mathematics produces a list of pages containing both of the words 
“discrete” and “mathematics.” The or operator is OR, and the not operator is the mi- 
nus sign —. Furthermore, enclosing a phrase, typically with embedded spaces, in double 
quotation marks causes the phrase to be treated as a single word. For example, to search 
for pages containing the keywords 


“Shonda Rhimes” and (Grey’s or Scandal) and (not Murder), 


the user could enter 


“Shonda Rhimes” Grey's OR Scandal -Murder 


A 
Go gle “Shonda Rhimes* Grey's OR Scandal -Murder 2. | Pa | signin | 


Shonda Rhimes Says 2016 Election Is 
Mirroring Her Show ‘Scandal’ 
BE Humngton Post -2.da 


cee Shorda Rhimes says tes 


More news for “Shonda Rhimes" Grey's OR Scandal -Murder 


Images for "Shonda Rhimes” Grey's OR Scandal 


0) Ea) 


More images for "Shonda Rhimes* Grey's OR Scandal -Murder 


and Fitz Had 


< 
Google and the Google logo are registered trademarks 


of Google Inc., used with permission. 


Figure 1.2.1 The Google search engine, which allows the user to use and (space), or (OR), 
and not (—) operators to combine keywords. As shown, Google found about 573,000 Web 
pages containing “Shonda Rhimes” and (Grey’s or Scandal) and 

(not Murder). 


1.2 Problem-Solving Tips 


Although there may be a shorter way to determine the truth values of a proposition P 
formed by combining propositions p;, ..., P, using operators such as — and V, a truth 
table will always supply all possible truth values of P for various truth values of the 
constituent propositions p),..., Dp. 
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1.2 Review Exercises 


1. What is a proposition? 
2. What is a truth table? 
3. What is the conjunction of p and g? How is it denoted? 


4. Give the truth table for the conjunction of p and q. 


5. What is the disjunction of p and g? How is it denoted? 
6. Give the truth table for the disjunction of p and q. 
7. What is the negation of p? How is it denoted? 


8. Give the truth table for the negation of p. 


Determine whether each sentence in Exercises 1-12 is a proposi- 
tion. If the sentence is a proposition, write its negation. (You are not 
being asked for the truth values of the sentences that are proposi- 
tions.) 


1.24+5= 19. 2.6+9= 15. 
3 ¥+9'= 15: 4. 7 =3.14 


5. Waiter, will you serve the nuts—I mean, would you serve the 
guests the nuts? 


6. For some positive integer n, 19340 =n - 17. 


7. Audrey Meadows was the original “Alice” in “The Honey- 
mooners.” 


8. Peel me a grape. 

9. The line “Play it again, Sam” occurs in the movie Casablanca. 
10. Every even integer greater than 4 is the sum of two primes. 
11. The difference of two primes. 

x12. This statement is false. 
Exercises 13-16 refer to a coin that is flipped 10 times. Write the 
negation of the proposition. 
13. Ten heads were obtained. 
14. Some heads were obtained. 
15. Some heads and some tails were obtained. 
16. At least one head was obtained. 
Given that proposition p is false, proposition q is true, and proposi- 


tion r is false, determine whether each proposition in Exercises 17— 
22 is true or false. 
17. pVq 18. =p Vv 7q 
19. =pvq 

21. =(p V q) A (-=pVvr) 
22. (pV ar) A7A((q Vr) V 7(r V p)) 


20. =pV 7A(q Ar) 


Write the truth table of each proposition in Exercises 23-30. 


23. pA7q 24. (=pV 7q) Vp 

25. (pV gq) Ap 26. (PAG) An7p 

27. (PAG) VY (7pVv 4) 28. =(p Aq) V (r A7p) 
29. (DV g) \ (=p V q) A (PV 7g) A (pV 79) 

30. =(p Aq) V (-qG Vr) 


In Exercises 31-33, represent the given proposition symbolically 
by letting 

pe 29, gt DeT7, 72 37. 
Determine whether each proposition is true or false. 
31. 5 <9and9 <7. 
32. It is not the case that (5 < 9 and9 < 7). 
33. 5 < 9 or it is not the case that (9 < 7 and 5 < 7). 
In Exercises 34-39, formulate the symbolic expression in words 
using 


p: Lee takes computer science. 
q: Lee takes mathematics. 


34. =p 
37. pV 7q 


35. pAq 36. pV q 


38. pA 7q 39. =p A -=q 


In Exercises 40-44, formulate the symbolic expression in words 
using 
Pp: You play football. 


q: You miss the midterm exam. 
r: You pass the course. 


40. pAq 
43. A(pVqg)vVr 


41. -gAr 
44. (pAq)V (-qAr) 


In Exercises 45-49, formulate the symbolic expression in words 
using 


42. pVqvr 


Pp: Today is Monday. 
q: Itis raining. 
r: It is hot. 


45. pvq 
47. ~(pV q) Ar 
49. (PAV NAVY GV P)) 


In Exercises 50-55, represent the proposition symbolically by 
letting 


46. =p A (qv r) 
48. (pA g) A7(rV p) 


Pp: There is a hurricane. 
q: It is raining. 


50. There is no hurricane. 
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51. There is a hurricane and it is raining. 

52. There is a hurricane, but it is not raining. 

53. There is no hurricane and it is not raining. 

54, Either there is a hurricane or it is raining (or both). 

55. Either there is a hurricane or it is raining, but there is no hur- 
ricane. 

In Exercises 56-60, represent the proposition symbolically by let- 

ting 


p: You run 10 laps daily. 
q: You are healthy. 
r: You take multi-vitamins. 


56. You run 10 laps daily, but you are not healthy. 

57. You run 10 laps daily, you take multi-vitamins, and you are 
healthy. 

58. You run 10 laps daily or you take multi-vitamins, and you are 
healthy. 

59. You do not run 10 laps daily, you do not take multi-vitamins, 
and you are not healthy. 


60. Either you are healthy or you do not run 10 laps daily, and you 
do not take multi-vitamins. 


In Exercises 61-66, represent the proposition symbolically by 
letting 


p: You heard the “Flying Pigs” rock concert. 
q: You heard the “Y2K” rock concert. 
r: You have sore eardrums. 


61. You heard the “Flying Pigs” rock concert, and you have sore 
eardrums. 


62. You heard the ‘Flying Pigs” rock concert, but you do not have 
sore eardrums. 


63. You heard the “Flying Pigs” rock concert, you heard the 
“Y2K” rock concert, and you have sore eardrums. 


1.3 


64. You heard either the “Flying Pigs” rock concert or the “Y2K” 
rock concert, but you do not have sore eardrums. 


65. You did not hear the “Flying Pigs” rock concert and you did 
not hear the “Y2K” rock concert, but you have sore eardrums. 


66. Itis not the case that: You heard the “Flying Pigs” rock concert 
or you heard the “Y2K” rock concert or you do not have sore 
eardrums. 


67. Give the truth table for the exclusive-or of p and g in which 
p exor q is true if either p or q, but not both, is true. 


In Exercises 68-74, state the meaning of each sentence if “or” is 
interpreted as the inclusive-or; then, state the meaning of each sen- 
tence if “or” is interpreted as the exclusive-or (see Exercise 67). 
In each case, which meaning do you think is intended? 


68. To enter Utopia, you must show a driver’s license or a passport. 


69. To enter Utopia, you must possess a driver’s license or a 
passport. 


70. The prerequisite to data structures is a course in Java or C++. 
71. The car comes with a cupholder that heats or cools your drink. 
72. We offer $1000 cash or 0 percent interest for two years. 

73. Do you want fries or a salad with your burger? 


74. The meeting will be canceled if fewer than 10 persons sign up 
or at least 3 inches of snow falls. 


75. At one time, the following ordinance was in effect in 
Naperville, Illinois: “It shall be unlawful for any person to 
keep more than three [3] dogs and three [3] cats upon his prop- 
erty within the city.” Was Charles Marko, who owned five dogs 
and no cats, in violation of the ordinance? Explain. 


76. Write acommand to search the Web for national parks in North 
or South Dakota. 

77. Write a command to search the Web for information on lung 
disease other than cancer. 


78. Write acommand to search the Web for minor league baseball 
teams in Illinois that are not in the Midwest League. 


Conditional Propositions 


and Logical Equivalence 


The dean has announced that 


If the Mathematics Department gets an additional $60,000, 


then it will hire one new faculty member. 


(1.3.1) 


Statement (1.3.1) states that on the condition that the Mathematics Department gets an 
additional $60,000, then the Mathematics Department will hire one new faculty member. 
A proposition such as (1.3.1) is called a conditional proposition. 


Definition 1.3.1 > 


If p and q are propositions, the proposition 


if p then g (1.3.2) 


Example 1.3.2 
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is called a conditional proposition and is denoted 


Dp q. 
The proposition p is called the hypothesis (or antecedent), and the proposition gq is called 
the conclusion (or consequent). < 
If we define 


p: The Mathematics Department gets an additional $60,000, 


q: The Mathematics Department will hire one new faculty member, 


then proposition (1.3.1) assumes the form (1.3.2). The hypothesis is the statement “The 
Mathematics Department gets an additional $60,000,” and the conclusion is the state- 
ment “The Mathematics Department will hire one new faculty member.” < 


What is the truth value of the dean’s statement (1.3.1)? First, suppose that the 
Mathematics Department gets an additional $60,000. If the Mathematics Department 
does hire an additional faculty member, surely the dean’s statement is true. (Using the 
notation of Example 1.3.2, if p and q are both true, then p — gq is true.) On the other 
hand, if the Mathematics Department gets an additional $60,000 and does not hire an 
additional faculty member, the dean is wrong—statement (1.3.1) is false. (If p is true and 
q is false, then p — q is false.) Now, suppose that the Mathematics Department does 
not get an additional $60,000. In this case, the Mathematics Department might or might 
not hire an additional faculty member. (Perhaps a member of the department retires and 
someone is hired to replace the retiree. On the other hand, the department might not 
hire anyone.) Surely we would not consider the dean’s statement to be false. Thus, if the 
Mathematics Department does not get an additional $60,000, the dean’s statement must 
be true, regardless of whether the department hires an additional faculty member or not. 
(If p is false, then p — q is true whether gq is true or false.) This discussion motivates 
the following definition. 


Definition 1.3.3 > The truth value of the conditional proposition p > q is de- 
fined by the following truth table: 


P @\|Pp7@ 
bo oF T 
T & F 
F T Tr 
F F T 


< 


For those who need additional evidence that we should define p— q to be 
true when p is false, we offer further justification. Most people would agree that the 
proposition, 


For all real numbers x, if x > 0, then x? > 0, (1.3.3) 


is true. (In Section 1.5, we will discuss such “for all” statements formally and in detail.) 
In the following discussion, we let P(x) denote x > 0 and Q(x) denote x* > 0. That 
proposition (1.3.3) is true means that no matter which real number we replace x with, 
the proposition 


if P(x) then Q(x) (1.3.4) 
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Example 1.3.4 


Example 1.3.5 


that results is true. For example, if x =3, then P(3) and Q(3) are both true (3 > 0 and 
3? > 0 are both true), and, by Definition 1.3.3, (1.3.4) is true. Now let us consider the 
situation when P(x) is false. If x = — 2, then P(—2) is false (—2 > 0 is false) and Q(—2) 
is true [(—2)* > 0 is true]. In order for proposition (1.3.4) to be true in this case, we 
must define p — g to be true when p is false and gq is true. This is exactly what oc- 
curs in the third line of the truth table of Definition 1.3.3. If x =0, then P(O) and Q(0) 
are both false (0>0 and 07 > 0 are both false). In order for proposition (1.3.4) to be 
true in this case, we must define p — q to be true when both p and q are false. This 
is exactly what occurs in the fourth line of the truth table of Definition 1.3.3. Even 
more motivation for defining p — gq to be true when p is false is given in Exercises 
77 and 78. 


Letp:1>2andq:4 < 8. Then pis false and gq is true. Therefore, p — q is true and 
q — pis false. < 


In expressions that involve the logical operators A, V, =, and —, the conditional 
operator — is evaluated last. For example, 


pvq-7r 
is interpreted as 
(pV q) > (*r). 
Assuming that p is true, g is false, and r is true, find the truth value of each proposition. 


(a) pAq>r 

(b) pyq>-r 
(c) PA@>1r) 
(4) p> (q>1r) 


SOLUTION 


(a) We first evaluate p A q because — is evaluated last. Since p is true and q is false, 
PA qis false. Therefore, p \ g — r is true (regardless of whether r is true or 
false). 


(b) We first evaluate sr. Since r is true, —r is false. We next evaluate p V q. Since p 
is true and q is false, p V q is true. Therefore, p V g > —r is false. 


(c) Since g is false, g — r is true (regardless of whether r is true or false). Since p 
is true, p A (¢ > 1) is true. 

(d) Since gq is false, g — r is true (regardless of whether r is true or false). Thus, 
Pp — (q > 1r) is true (regardless of whether p is true or false). < 


A conditional proposition that is true because the hypothesis is false is said to be 
true by default or vacuously true. For example, if the proposition, 


If the Mathematics Department gets an additional $60,000, then it will hire one 
new faculty member, 


is true because the Mathematics Department did not get an additional $60,000, we would 
say that the proposition is true by default or that it is vacuously true. 

Some statements not of the form (1.3.2) may be rephrased as conditional proposi- 
tions, as the next example illustrates. 
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Example 1.3.6 Restate each proposition in the form (1.3.2) of a conditional proposition. 


(a) 
(b) 
(c) 
(d) 


(e) 


Mary will be a good student if she studies hard. 
John takes calculus only if he has sophomore, junior, or senior standing. 
When you sing, my ears hurt. 


A necessary condition for the Cubs to win the World Series is that they sign a 
right-handed relief pitcher. 


A sufficient condition for Maria to visit France is that she goes to the Eiffel Tower. 


SOLUTION 


(a) 


(b) 


(c) 


(d) 


The hypothesis is the clause following if; thus an equivalent formulation is 
If Mary studies hard, then she will be a good student. 


The statement means that in order for John to take calculus, he must have sopho- 
more, junior, or senior standing. In particular, if he is a freshman, he may not take 
calculus. Thus, we can conclude that if he takes calculus, then he has sophomore, 
junior, or senior standing. Therefore an equivalent formulation is 


If John takes calculus, then he has sophomore, junior, or senior standing. 
Notice that 
If John has sophomore, junior, or senior standing, then he takes calculus, 


is not an equivalent formulation. If John has sophomore, junior, or senior stand- 
ing, he may or may not take calculus. (Although eligible to take calculus, he may 
have decided not to.) 

The “if p then g’” formulation emphasizes the hypothesis, whereas the “p 
only if g” formulation emphasizes the conclusion; the difference is only stylistic. 


When means the same as /f; thus an equivalent formulation is 
If you sing, then my ears hurt. 


A necessary condition is just that: a condition that is necessary for a particular 
outcome to be achieved. The condition does not guarantee the outcome; but, if 
the condition does not hold, the outcome will not be achieved. Here, the given 
statement means that if the Cubs win the World Series, we can be sure that they 
signed a right-handed relief pitcher since, without such a signing, they would 
not have won the World Series. Thus, an equivalent formulation of the given 
statement is 


If the Cubs win the World Series, then they signed a right-handed relief 
pitcher. 


The conclusion expresses a necessary condition. 
Notice that 


If the Cubs sign a right-handed relief pitcher, then they win the World 
Series, 


is not an equivalent formulation. Signing a right-handed relief pitcher does not 
guarantee a World Series win. However, not signing a right-handed relief pitcher 
guarantees that they will not win the World Series. 
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Example 1.3.7 


(e) Similarly, a sufficient condition is a condition that suffices to guarantee a partic- 
ular outcome. If the condition does not hold, the outcome might be achieved in 
other ways or it might not be achieved at all; but if the condition does hold, the 
outcome is guaranteed. Here, to be sure that Maria visits France, it suffices for her 
to go to the Eiffel Tower. (There are surely other ways to ensure that Maria visits 
France; for example, she could go to Lyon.) Thus, an equivalent formulation of 
the given statement is 


If Maria goes to the Eiffel Tower, then she visits France. 


The hypothesis expresses a sufficient condition. 
Notice that 


If Maria visits France, then she goes to the Eiffel Tower, 


is not an equivalent formulation. As we have already noted, there are ways other 
than going to the Eiffel Tower to ensure that Maria visits France. < 


Example 1.3.4 shows that the proposition p — q can be true while the proposition 
q — pis false. We call the proposition g — p the converse of the proposition p > q. 
Thus a conditional proposition can be true while its converse is false. 
Write the conditional proposition, 


If Jerry receives a scholarship, then he will go to college, 


and its converse symbolically and in words. Also, assuming that Jerry does not receive 
a scholarship, but wins the lottery and goes to college anyway, find the truth value of the 
original proposition and its converse. 


SOLUTION Let p: Jerry receives a scholarship, and q: Jerry goes to college. The given 
proposition can be written symbolically as p — gq. Since the hypothesis p is false, the 
conditional proposition is true. 

The converse of the proposition is 


If Jerry goes to college, then he receives a scholarship. 


The converse can be written symbolically as g — p. Since the hypothesis g is true and 
the conclusion p is false, the converse is false. < 


Another useful proposition is 
p if and only if q, 


which is considered to be true precisely when p and q have the same truth values (i.e., 
p and q are both true or p and q are both false). 


Definition 1.3.8 > If p and q are propositions, the proposition 
p if and only if g 
is called a biconditional proposition and is denoted 


p<>4q. 


Example 1.3.9 


Example 1.3.11 
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The truth value of the proposition p <> q is defined by the following truth table: 


P4\|podqd 
T T T 
T F F 
F T BR 
F F T 
< 


It is traditional in mathematical definitions to use “if” to mean “if and only if?’ Con- 
sider, for example, the definition of set equality: If sets X and Y have the same elements, 
then X and Y are equal. The meaning of this definition is that sets X and Y have the same 
elements if and only if X and Y are equal. 

An alternative way to state “‘p if and only if q” is “p is a necessary and sufficient 
condition for g.” The proposition “p if and only if g” is sometimes written “p iff g.” 


The proposition 
1 < 5 if and only if 2 < 8 (1.3.5) 
can be written symbolically as p <> q if we define p: 1 < 5 and gq: 2 < 8. Since both p 


and q are true, the proposition p < q is true. < 


An alternative way to state (1.3.5) is: A necessary and sufficient condition for 
1 < 5isthat2 <8. 

In some cases, two different propositions have the same truth values no matter 
what truth values their constituent propositions have. Such propositions are said to be 
logically equivalent. 


Definition 1.3.10 > Suppose that the propositions P and Q are made up of the 


propositions p1,..., Py. We say that P and Q are logically equivalent and write 
P=Q, 

provided that, given any truth values of p,,...,P,, either P and Q are both true, or P 

and Q are both false. < 


De Morgan's Laws for Logic We will verify the first of De Morgan’s laws 


(pV qQ) =—pA7q, ADAG) = pV 79, 


and leave the second as an exercise (see Exercise 79). 
By writing the truth tables for P = —(pV q) and Q = =pA -g, we can verify that, 
given any truth values of p and q, either P and Q are both true or P and Q are both false: 


“(pV q) =pA7q 


TD MHHy\|s 
jAHomea|s 


F 
F 
F 
T 


Sn 


Thus P and Q are logically equivalent. < 
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Example 1.3.12 


Example 1.3.13 


Example 1.3.14 


Show that, in Java, the expressions 
x < 10 || x > 20 
and 
(x >= 10 && x <= 20) 


are equivalent. (In Java, >= means >, and <= means <.) 


SOLUTION If we let p denote the expression x >= 10 and q denote the expression 
x <= 20, the expression !(x >= 10 && x <= 20) becomes —(p A q). By De Mor- 
gan’s second law, —(p A q) is equivalent to —p V —q. Since —p translates as x < 10 
and —g translates asx > 20,-—pV-qtranslatesasx < 10 || x > 20. Therefore, the 
expressions x < 10 || x > 20and!(x >= 10 && x <= 20) are equivalent. < 


Our next example gives a logically equivalent form of the negation of p > q. 


Show that the negation of p — q is logically equivalent to p A 7g. 


SOLUTION By writing the truth tables for P = =(p > gq) and Q = pA 7g, we can 
verify that, given any truth values of p and q, either P and Q are both true or P and Q are 
both false: 


Po 4@|7@>4q pA~d 
T Ff F F 
T F T T 
FT F RF 
F F FP F 
Thus P and Q are logically equivalent. < 


Use the logical equivalence of —=(p — gq) and p A 7 q (see Example 1.3.13) to write the 
negation of 


If Jerry receives a scholarship, then he goes to college, 


symbolically and in words. 


SOLUTION We let p: Jerry receives a scholarship, and g: Jerry goes to college. The 
given proposition can be written symbolically as p — q. Its negation is logically equiv- 
alent to p A 7g. In words, this last expression is 


Jerry receives a scholarship and he does not go to college. < 


We now show that, according to our definitions, p < gq is logically equivalent to 
Pp —> qandq — p.\n words, 


p if and only if g 
is logically equivalent to 


if p then q and if g then p. 


Example 1.3.15 


Example 1.3.17 
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The truth table shows that 


POq=P7-QAQ-—P). 


Pod P>qd G>Pp P>OHNAG>P) 


MoH |s 
Meaty [ss 
Aad} 
amy 


T T 
F F 
ae F 
a T 


< 


Consider again the definition of set equality: If sets X and Y have the same ele- 
ments, then X and Y are equal. We noted that the meaning of this definition is that sets 
X and Y have the same elements if and only if X and Y are equal. Example 1.3.15 shows 
that an equivalent formulation is: If sets X and Y have the same elements, then X and Y 
are equal, and if X and Y are equal, then X and Y have the same elements. 

We conclude this section by defining the contrapositive of a conditional propo- 
sition. We will see (in Theorem 1.3.18) that the contrapositive is an alternative, logi- 
cally equivalent form of the conditional proposition. Exercise 80 gives another logically 
equivalent form of the conditional proposition. 


Definition 1.3.16 >» The contrapositive (or transposition) of the conditional 
proposition p — q is the proposition ~q — —p. 4 


Notice the difference between the contrapositive and the converse. The converse of 
a conditional proposition merely reverses the roles of p and q, whereas the contrapositive 
reverses the roles of p and g and negates each of them. 


Write the conditional proposition, 
If the network is down, then Dale cannot access the internet, 


symbolically. Write the contrapositive and the converse symbolically and in words. Also, 
assuming that the network is not down and Dale can access the internet, find the truth 
value of the original proposition, its contrapositive, and its converse. 


SOLUTION Let p: The network is down, and q: Dale cannot access the internet. The 
given proposition can be written symbolically as p — q. Since the hypothesis p is false, 
the conditional proposition is true. 

The contrapositive can be written symbolically as ~qg — —p and, in words, 


If Dale can access the internet, then the network is not down. 


Since the hypothesis —q and conclusion —p are both true, the contrapositive is true. 
(Theorem 1.3.18 will show that the conditional proposition and its contrapositive are 
logically equivalent, that is, that they always have the same truth value.) 

The converse of the given proposition can be written symbolically as g — p and, 
in words, 


If Dale cannot access the internet, then the network is down. 
Since the hypothesis q is false, the converse is true. < 


An important fact is that a conditional proposition and its contrapositive are 
logically equivalent. 
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Theorem 1.3.18 


The conditional proposition p — gq and its contrapositive —qg — —p are logically 
equivalent. 


Proof The truth table 


P @|P>q “Q->—p 
a T a 
YY F F F 
BE T T 
EF T T 
shows that p > q and —q — —p are logically equivalent. < 


In ordinary language, “if” is often used to mean “if and only if”’ Consider the 
statement 


If you fix my computer, then I’ll pay you $50. 
The intended meaning is 


If you fix my computer, then I’ll pay you $50, and 
if you do not fix my computer, then I will not pay you $50, 


which is logically equivalent to (see Theorem 1.3.18) 


If you fix my computer, then I’ll pay you $50, and 
if I pay you $50, then you fix my computer, 


which, in turn, is logically equivalent to (see Example 1.3.15) 
You fix my computer if and only if I pay you $50. 


In ordinary discourse, the intended meaning of statements involving logical operators 
can often (but, not always!) be inferred. However, in mathematics and science, precision 
is required. Only by carefully defining what we mean by terms such as “if” and “if 
and only if” can we obtain unambiguous and precise statements. In particular, logic 
carefully distinguishes among conditional, biconditional, converse, and contrapositive 
propositions. 


1.3 Problem-Solving Tips 


= Informal logic, “if” and “if and only if” are quite different. The conditional propo- 
sition p — q (if p then q) is true except when p is true and q is false. On the other 
hand, the biconditional proposition p < gq (p if and only if q) is true precisely 
when p and gq are both true or both false. 

= To determine whether propositions P and Q, made up of the propositions p),..., 
Pn, are logically equivalent, write the truth tables for P and Q. If all of the entries 
for P and Q are always both true or both false, then P and Q are equivalent. If 
some entry is true for one of P or Q and false for the other, then P and Q are not 
equivalent. 


= De Morgan’s laws for logic 


a(pV q) = =p A~4, (Pp Aq) = pv 74 
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give formulas for negating “or” (V) and negating “and” (A). Roughly speaking, 
negating “or” results in “and,” and negating “and” results in “or.” 


m Example 1.3.13 states a very important equivalence 


(Pp > gq) =PpA~4, 


which we will meet throughout this book. This equivalence shows that the negation 
of the conditional proposition can be written using the “and” (A) operator. Notice 
that there is no conditional operator on the right-hand side of the equation. 


1.3 Review Exercises 


= 


. What is a conditional proposition? How is it denoted? 
. Give the truth table for the conditional proposition. 

. Inaconditional proposition, what is the hypothesis? 

. Ina conditional proposition, what is the conclusion? 


. What is a necessary condition? 


na un F&F WwW WN 


. What is a sufficient condition? 


7. What is the converse of p > q? 

8. What is a biconditional proposition? How is it denoted? 

9. Give the truth table for the biconditional proposition. 
10. What does it mean for P to be logically equivalent to Q? 
11. State De Morgan’s laws for logic. 


12. What is the contrapositive of p > q? 


In Exercises I-11, restate each proposition in the form (1.3.2) of a 
conditional proposition. 


1. Joey will pass the discrete mathematics exam if he studies 
hard. 


2. Rosa may graduate if she has 160 quarter-hours of credits. 


3. A necessary condition for Fernando to buy a computer is that 
he obtain $2000. 


4. A sufficient condition for Katrina to take the algorithms course 
is that she pass discrete mathematics. 


5. Getting that job requires knowing someone who knows the 
boss. 


6. Youcan go to the Super Bowl unless you can’t afford the ticket. 


7. You may inspect the aircraft only if you have the proper secu- 
rity clearance. 


8. When better cars are built, Buick will build them. 


9. The audience will go to sleep if the chairperson gives the 
lecture. 


10. The program is readable only if it is well structured. 


11. A necessary condition for the switch to not be turned properly 
is that the light is not on. 


12. Write the converse of each proposition in Exercises 1-11. 

13. Write the contrapositive of each proposition in Exercises 1-11. 
Assuming that p and r are false and that q and s are true, find the 
truth value of each proposition in Exercises 14-22. 

15. =p > -q 

17. (p> QgA(q->r) 


18. (poqgor 
20. (s > (pPA=r)) A ((p > (FV g)) AS) 

21. (pA7q) > (GA7r)) > (SV 79) 

22. (pV gq) A (qv 5)) > (7rv p) AVS) 


19. p> (q-> 1) 


Exercises 23-32 refer to the propositions p, q, and r; p is true, 
q is false, and r’s status is unknown at this time. Tell whether 
each proposition is true, is false, or has unknown status at this 
time. 


23. pvr 24. pAr 25. por 
26.q>r 27. r—>p 28. r> q 
29. (pAnor 30. (pVnor 31. qAner 


32. (qvneor 


Determine the truth value of each proposition in Exercises 
33-42. 

33. If3 +5 < 2,then]1+3=4. 

34. If3 +5 <2, then1+3 44. 

35. If3 +5 > 2, then1+3=4. 

36. If3 +5 > 2, then1+3 44. 

37. 3+5 > 2 if and only if 1+3=4. 
38. 345 < 2 if and only if1+3=4. 
39. 345 <2 if and only if1+3 44. 
40. If the earth has six moons, then | < 3. 
41. If 1 < 3, then the earth has six moons. 
42. If < 3.14, then 7? < 9.85. 
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In Exercises 43-46, represent the given proposition symbolically 
by letting 


p: 4<2, q: 7<10, r: 6<6. 


43. If 4 < 2, then 7 < 10. 
44. If (4 < 2 and 6 < 6), then7 < 10. 


45. If it is not the case that (6 < 6 and 7 is not less than 10), then 
6 <6. 


46. 7 < 10 if and only if (4 < 2 and 6 is not less than 6). 
In Exercises 47-52, represent the given proposition symbolically 
by letting 


p: You run 10 laps daily. 
q: Youare healthy. 
r: You take multi-vitamins. 


47. If you run 10 laps daily, then you will be healthy. 


48. If you do not run 10 laps daily or do not take multi-vitamins, 
then you will not be healthy. 


49. Taking multi-vitamins is sufficient for being healthy. 


50. You will be healthy if and only if you run 10 laps daily and 
take multi-vitamins. 


51. If you are healthy, then you run 10 laps daily or you take multi- 
vitamins. 


52. If you are healthy and run 10 laps daily, then you do not take 
multi-vitamins. 


In Exercises 53-58, formulate the symbolic expression in words 
using 

p: Today is Monday, 

q: It is raining, 

r: It is hot. 
53. p> q 54. -=q > (r Ap) 
56. A(pyvger 
57. (pA (qv r)) > (FV Gv P)) 
58. (pV (7PATGVN)) > PV ATV Q)) 
In Exercises 59-62, write each conditional proposition sym- 
bolically. Write the converse and contrapositive of each 
proposition symbolically and in words. Also, find the truth 
value of each conditional proposition, its converse, and its 
contrapositive. 
59. If 4 < 6, then9 > 12. 
61. |1| < 3if -—3 <1 <3. 


55. =p > (qVr) 


60. If 4 > 6, then9 > 12. 
62. |4| < 3if -3 <4 <3. 


For each pair of propositions P and Q in Exercises 63-72, state 
whether or not P = Q. 


63. P=p,Q=pVvq 64. P=pAq,Q=—pv -q 


65. P=p>q,Q0=-pvq 

66. P=pA(-qVr),O0=pv (qAn-r) 

67. P=pA(QVr),0=(PVQDAPVN 

68. P=p—>q,Q=-q—>-p 

69. P=p>q,Q0=poq 

70. P=(p>qQAqQ>1r),Q=p>r 

1. P=(p>qg)>r,Q=p>(q>Nr) 

72. P=(s > (pA=r)) A (p> (FV Q))AS),O=pVvt 


Using De Morgan’s laws for logic, write the negation of each 
proposition in Exercises 73-76. 


73. Pat will use the treadmill or lift weights. 

74. Dale is smart and funny. 

75. Shirley will either take the bus or catch a ride to school. 

76. Red pepper and onions are required to make chili. 

Exercises 77 and 78 provide further motivation for defining p > q 
to be true when p is false. We consider changing the truth table for 
Pp — qwhen p is false. For the first change, we call the resulting 
operator impI (Exercise 77), and, for the second change, we call 


the resulting operator imp2 (Exercise 78). In both cases, we see 
that pathologies result. 


77. Define the truth table for imp/ by 


pimplq 


v 
& 


mom44 
mMHamy 


- 
F 
iP 
T 


Show that p impl q = q imp! p. 
78. Define the truth table for imp2 by 


P q | pimp2q 
Tt =f T 
T F F 
F T T 
F F F 


(a) Show that 


(p imp2 q) A (q imp2 p) 4 p = q. (1.3.6) 


(b) Show that (1.3.6) remains true if we change the third row 
of imp2’s truth table to F T F. 

79. Verify the second of De Morgan’s laws, =(p A gq) = =p V 7. 
80. Show that (p > q) = (—=p V q). 


1.4 ® Arguments and Rules of Inference 31 


1.4 Arguments and Rules of Inference 


Consider the following sequence of propositions. 


The bug is either in module 17 or in module 81. 
The bug is a numerical error. 


Module 81 has no numerical error. (1.4.1) 
Assuming that these statements are true, it is reasonable to conclude 
The bug is in module 17. (1.4.2) 


This process of drawing a conclusion from a sequence of propositions is called deductive 
reasoning. The given propositions, such as (1.4.1), are called hypotheses or premises, 
and the proposition that follows from the hypotheses, such as (1.4.2), is called the con- 
clusion. A (deductive) argument consists of hypotheses together with a conclusion. 
Many proofs in mathematics and computer science are deductive arguments. 

Any argument has the form 


If p; and p2 and --- and p,, then q. (1.4.3) 
Argument (1.4.3) is said to be valid if the conclusion follows from the hypotheses; that 


is, if pj and p and - - - and p, are true, then g must also be true. This discussion motivates 
the following definition. 


Definition 1.4.1 >» An argument is a sequence of propositions written 


Pi 
P2 
Pn 
q 
or 
P1sP2,+++5Pn/..- 
The symbol .°, is read “therefore.” The propositions p;,p2,...,Pn are called the 
hypotheses (or premises), and the proposition g is called the conclusion. The argument 
Go Online is valid provided that if p; and p2 and --- and py are all true, then q must also be true; 
For more on otherwise, the argument is invalid (or a fallacy). < 
fallacies, see 
goo.gl/F7b35e In a valid argument, we sometimes say that the conclusion follows from the hy- 


potheses. Notice that we are not saying that the conclusion is true; we are only saying 
that if you grant the hypotheses, you must also grant the conclusion. An argument is 
valid because of its form, not because of its content. 

Each step of an extended argument involves drawing intermediate conclusions. 
For the argument as a whole to be valid, each step of the argument must result in a valid, 
intermediate conclusion. Rules of inference, brief, valid arguments, are used within a 
larger argument. 
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Example 1.4.2 Determine whether the argument 


p?>q 


is valid. 


FIRST SOLUTION We construct a truth table for all the propositions involved: 


P @\|P~@ P @ 
T T E T T 
T F FR T F 
F T T F T 
F OF T F F 


We observe that whenever the hypotheses p — gq and p are true, the conclusion g is also 
true; therefore, the argument is valid. 


SECOND SOLUTION We can avoid writing the truth table by directly verifying that 
whenever the hypotheses are true, the conclusion is also true. 

Suppose that p — gq and p are true. Then g must be true, for otherwise p > q 
would be false. Therefore, the argument is valid. < 


The argument in Example 1.4.2 is used extensively and is known as the modus 
ponens rule of inference or law of detachment. Several useful rules of inference for 
propositions, which may be verified using truth tables (see Exercises 33-38), are listed 
in Table 1.4.1. 


TABLE 1.4.1 m@ Rules of Inference for Propositions 


Rule of Inference Name Rule of Inference Name 
p>4 P 
P q 
“4g Modus ponens SpA Conjunction 
p> 4 p>4 
mi q7>r Hypothetical 
“ap Modus tollens por syllogism 
PY 
P ™?p Disjunctive 
JpVvag Addition 2G syllogism 
PA 
Pp Simplification 


Which rule of inference is used in the following argument? 

If the computer has one gigabyte of memory, then it can run “Blast ’em.” If the 
computer can run “Blast ’em,” then the sonics will be impressive. Therefore, if the com- 
puter has one gigabyte of memory, then the sonics will be impressive. 


Example 1.4.4 


Example 1.4.5 
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SOLUTION Let p denote the proposition “the computer has one gigabyte of memory,” 
let g denote the proposition “the computer can run ‘Blast ’em,’” and let r denote the 
proposition “the sonics will be impressive.” The argument can be written symbolically as 


Pq 
q->r 


cap er 


Therefore, the argument uses the hypothetical syllogism rule of inference. < 


Represent the argument 


If 2 = 3, then I ate my hat. 
Tate my hat. 


i235 
symbolically and determine whether the argument is valid. 


SOLUTION If we let p: 2 = 3 and q: I ate my hat, the argument may be written 


pq 
q 


sep 


If the argument is valid, then whenever p — g and q are both true, p must also be 
true. Suppose that p — q and q are true. This is possible if p is false and q is true. In this 
case, p is not true; thus the argument is invalid. This fallacy is known as the fallacy of 
affirming the conclusion. < 


We can also determine whether the argument in Example 1.4.4 is valid or not by 
examining the truth table of Example 1.4.2. In the third row of the table, the hypotheses 
are true and the conclusion is false; thus the argument is invalid. 


Represent the argument 


The bug is either in module 17 or in module 81. 
The bug is a numerical error. 


Module 81 has no numerical error. 


.. The bug is in module 17. 
given at the beginning of this section symbolically and show that it is valid. 


SOLUTION If we let 


p.: The bug is in module 17. 
q: The bug is in module 81. 


r: The bug is a numerical error. 
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Example 1.4.6 


the argument may be written 


From r — -—q and r, we may use modus ponens to conclude —q. From p V q and 7=q, 
we may use the disjunctive syllogism to conclude p. Thus the conclusion p follows from 
the hypotheses and the argument is valid. < 


We are given the following hypotheses: If the Chargers get a good linebacker, then the 
Chargers can beat the Broncos. If the Chargers can beat the Broncos, then the Chargers 
can beat the Jets. If the Chargers can beat the Broncos, then the Chargers can beat the 
Dolphins. The Chargers get a good linebacker. Show by using the rules of inference (see 
Table 1.4.1) that the conclusion, the Chargers can beat the Jets and the Chargers can beat 
the Dolphins, follows from the hypotheses. 


SOLUTION Let p denote the proposition “the Chargers get a good linebacker,” let g 
denote the proposition “the Chargers can beat the Broncos,” let r denote the proposition 
“the Chargers can beat the Jets,” and let s denote the proposition “the Chargers can beat 
the Dolphins.” Then the hypotheses are: 

pq 

q>r 

q>s 

Dp. 


From p — q and gq — r, we may use the hypothetical syllogism to conclude p — r. 
From p — rand p, we may use modus ponens to conclude r. From p > q andq > s, 
we may use the hypothetical syllogism to conclude p > s. From p > s and p, we may 
use modus ponens to conclude s. From r ands, we may use conjunction to conclude r/s. 
Since r/s represents the proposition “the Chargers can beat the Jets and the Chargers can 
beat the Dolphins,” we conclude that the conclusion does follow from the hypotheses. < 


1.4 Problem-Solving Tips 


The validity of a very short argument or proof might be verified using a truth table. In 
practice, arguments and proofs use rules of inference. 


1.4 Review Exercises 


NI nA un + B&B NY = 


8. State the modus tollens rule of inference. 


. What is deductive reasoning? 

. What is a hypothesis in an argument? 
. What is a premise in an argument? 

. What is a conclusion in an argument? 
. What is a valid argument? 

. What is an invalid argument? 


. State the modus ponens rule of inference. 


9. State the addition rule of inference. 
10. State the simplification rule of inference. 
11. State the conjunction rule of inference. 
12. State the hypothetical syllogism rule of inference. 


13. State the disjunctive syllogism rule of inference. 
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Formulate the arguments of Exercises 1-5 symbolically and deter- 
mine whether each is valid. Let 


p: I study hard. q: IgetA’s. r: I get rich. 


1. IfI study hard, then I get A’s. 
I study hard. 


.. Iget A’s. 
2. If I study hard, then I get A’s. 
If I don’t get rich, then I don’t get A’s. 


.. I get rich. 

3. I study hard if and only if I get rich. 
I get rich. 
.. Istudy hard. 


4. If I study hard or I get rich, then I get A’s. 
I get A’s. 


.. If I don’t study hard, then I get rich. 


5. If I study hard, then I get A’s or I get rich. 
I don’t get A’s and I don’t get rich. 


.. I don’t study hard. 


Formulate the arguments of Exercises 6-9 symbolically and deter- 
mine whether each is valid. 


p: The Democrats win. 
q: The Republicans win. 
r: Unemployment is up. 
s: The economy is up. 
6. If the Democrats win, then the economy is up; and, if the 
Republicans win, then unemployment is up. 


The Democrats win or the Republicans win. 


., Unemployment is up or the economy is up. 


7. If the Democrats win or the Republicans win, then unemploy- 
ment is up or the economy is up. 


The Democrats win and unemployment is not up. 


.. The economy is up. 

8. If the Democrats win, then unemployment is up. 
If the Republicans win, then the economy is up 
The Republicans and Democrats do not both win. 


The Democrats do not win. 


.. The economy is up. 


9. If the Democrats win, then unemployment is up or the econ- 
omy is up. 


If the Republicans win, then unemployment is up. 


The economy is not up. 


The Democrats win 


.. Unemployment is up or the Republicans win. 


In Exercises 10-14, write the given argument in words and deter- 
mine whether each argument is valid. Let 


p:4 gigabytes is better than no memory at all. 
q: We will buy more memory. 


r: We will buy a new computer. 


10. por ll. p> (rvq) 12. por 
p> rg r>q 
“p> (rag) por “4 

13. ar— —p 14. por 
r P= 
Ap —— 

q 


In Exercises 15-19, write the given argument in words and deter- 
mine whether each argument is valid. Let 


p: The for loop is faulty. 

q: The while loop is faulty. 

r: The hardware is unreliable. 
s: The output is correct. 


15. (PVqg > (rvs) 16. (rVs)>p 


Pp sq 
=F pVs 
a} Part a 

17. (pon-| 18. p> q 
(Qo s)—> p q7>r 
FAS a! 
PYG sor 
“pA Para 

19. p> (qv r) 
q> (VS) 
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Determine whether each argument in Exercises 20-24 is valid. 


20. p> 4q 21. p> q 
—P 74 


22. pA-=p 


4 


7q ete 1 
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23. p> (q> Nr) 24. (p>q)A(r>s) 


q> pn) pvr 
A(pVvgr>r S.qNVS 
25. Show that if 
Pi,p2/..p and —p, p3,-- +, Pn/.". € 


are valid arguments, the argument 


P1,P2,-+- »Pn/e.c 
is also valid. 


26. Comment on the following argument: 


Hard disk drive storage is better than nothing. 


Nothing is better than a solid state drive. 


.. Hard disk drive is better than a solid state drive. 


For each argument in Exercises 27-29, tell which rule of inference 
is used. 


27. Fishing is a popular sport. Therefore, fishing is a popular sport 
or lacrosse is wildly popular in California. 

28. If fishing is a popular sport, then lacrosse is wildly popular in 
California. Fishing is a popular sport. Therefore, lacrosse is 
wildly popular in California. 


1.5 Quantifiers 
Go Online 

For more on 
quantifiers, see 
goo.gl/F7b35e 


statement 


29. Fishing is a popular sport or lacrosse is wildly popular in 
California. Lacrosse is not wildly popular in California. There- 
fore, fishing is a popular sport. 


In Exercises 30-32, give an argument using rules of inference to 
show that the conclusion follows from the hypotheses. 


30. Hypotheses: If there is gas in the car, then I will go to the store. 
If I go to the store, then I will get a soda. There is gas in the 
car. Conclusion: I will get a soda. 


31. Hypotheses: If there is gas in the car, then I will go to the store. 
If I go to the store, then I will get a soda. I do not get a soda. 
Conclusion: There is not gas in the car, or the car transmission 
is defective. 


32. Hypotheses: If Jill can sing or Dweezle can play, then I'll buy 
the compact disc. Jill can sing. I'l buy the compact disc player. 
Conclusion: ll buy the compact disc and the compact disc 
player. 


33. Show that modus tollens (see Table 1.4.1) is valid. 

34. Show that addition (see Table 1.4.1) is valid. 

35. Show that simplification (see Table 1.4.1) is valid. 

36. Show that conjunction (see Table 1.4.1) is valid. 

37. Show that hypothetical syllogism (see Table 1.4.1) is valid. 
38. Show that disjunctive syllogism (see Table 1.4.1) is valid. 


The logic in Sections 1.2 and 1.3 that deals with propositions is incapable of describing 
most of the statements in mathematics and computer science. Consider, for example, the 


p: nis an odd integer. 


A proposition is a statement that is either true or false. The statement p is not a propo- 
sition, because whether p is true or false depends on the value of n. For example, p 
is true if n= 103 and false if n=8. Since most of the statements in mathematics and 
computer science use variables, we must extend the system of logic to include such 


statements. 


Definition 1.5.1 >» Let P(x) be a statement involving the variable x and let D 
be a set. We call P a propositional function or predicate (with respect to D) if for each 
x € D, P(x) is a proposition. We call D the domain of discourse of P. < 


In Definition 1.5.1, the domain of discourse specifies the allowable values 


for x. 


Example 1.5.2 


Let P(n) be the statement 


n is an odd integer. 


Example 1.5.3 
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Then P is a propositional function with domain of discourse Z* since for each n € Zt, 
P(n) is a proposition [i.e., for each n € Z*, P(n) is true or false but not both]. For 
example, if n = 1, we obtain the proposition 


P(1): 1 is an odd integer 
(which is true). If = 2, we obtain the proposition 
P(2): 2 is an odd integer 


(which is false). < 


A propositional function P, by itself, is neither true nor false. However, for each 
x in the domain of discourse, P(x) is a proposition and is, therefore, either true or false. 
We can think of a propositional function as defining a class of propositions, one for each 
element in the domain of discourse. For example, if P is a propositional function with 
domain of discourse Z*, we obtain the class of propositions 


P(1), P(2),.... 


Each of P(1), P(2), ... is either true or false. 


Explain why the following are propositional functions. 


(a) n? + 2n is an odd integer (domain of discourse = Z*). 

(b) x? — x — 6 = 0 (domain of discourse = R). 

(c) The baseball player hit over .300 in 2015 (domain of discourse = set of baseball 
players). 

(d) The film is rated over 20% by Rotten Tomatoes [the scale is 0% (awful) to 


100% (terrific)]. The domain of discourse is the set of films rated by Rotten 
Tomatoes. 


SOLUTION In statement (a), for each positive integer n, we obtain a proposition; there- 
fore, statement (a) is a propositional function. 

Similarly, in statement (b), for each real number x, we obtain a proposition; there- 
fore, statement (b) is a propositional function. 

We can regard the variable in statement (c) as “baseball player.” Whenever we 
substitute a particular baseball player for the variable “baseball player,” the statement is 
a proposition. For example, if we substitute “Joey Votto” for “baseball player,” statement 
(c) is 


Joey Votto hit over .300 in 2015, 


which is true. If we substitute “Andrew McCutchen” for “baseball player,” statement 
(c) is 


Andrew McCutchen hit over .300 in 2015, 


which is false. Thus statement (c) is a propositional function. 
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Example 1.5.5 


Example 1.5.6 


Statement (d) is similar in form to statement (c). Here the variable is “film.” When- 
ever we substitute a film rated by Rotten Tomatoes for the variable “film,” the statement 
is a proposition. For example if we substitute Spectre for “film,” statement (d) is 


Spectre is rated over 20% by Rotten Tomatoes, 


which is true since Spectre is rated 64% by Rotten Tomatoes. If we substitute Blended 
for “film,” statement (d) is 


Blended is rated over 20% by Rotten Tomatoes, 


which is false since Blended is rated 14% by Rotten Tomatoes. Thus statement (d) is a 
propositional function. < 


Most of the statements in mathematics and computer science use terms such as “for 
every” and “for some.” For example, in mathematics we have the following theorem: 


For every triangle T, the sum of the angles of T is equal to 180°. 
In computer science, we have this theorem: 
For some program P, the output of P is P itself. 


We now extend the logical system of Sections 1.2 and 1.3 so that we can handle state- 
ments that include “for every” and “for some.” 


Definition 1.5.4 >» Let P bea propositional function with domain of discourse 
D. The statement 
for every x, P(x) 


is said to be a universally quantified statement. The symbol V means “for every.” Thus 
the statement 


for every x, P(x) 
may be written 
Vx P(x). (1.5.1) 


The symbol V is called a universal quantifier. 
The statement (1.5.1) is true if P(x) is true for every x in D. The statement (1.5.1) 
is false if P(x) is false for at least one x in D. <4 


Consider the universally quantified statement 
Yx(x" > 0). 
The domain of discourse is R. The statement is true because, for every real number x, it 


is true that the square of x is positive or zero. < 


According to Definition 1.5.4, the universally quantified statement (1.5.1) is false 
if for at least one x in the domain of discourse, the proposition P(x) is false. A value 
x in the domain of discourse that makes P(x) false is called a counterexample to the 
statement (1.5.1). 


Determine whether the universally quantified statement Vx(x? — 1 > 0) is true or false. 
The domain of discourse is R. 


Example 1.5.7 
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SOLUTION The statement is false since, if x = 1, the proposition 17 — 1 > 0 is 
false. The value | is a counterexample to the statement Vx(x* — 1 > 0). Although there 
are values of x that make the propositional function true, the counterexample provided 
shows that the universally quantified statement is false. < 


Suppose that P is a propositional function whose domain of discourse is the set 
{d,,...,d,}. The following pseudocode determines whether Vx P(x) is true or false: 


fori=lton 
if (~P(dj)) 
return false 
return true 


The for loop examines the members d; of the domain of discourse one by one. If it finds 
a value d; for which P(d;) is false, the condition —P(d;) in the if statement is true; so the 
code returns false [to indicate that Vx P(x) is false] and terminates. In this case, d; is a 
counterexample. If P(d;) is true for every d;, the condition —P(d;) in the if statement is 
always false. In this case, the for loop runs to completion, after which the code returns 
true [to indicate that Vx P(x) is true] and terminates. 

Notice that if Vx P(x) is true, the for loop necessarily runs to completion so that 
every member of the domain of discourse is checked to ensure that P(x) is true for every 
x. If Vx P(x) is false, the for loop terminates as soon as one element x of the domain of 
discourse is found for which P(x) is false. < 


We call the variable x in the propositional function P(x) a free variable. (The idea 
is that x is “free” to roam over the domain of discourse.) We call the variable x in the 
universally quantified statement Vx P(x) a bound variable. (The idea is that x is “bound” 
by the quantifier V.) 

We previously pointed out that a propositional function does not have a truth value. 
On the other hand, Definition 1.5.4 assigns a truth value to the quantified statement 
Vx P(x). In sum, a statement with free (unquantified) variables is not a proposition, and 
a statement with no free variables (no unquantified variables) is a proposition. 

Alternative ways to write Vx P(x) are 


for all x, P(x) 
and 
for any x, P(x). 


The symbol V may be read “for every,” “for all,” or “for any.” 

To prove that Vx P(x) is true, we must, in effect, examine every value of x in the 
domain of discourse and show that for every x, P(x) is true. One technique for proving 
that Vx P(x) is true is to let x denote an arbitrary element of the domain of discourse D. 
The argument then proceeds using the symbol x. Whatever is claimed about x must be 
true no matter what value x might have in D. The argument must conclude by proving 
that P(x) is true. 

Sometimes to specify the domain of discourse D, we write a universally quantified 
statement as 


for every x in D, P(x). 


The pseudocode used in this book is explained in Appendix C. 
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Example 1.5.8 


Verify that the universally quantified statement 
for every real number x, if x > 1, thenx+1> 1 


is true. 


SOLUTION This time we must verify that the statement 
ifx >1,thenx+1> 1 (1.5.2) 


is true for every real number x. 

Let x be any real number whatsoever. It is true that for any real number x, either 
x < lorx > 1. Ifx < 1, the conditional proposition (1.5.2) is vacuously true. (The 
proposition is true because the hypothesis x > 1 is false. Recall that when the hypothesis 
is false, the conditional proposition is true regardless of whether the conclusion is true 
or false.) In most arguments, the vacuous case is omitted. 

Now suppose that x > 1. Regardless of the specific value of x, x + 1 > x. Since 
x+1> xandx > 1, we conclude that x + 1 > 1, so the conclusion is true. If x > 1, 
the hypothesis and conclusion are both true; hence the conditional proposition (1.5.2) is 
true. 

We have shown that for every real number x, the proposition (1.5.2) is true. There- 
fore, the universally quantified statement 


for every real number x, ifx > 1, thenx+1> 1 


is true. < 


The method of disproving the statement Vx P(x) is quite different from the method 
used to prove that the statement is true. To show that the universally quantified statement 
Vx P(x) is false, it is sufficient to find one value x in the domain of discourse for which 
the proposition P(x) is false. Such a value, we recall, is called a counterexample to the 
universally quantified statement. 

We turn next to existentially quantified statements. 


Definition 1.5.9 >» Let Pbeapropositional function with domain of discourse D. 
The statement 


there exists x, P(x) 


is said to be an existentially quantified statement. The symbol 3 means “there exists.” 
Thus the statement 


there exists x, P(x) 
may be written 
ax P(x). (1.5.3) 


The symbol J is called an existential quantifier. 
The statement (1.5.3) is true if P(x) is true for at least one x in D. The statement 
(1.5.3) is false if P(x) is false for every x in D. < 


Example 1.5.10 Consider the existentially quantified statement 


x 2 
dx | ——- = -}. 
(a5 3) 


Example 1.5.11 


1.5 ® Quantifiers 41 


The domain of discourse is R. The statement is true because it is possible to find at least 
one real number x for which the proposition 


x 2 


etl 5 
is true. For example, if x = 2, we obtain the true proposition 


2 2 


P+1 5 
It is not the case that every value of x results in a true proposition. For example, if x = 1, 
the proposition 


1 2 


P41 5 
is false. < 


According to Definition 1.5.9, the existentially quantified statement (1.5.3) is false 
if for every x in the domain of discourse, the proposition P(x) is false. 


Verify that the existentially quantified statement 
1 
axeR ead >1 


SOLUTION We must show that 


is false. 


is false for every real number x. Now 


is false precisely when 


pal <1 (1.5.4) 
is true. Thus, we must show that (1.5.4) is true for every real number x. To this end, 
let x be any real number whatsoever. Since 0 < x”, we may add 1 to both sides of this 
inequality to obtain 1 < x? + 1. If we divide both sides of this last inequality by x? + 1, 
we obtain (1.5.4) Therefore, the statement (1.5.4) is true for every real number x. Thus 
the statement 


: 1 
> > 
x41 


is false for every real number x. We have shown that the existentially quantified 
statement 


is false. < 
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Example 1.5.12 


Example 1.5.13 


Suppose that P is a propositional function whose domain of discourse is the set 
{d,,...,d,}. The following pseudocode determines whether 4x P(x) is true or false: 


fori=lton 
if (P(dj)) 
return true 
return false 


The for loop examines the members d; in the domain of discourse one by one. If it finds 
a value d; for which P(d;) is true, the condition P(d;) in the if statement is true; so the 
code returns true [to indicate that 4x P(x) is true] and terminates. In this case, the code 
found a value in the domain of discourse, namely dj, for which P(d;) is true. If P(d;) is 
false for every d;, the condition P(d;) in the if statement is always false. In this case, the 
for loop runs to completion, after which the code returns false [to indicate that 4x P(x) 
is true] and terminates. 

Notice that if 4x P(x) is true, the for loop terminates as soon as one element x 
in the domain of discourse is found for which P(x) is true. If 4x P(x) is false, the for 
loop necessarily runs to completion so that every member in the domain of discourse is 
checked to ensure that P(x) is false for every x. < 


Alternative ways to write dx P(x) are 
there exists x such that, P(x) 
and 
for some x, P(x) 
and 
for at least one x, P(x). 


The symbol J may be read “there exists,” “for some,” or “for at least one.” 


Consider the existentially quantified statement 
for some n, if n is prime, thenn + 1,n+2,n-+ 3, andn-+ 4 are not prime. 


The domain of discourse is Z*. This statement is true because we can find at least one 
positive integer n that makes the conditional proposition 


if nis prime, thenn+ 1,n+2,n+3, andn-+ 4 are not prime 
true. For example, if n = 23, we obtain the true proposition 
if 23 is prime, then 24, 25, 26, and 27 are not prime. 


(This conditional proposition is true because both the hypothesis “23 is prime” and the 
conclusion “24, 25, 26, and 27 are not prime” are true.) Some values of n make the 
conditional proposition true (e.g., 2 = 23,n = 4,n = 47), while others make it false 
(e.g., 2 = 2,n = 101). The point is that we found one value that makes the conditional 
proposition 


if nis prime, thenn + 1,n+2,n-+ 3, andn-+ 4 are not prime 


Theorem 1.5.14 


Example 1.5.15 


Example 1.5.16 
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true. For this reason, the existentially quantified statement 
for some n, if n is prime, thenn + 1,n+2,n+3, andn-+ 4 are not prime 


is true. < 


In Example 1.5.11, we showed that an existentially quantified statement was false 
by proving that a related universally quantified statement was true. The following theo- 
rem makes this relationship precise. The theorem generalizes De Morgan’s laws of logic 
(Example 1.3.11). 


Generalized De Morgan's Laws for Logic 


If P is a propositional function, each pair of propositions in (a) and (b) has the same 
truth values (i.e., either both are true or both are false). 


(a) =(Vx P(x)); dx >P(x) 
(b) =(Ax P(x); Vx PQ) 


Proof We prove only part (a) and leave the proof of part (b) to the reader 
(Exercise 73). 

Suppose that the proposition —(Vx P(x)) is true. Then the proposition Vx P(x) is 
false. By Definition 1.5.4, the proposition Vx P(x) is false precisely when P(x) is false 
for at least one x in the domain of discourse. But if P(x) is false for at least one x in 
the domain of discourse, —P(x) is true for at least one x in the domain of discourse. By 
Definition 1.5.9, when —P(x) is true for at least one x in the domain of discourse, the 
proposition 4x —P(x) is true. Thus, if the proposition —(Vx P(x)) is true, the proposi- 
tion 4x —P(x) is true. Similarly, if the proposition =(Vx P(x)) is false, the proposition 
dx -P(x) is false. 

Therefore, the pair of propositions in part (a) always has the same truth values. ¢ 


Let P(x) be the statement 


1 


——. 1, 
wt) . 


In Example 1.5.11 we showed that 4x P(x) is false by verifying that 
Vx aP(x) (1.5.5) 


is true. 

The technique can be justified by appealing to Theorem 1.5.14. After we prove 
that proposition (1.5.5) is true, we may use Theorem 1.5.14, part (b), to conclude that 
—=(AxP(x)) is also true. Thus ~7>(AxP(x)) or, equivalently, 4x P(x) is false. < 


Write the statement 
Every rock fan loves U2, 
symbolically. Write its negation symbolically and in words. 
SOLUTION Let P(x) be the propositional function “x loves U2.” The given state- 


ment can be written symbolically as Vx P(x). The domain of discourse is the set of 
rock fans. 
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Example 1.5.17 


Example 1.5.18 


Example 1.5.19 


By Theorem 1.5.14, part (a), the negation of the preceding proposition —(Vx P(x)) 
is equivalent to 4x —P(x). In words, this last proposition can be stated as: There exists a 
rock fan who does not love U2. < 
Write the statement 

Some birds cannot fly, 


symbolically. Write its negation symbolically and in words. 


SOLUTION Let P(x) be the propositional function “x flies.” The given statement can be 
written symbolically as 4x —P(x). [The statement could also be written 4x Q(x), where 
Q(x) is the propositional function “x cannot fly.” As in algebra, there are many ways to 
represent text symbolically.] The domain of discourse is the set of birds. 

By Theorem 1.5.14, part (b), the negation —(Ax —P(x)) of the preceding proposi- 
tion is equivalent to Vx ~—P(x) or, equivalently, Vx P(x). In words, this last proposition 
can be stated as: Every bird can fly. < 


A universally quantified proposition generalizes the proposition 
Pi AP2A-+-:-APy (1.5.6) 


in the sense that the individual propositions P;, P2,..., P, are replaced by an arbitrary 
family P(x), where x is in the domain of discourse, and (1.5.6) is replaced by 


Vx P(x). (1.5.7) 


The proposition (1.5.6) is true if and only if P; is true for every i = 1,...,n. The truth 
value of proposition (1.5.7) is defined similarly: (1.5.7) is true if and only if P(x) is true 
for every x in the domain of discourse. 


Suppose that the domain of discourse of the propositional function P is {—1, 0, 1}. The 
propositional function Vx P(x) is equivalent to 


P(—1) A P(O) A PC). 4 
Similarly, an existentially quantified proposition generalizes the proposition 
Pi V Po V +++ V Py (1.5.8) 


in the sense that the individual propositions P;, P2,..., P, are replaced by an arbitrary 
family P(x), where x is in the domain of discourse, and (1.5.8) is replaced by Ax P(x). 


Suppose that the domain of discourse of the propositional function P is {1, 2, 3, 4}. The 
propositional function 4x P(x) is equivalent to 


P(1) V P(2) v P(3) V P(4). a 


The preceding observations explain how Theorem 1.5.14 generalizes De Morgan’s 
laws for logic (Example 1.3.11). Recall that the first of De Morgan’s law for logic states 
that the propositions 


(P| V P27 V---V Py) and aP, AAP A+++ A APy 
have the same truth values. In Theorem 1.5.14, part (b), 


aP; AAP) A--- AP, 


Example 1.5.20 
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is replaced by Vx —P(x) and 
a(P, V P27 V---V Py) 


is replaced by =(Ax P(x)). 
Statements in words often have more than one possible interpretation. Consider the well- 
known quotation from Shakespeare’s “The Merchant of Venice”: 

All that glitters is not gold. 


One possible interpretation of this quotation is: Every object that glitters is not gold. 
However, this is surely not what Shakespeare intended. The correct interpretation is: 
Some object that glitters is not gold. 

If we let P(x) be the propositional function “x glitters” and Q(x) be the proposi- 
tional function “‘x is gold,” the first interpretation becomes 


Wx(P(x) > 7Q(x)), (1.5.9) 
and the second interpretation becomes 
Ax(P(x) A =Q(x)). 
Using the result of Example 1.3.13, we see that the truth values of 
Ax(P(x) A =Q(x)) 
and 
dx 7(P(x) > O()) 
are the same. By Theorem 1.5.14, the truth values of 
ax =(P(x) > Q(x) 
and 
a(Wx P(x) > Q(x) 
are the same. Thus an equivalent way to represent the second interpretation is 
a(Vx P(x) > Q(x)). (1.5.10) 


Comparing (1.5.9) and (1.5.10), we see that the ambiguity results from whether the nega- 
tion applies to Q(x) (the first interpretation) or to the entire statement 


Wx(P(x) > Q(x) 
(the second interpretation). The correct interpretation of the statement 
All that glitters is not gold 


results from negating the entire statement. 
In positive statements, “any,” “all,” “each,” and “every” have the same meaning. 
In negative statements, the situation changes: 
Not all x satisfy P(x). 
Not each x satisfies P(x). 


Not every x satisfies P(x). 


46 


Chapter 1 ® Sets and Logic 


are considered to have the same meaning as 


For some x, —P(x); 


whereas 
Not any x satisfies P(x). 
No x satisfies P(x). 
mean 
For all x, —P(x). 
See Exercises 61—71 for other examples. < 


Rules of Inference for Quantified Statements 


We conclude this section by introducing some rules of inference for quantified state- 
ments and showing how they can be used with rules of inference for propositions (see 
Section 1.4). 

Suppose that VxP(x) is true. By Definition 1.5.4, P(x) is true for every x in D, the 
domain of discourse. In particular, if d is in D, then P(d) is true. We have shown that the 
argument 


Vx P(x) 
..P(d) ifd¢eD 


is valid. This rule of inference is called universal instantiation. Similar arguments (see 
Exercises 79-81) justify the other rules of inference listed in Table 1.5.1. 


TABLE 1.5.1 @ Rules of Inference for Quantified Statementst 


Rule of Inference Name 
Vx P(x) 
,P(d) ifdeD Universal instantiation 


P(d) for every de D 


J Wx P(x) Universal generalization 
Ax P(x) 
.. P(d) for some de D Existential instantiation 


P(d) for some d € D 


dy P(x) Existential generalization 


+ The domain of discourse is D. 


Example 1.5.21 Given that 


2 


for every positive integer n,n“ > n 


is true, we may use universal instantiation to conclude that 547 > 54 since 54 is a positive 
integer (i.e., a member of the domain of discourse). < 


Example 1.5.22 


Example 1.5.23 


Example 1.5.24 
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Let P(x) denote the propositional function “x owns a laptop computer,” where the domain 
of discourse is the set of students taking MATH 201 (discrete mathematics). Suppose that 
Taylor, who is taking MATH 201, owns a laptop computer; in symbols, P(Taylor) is true. 
We may then use existential generalization to conclude that 4x P(x) is true. < 


Write the following argument symbolically and then, using rules of inference, show that 
the argument is valid. 


SOLUTION For every real number x, if x is an integer, then x is a rational number. The 
number J2 is not rational. Therefore, J2 is not an integer. 

If we let P(x) denote the propositional function “x is an integer” and Q(x) denote 
the propositional function “‘x is rational,” the argument becomes 


Vx € R(P(x) > O(n) 
=0(V2) 


+, aP(/2) 


Since J2 € R, we may use universal instantiation to conclude P(/2) > O(V2). 
Combining P(/2) => O(V2) and =0(/2), we may use modus tollens (see Table 1.4.1) 
to conclude =P(/2). Thus the argument is valid. < 


The argument in Example 1.5.23 is called universal modus tollens. 


We are given these hypotheses: Everyone loves either Microsoft or Apple. Lynn does 
not love Microsoft. Show that the conclusion, Lynn loves Apple, follows from the hy- 
potheses. 


SOLUTION Let P(x) denote the propositional function “x loves Microsoft,” and let 
Q(x) denote the propositional function “x loves Apple.” The first hypothesis is Vx(P(x) V 
Q(x)). By universal instantiation, we have P(Lynn) V Q(Lynn). The second hypothesis 
is —P(Lynn). The disjunctive syllogism rule of inference (see Table 1.4.1) now gives 
Q(Lynn), which represents the proposition “Lynn loves Apple.” We conclude that the 
conclusion does follow from the hypotheses. < 


1.5 Problem-Solving Tips 


= To prove that the universally quantified statement Vx P(x) is true, show that for 
every x in the domain of discourse, the proposition P(x) is true. Showing that P(x) 
is true for a particular value x does not prove that Vx P(x) is true. 


= To prove that the existentially quantified statement 4x P(x) is true, find one value 
of x in the domain of discourse for which the proposition P(x) is true. One value 
suffices. 

= To prove that the universally quantified statement Vx P(x) is false, find one value 
of x (a counterexample) in the domain of discourse for which the proposition P(x) 
is false. 

= To prove that the existentially quantified statement 4x P(x) is false, show that for 
every x in the domain of discourse, the proposition P(x) is false. Showing that P(x) 
is false for a particular value x does not prove that 4x P(x) is false. 
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1.5 Review Exercises 


— 


. What is a propositional function? 

. What is a domain of discourse? 

What is a universally quantified statement? 
What is a counterexample? 

What is an existentially quantified statement? 


. State the generalized De Morgan’s laws for logic. 


1A we w 


. Explain how to prove that a universally quantified statement is 
true. 


oo 


. Explain how to prove that an existentially quantified statement 
is true. 


9. Explain how to prove that a universally quantified statement is 
false. 


10. Explain how to prove that an existentially quantified statement 
is false. 


11. State the universal instantiation rule of inference. 
12. State the universal generalization rule of inference. 
13. State the existential instantiation rule of inference. 


14. State the existential generalization rule of inference. 


In Exercises 1-6, tell whether the statement is a propositional 
function. For each statement that is a propositional function, give 
a domain of discourse. 

. (2n + 1)? is an odd integer. 

. Choose an integer between | and 10. 


. Let x be a real number. 


& WN = 


. The movie won the Academy Award as the best picture of 
1955. 


5. 14+3=4. 
6. There exists x such that x < y (x, y real numbers). 
Let P(n) be the propositional function “n divides 77.” Write each 
proposition in Exercises 7-15 in words and tell whether it is true 
or false. The domain of discourse is Z*. 
wa - PC) 8. P(1) 
10. Vn P(n) 11. An P(n) 
13. dn-P(n) 14. =(Vn P(n)) 


9. P(3) 
12. Vna-P(n) 
15. =(Gn P(n)) 
Let P(x) be the propositional function “x > x.” Tell whether each 


proposition in Exercises 16-24 is true or false. The domain of dis- 
course is R. 


16. P(1) 17. P(2) 18. P(1/2) 
19. Vx P(x) 20. Ax P(x) 21. =A(Vx P(x)) 
22. —(Ax P(x)) 23. Vx =P(x) 24. Ax P(x) 


Suppose that the domain of discourse of the propositional 
function P is {1,2,3,4}. Rewrite each propositional func- 
tion in Exercises 25-31 using only negation, disjunction, and 
conjunction. 


25. Vx P(x) 26. Vx =P(x) 
28. Ax P(x) 29. Ax -=P(x) 
31. Vx((x 4 1) > P(x)) 


27. =(Wx P(x)) 
30. (Ax P(x) 


Let P(x) denote the statement “x is taking a math course.” The do- 
main of discourse is the set of all students. Write each proposition 
in Exercises 32-37 in words. 
32. Wx P(x) 

34, Vx =P(x) 35. Ax P(x) 
36. (Vx P(x)) 37. =(Ax P(x)) 


38. Write the negation of each proposition in Exercises 32-37 
symbolically and in words. 


33. Ax P(x) 


Let P(x) denote the statement “x is a professional athlete,” and 
let Q(x) denote the statement “x plays soccer.” The domain of dis- 
course is the set of all people. Write each proposition in Exercises 
39_46 in words. Determine the truth value of each statement. 


39. Vx (P(x) > Q(x)) 40. Ax (P(x) > Q(x)) 
41. Vx (Q(x) > P(x)) 42. Ax (Q(x) > P(x)) 
43. Vx (P(x) V Q(x)) 44, Ax (P(x) V Q(x)) 
45. Vx (P(x) A Q(x)) 46. Ax (P(x) A Q(x)) 


47. Write the negation of each proposition in Exercises 39-46 
symbolically and in words. 


Let P(x) denote the statement “x is an accountant,” and let Q(x) 
denote the statement “x owns a Porsche.” Write each statement in 
Exercises 48-51 symbolically. 

48. All accountants own Porsches. 

49. Some accountant owns a Porsche. 

50. All owners of Porsches are accountants. 

51. Someone who owns a Porsche is an accountant. 


52. Write the negation of each proposition in Exercises 48-51 
symbolically and in words. 


Determine the truth value of each statement in Exercises 53-58. 
The domain of discourse is R. Justify your answers. 


53. Vx(x? > x) 54, Ax(x? > x) 


55. Wx(x > 13 x2 > x) 
56. axw>1—ox>x) 
57. Vx(x > 1 > x/(x? +1) < 1/3) 
58. Ax(x > 1 > x/Q? +1) < 1/3) 


59. Write the negation of each proposition in Exercises 53-58 
symbolically and in words. 


60. Could the pseudocode of Example 1.5.7 be written as follows? 


fori=l1ton 
if (-P(di)) 
return false 
else 


return true 


What is the literal meaning of each statement in Exercises 61-71? 

What is the intended meaning? Clarify each statement by rephras- 

ing it and writing it symbolically. 

61. From Dear Abby: All men do not cheat on their wives. 

62. From the San Antonio Express-News: All old things don’t 
covet twenty-somethings. 

63. All 74 hospitals did not report every month. 

64. Economist Robert J. Samuelson: Every environmental prob- 
lem is not a tragedy. 

65. Comment from a Door County alderman: This is still Door 
County and we all don’t have a degree. 

66. Headline over a Martha Stewart column: All lampshades can’t 
be cleaned. 

67. Headline in the New York Times: A World Where All Is Not 
Sweetness and Light. 

68. Headline over a story about subsidized housing: Everyone 
can’t afford home. 

69. George W. Bush: I understand everybody in this country 
doesn’t agree with the decisions I’ve made. 


70. From Newsweek: Formal investigations are a sound prac- 
tice in the right circumstances, but every circumstance is not 
right. 


71. Joe Girardi (manager of the New York Yankees): Every move 
is not going to work out. 
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72. (a) Use a truth table to prove that if p and q are propositions, 
at least one of p > gq or q > pis true. 


(b 


~S 


Let /(x) be the propositional function “x is an integer” 

and let P(x) be the propositional function “‘x is a positive 

number.’ The domain of discourse is R. Determine 

whether or not the following proof that all integers are 

positive or all positive real numbers are integers is correct. 
By part (a), 


Wx (L(x) > P(x)) V (PQ) > I@))) 


is true. In words: For all x, if x is an integer, then x is pos- 
itive; or if x is positive, then x is an integer. Therefore, 
all integers are positive or all positive real numbers are 
integers. 


73. Prove Theorem 1.5.14, part (b). 


74. Analyze the following comments by film critic Roger Ebert: 
No good movie is too long. No bad movie is short enough. 
Love Actually is good, but it is too long. 


75. Which rule of inference is used in the following argument? 
Every rational number is of the form p/q, where p and q are 
integers. Therefore, 9.345 is of the form p/q. 


In Exercises 76-78, give an argument using rules of inference to 
show that the conclusion follows from the hypotheses. 


76. Hypotheses: Everyone in the class has a graphing calcula- 
tor. Everyone who has a graphing calculator understands the 
trigonometric functions. Conclusion: Ralphie, who is in the 
class, understands the trigonometric functions. 


77. Hypotheses: Ken, a member of the Titans, can hit the ball a 
long way. Everyone who can hit the ball a long way can make 
a lot of money. Conclusion: Some member of the Titans can 
make a lot of money. 


78. Hypotheses: Everyone in the discrete mathematics class loves 
proofs. Someone in the discrete mathematics class has never 
taken calculus. Conclusion: Someone who loves proofs has 
never taken calculus. 

79. Show that universal generalization (see Table 1.5.1) is valid. 

80. Show that existential instantiation (see Table 1.5.1) is valid. 


81. Show that existential generalization (see Table 1.5.1) is 
valid. 


Nested Quantifiers 


Consider writing the statement 


The sum of any two positive real numbers is positive, 


symbolically. We first note that since two numbers are involved, we will need two vari- 
ables, say x and y. The assertion can be restated as: If x > 0 and y > 0, thenx+y> 0. 
The given statement says that the sum of any two positive real numbers is positive, so 
we need two universal quantifiers. If we let P(x, y) denote the expression (x > 0) A (y > 
0) — («+ y > 0), the given statement can be written symbolically as 


VxVy P(x, y). 
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Example 1.6.1 


Example 1.6.2 


Example 1.6.3 


Example 1.6.4 


In words, for every x and for every y, ifx > 0 and y > 0, then x + y > 0. The domain 
of discourse of the two-variable propositional function P is R x R, which means that 
each variable x and y must belong to the set of real numbers. Multiple quantifiers such 
as VxVy are said to be nested quantifiers. In this section we explore nested quantifiers 
in detail. 


Restate Vmsn(m < n) in words. The domain of discourse is the set Z x Z. 


SOLUTION We may first rephrase this statement as: For every m, there exists n such 
that m <n. Less formally, this means that if you take any integer m whatsoever, there is 
an integer n greater than m. Another restatement is then: There is no greatest integer. <4 
Write the assertion 

Everybody loves somebody, 


symbolically, letting L(x, y) be the statement “‘x loves y.” 


SOLUTION “Everybody” requires universal quantification and “somebody” requires 
existential quantification. Thus, the given statement may be written symbolically as 


Vxady L(x, y). 


In words, for every person x, there exists a person y such that x loves y. 
Notice that 


AxVy L(x, y) 


is not a correct interpretation of the original statement. This latter statement is: There 
exists a person x such that for all y, x loves y. Less formally, someone loves everyone. 
The order of quantifiers is important; changing the order can change the meaning. <4 


By definition, the statement VxVy P(x, y), with domain of discourse X x Y, is true 
if, for every x € X and for every y € Y, P(x, y) is true. The statement VxVy P(x, y) is 
false if there is at least one x € X and at least one y € Y such that P(x, y) is false. 


Consider the statement 
VxVy((x > 0) A (vy > 0) > X+y > 0)). 


The domain of discourse is R x R. This statement is true because, for every real number 
x and for every real number y, the conditional proposition 


(x>0)AQ > 0) > (x+y > 0) 
is true. In words, for every real number x and for every real number y, if x and y are 
positive, their sum is positive. < 
Consider the statement 
VxVy((x > 0) A (vy < 0) > &+y 40)). 


The domain of discourse is R x R. This statement is false because if x = | and y = —1, 
the conditional proposition 


(x > 0)A (y <0) > @+y £0) 


is false. We say that the pair x = 1 and y = —1 is a counterexample. < 


Example 1.6.5 


Example 1.6.6 


Example 1.6.7 


Example 1.6.8 
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Suppose that P is a propositional function with domain of discourse {d,..., dy} x 
{d),..., d,}. The following pseudocode determines whether VxVy P(x, y) is true or false: 


fori=lton 
forj = 1ton 
if (~P(di, dj) 
return false 
return true 


The for loops examine members of the domain of discourse. If they find a pair dj, d 
for which P(dj, dj) is false, the condition —P(dj, d;) in the if statement is true; so the 
code returns false [to indicate that VxVy P(x, y) is false] and terminates. In this case, 
the pair d;, dj is a counterexample. If P(d;, d;) is true for every pair dj, d;, the condition 
—P(dj, d;) in the if statement is always false. In this case, the for loops run to completion, 
after which the code returns true [to indicate that VxVy P(x, y) is true] and terminates. <4 


By definition, the statement Vxdy P(x, y), with domain of discourse X x Y, is true 
if, for every x € X, there is at least one y € Y for which P(x, y) is true. The statement 
Vxdy P(x, y) is false if there is at least one x € X such that P(x, y) is false for every y € Y. 


Consider the statement 
Vxdy(x + y = 0). 


The domain of discourse is R x R. This statement is true because, for every real number 
x, there is at least one y (namely y = —x) for which x+y = 0 is true. In words, for every 
real number x, there is a number that when added to x makes the sum zero. < 


Consider the statement 
Vxdy(x > y). 


The domain of discourse is Z* x Z*. This statement is false because there is at least one 
x, namely x = 1, such that x > y is false for every positive integer y. < 


Suppose that P is a propositional function with domain of discourse {d),...,d,} x 
{d,,...,d,}. The following pseudocode determines whether Vxdy P(x, y) is true or false: 


fori=lton 
if (— exists_dj(i)) 
return false 
return true 
exists_dj(i) { 
forj = 1ton 
if (P(di, dj)) 
return true 


return false 


} 


If for each dj, there exists d; such that P(d;, d;) is true, then for each i, P(d;, d;) is true for 
some j. Thus, exists_dj(i) returns true for every i. Since — exists_dj(i) is always false, 
the first for loop eventually terminates and true is returned to indicate that Vxdy P(x, y) 
is true. 
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Example 1.6.9 


Example 1.6.10 


Example 1.6.11 


Example 1.6.12 


Example 1.6.13 


Example 1.6.14 


If for some d;, P(dj, dj) is false for every j, then, for this i, P(d;, d;) is false for 
every j. In this case, the for loop in exists_dj(i) runs to termination and false is returned. 
Since — exists_dj(i) is true, false is returned to indicate that Vxdy P(x, y) is false. < 


By definition, the statement 4xVy P(x, y), with domain of discourse X x Y, is true 
if there is at least one x € X such that P(x, y) is true for every y € Y. The statement 
AxVy P(x, y) is false if, for every x € X, there is at least one y € Y such that P(x, y) is 
false. 


Consider the statement 4xVy(x < y). The domain of discourse is Z* x Z*. This state- 
ment is true because there is at least one positive integer x (namely x = 1) for which 
x < y is true for every positive integer y. In words, there is a smallest positive integer 
(namely 1). < 


Consider the statement 4xVy(x > y). The domain of discourse is Z* x Z*. This statement 
is false because, for every positive integer x, there is at least one positive integer y, namely 
y =x+ 1, such that x > y is false. In words, there is no greatest positive integer. < 


By definition, the statement 4xAy P(x, y), with domain of discourse X x Y, is true if 
there is at least one x € X and at least one y € Y such that P(x, y) is true. The statement 
Axdy P(x, y) is false if, for every x € X and for every y € Y, P(x, y) is false. 


Consider the statement 
dxdy(a~ > IA (> 1) A (ry = 6)). 


The domain of discourse is Z+ x Z*. This statement is true because there is at least one 
integer x > 1 (namely x = 2) and at least one integer y > 1 (namely y = 3) such that 
xy = 6. In words, 6 is composite (1.e., not prime). < 


Consider the statement 
Axdy((x > 1) A (y > 1) A Gy =7)). 


The domain of discourse is Zt x Z*. This statement is false because for every positive 
integer x and for every positive integer y, 


w>DAQ> DA (ay =7) 
is false. In words, 7 is prime. < 


The generalized De Morgan’s laws for logic (Theorem 1.5.14) can be used to 
negate a proposition containing nested quantifiers. 


Using the generalized De Morgan’s laws for logic, we find that the negation of 
Vxdy P(x, y) is 


a(VWxdy P(x, y)) = ax-(Ay P(, y)) = ary a=P(x, y). 


Notice how in the negation, V and J are interchanged. < 


Write the negation of 4xVy(xy < 1), where the domain of discourse is R x R. Determine 
the truth value of the given statement and its negation. 


Example 1.6.15 
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SOLUTION Using the generalized De Morgan’s laws for logic, we find that the nega- 
tion is 


a(AxVy(axy < 1)) = Vxr(Vy(@xy < 1)) = Vxdyr(ay < 1) = Vxdy(xy = 1). 


The given statement 4xVy(xy < 1) is true because there is at least one x (namely x = 0) 
such that xy < | for every y. Since the given statement is true, its negation is false. < 


We conclude with a logic game, which presents an alternative way to determine 
whether a quantified propositional function is true or false. André Berthiaume con- 
tributed this example. 


The Logic Game Given a quantified propositional function such as Vxdy P(x, y), you 
and your opponent, whom we call Farley, play a logic game. Your goal is to try to make 
P(x, y) true, and Farley’s goal is to try to make P(x, y) false. The game begins with the 
first (left) quantifier. If the quantifier is V, Farley chooses a value for that variable; if the 
quantifier is 3, you choose a value for that variable. The game continues with the second 
quantifier. After values are chosen for all the variables, if P(x, y) is true, you win; if 
P(x, y) is false, Farley wins. We will show that if you can always win regardless of how 
Farley chooses values for the variables, the quantified statement is true, but if Farley can 
choose values for the variables so that you cannot win, the quantified statement is false. 
Consider the statement 


Vxdy(x + y = 0). (1.6.1) 


The domain of discourse is R x R. Since the first quantifier is V, Farley goes first and 
chooses a value for x. Since the second quantifier is 3, you go second. Regardless of 
what value Farley chose, you can choose y= — x, which makes the statement x + y = 0 
true. You can always win the game, so the statement (1.6.1) is true. 

Next, consider the statement 


AxVy(x + y = 0). (1.6.2) 


Again, the domain of discourse is R x R. Since the first quantifier is 4, you go first and 
choose a value for x. Since the second quantifier is V, Farley goes second. Regardless of 
what value you chose, Farley can always choose a value for y, which makes the statement 
x+y = 0 false. (If you choose x = 0, Farley can choose y = 1. If you choose x # 0, 
Farley can choose y = 0.) Farley can always win the game, so the statement (1.6.2) is 
false. 

We discuss why the game correctly determines the truth value of a quantified 
propositional function. Consider VxVy P(x, y). If Farley can always win the game, this 
means that Farley can find values for x and y that make P(x, y) false. In this case, the 
propositional function is false; the values Farley found provide a counterexample. If 
Farley cannot win the game, no counterexample exists; in this case, the propositional 
function is true. 

Consider Vxdy P(x, y). Farley goes first and chooses a value for x. You choose 
second. If, no matter what value Farley chose, you can choose a value for y that makes 
P(x, y) true, you can always win the game and the propositional function is true. How- 
ever, if Farley can choose a value for x so that every value you choose for y makes P(x, y) 
false, then you will always lose the game and the propositional function is false. 

An analysis of the other cases also shows that if you can always win the game, the 
propositional function is true; but if Farley can always win the game, the propositional 
function is false. 

The logic game extends to propositional functions of more than two variables. The 
rules are the same and, again, if you can always win the game, the propositional function 
is true; but if Farley can always win the game, the propositional function is false. < 
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1.6 Problem-Solving Tips 


= To prove that VxVy P(x, y) is true, where the domain of discourse is X x Y, you 
must show that P(x, y) is true for all values of x € X and y € Y. One technique is to 
argue that P(x, y) is true using the symbols x and y to stand for arbitrary elements 
in X and Y. 


= To prove that VxVy P(x, y) is false, where the domain of discourse is X x Y, find 
one value of x € X and one value of y € Y (two values suffice—one for x and one 
for y) that make P(x, y) false. 


= To prove that Vxay P(x, y) is true, where the domain of discourse is X x Y, you 
must show that for all x € X, there is at least one y € Y such that P(x, y) is true. 
One technique is to let x stand for an arbitrary element in X and then find a value 
for y € Y (one value suffices!) that makes P(x, y) true. 


= To prove that Vxdy P(x, y) is false, where the domain of discourse is X x Y, you 
must show that for at least one x € X, P(x, y) is false for every y € Y. One 
technique is to find a value of x € X (again one value suffices!) that has the property 
that P(x, y) is false for every y € Y. Having chosen a value for x, let y stand for an 
arbitrary element of Y and show that P(x, y) is always false. 


= To prove that dxVy P(x, y) is true, where the domain of discourse is X x Y, you must 
show that for at least one x € X, P(x, y) is true for every y € Y. One technique is to 
find a value of x € X (again one value suffices!) that has the property that P(x, y) 
is true for every y € Y. Having chosen a value for x, let y stand for an arbitrary 
element of Y and show that P(x, y) is always true. 


= To prove that 4xVy P(x, y) is false, where the domain of discourse is X x Y, you 
must show that for all x € X, there is at least one y € Y such that P(x, y) is false. 
One technique is to let x stand for an arbitrary element in X and then find a value 
for y € Y (one value suffices!) that makes P(x, y) false. 


= To prove that Sxdy P(x, y) is true, where the domain of discourse is X x Y, find 
one value of x € X and one value of y € Y (two values suffice—one for x and one 
for y) that make P(x, y) true. 


= To prove that dxdy P(x, y) is false, where the domain of discourse is X x Y, you 
must show that P(x, y) is false for all values of x € X and y € Y. One technique 
is to argue that P(x, y) is false using the symbols x and y to stand for arbitrary 
elements in X and Y. 


= To negate an expression with nested quantifiers, use the generalized De Morgan’s 
laws for logic. Loosely speaking, V and J are interchanged. Don’t forget that the 
negation of p — gq is equivalent to p A 7q. 


1.6 Review Exercises 


iL. 


What is the interpretation of VxVyP(x, y)? When is this quan- 4. What is the interpretation of Sx4yP(x, y)? When is this quan- 
tified expression true? When is it false? tified expression true? When is it false? 

. What is the interpretation of VxdyP(x, y)? When is this quan- 5. Give an example to show that, in general, VxdyP(x, y) and 
tified expression true? When is it false? AxVyP(x, y) have different meanings. 

. What is the interpretation of 4xVyP(x, y)? When is this quan- 6. Write the negation of VxVyP(x, y) using the generalized 


tified expression true? When is it false? De Morgan’s laws for logic. 


7. Write the negation of VxiyP(x,y) using the generalized 
De Morgan’s laws for logic. 


8. Write the negation of AxVyP(x, y) using the generalized 
De Morgan’s laws for logic. 
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9. Write the negation of AxdyP(x,y) using the generalized 
De Morgan’s laws for logic. 


10. Explain the rules for playing the logic game. How can the 
logic game be used to determine the truth value of a quanti- 
fied expression? 


In Exercises 1-33, the set D, consists of three students: Garth, who 
is 5 feet 11 inches tall; Erin, who is 5 feet 6 inches tall; and Marty, 
who is 6 feet tall. The set Dz consists of four students: Dale, who 
is 6 feet tall; Garth, who is 5 feet 11 inches tall; Erin, who is 5 feet 
6 inches tall; and Marty, who is 6 feet tall. The set D3 consists of 
one student: Dale, who is 6 feet tall. The set D4 consists of three stu- 
dents: Pat, Sandy, and Gale, each of whom is 5 feet 11 inches tall. 

In Exercises 1-21, T,(x, y) is the propositional function “x 
is taller than y.” Write each proposition in Exercises 1-4 in words. 


1. WxVy T1(, y) 2. Wxdy T1 (x, y) 
3. AxVy Ti (x, y) 4. Axdy T\ (x, y) 
5. Write the negation of each proposition in Exercises 1-4 in 


words and symbolically. 


In Exercises 6-21, tell whether each proposition in Exercises 1-4 
is true or false if the domain of discourse is D; x Dj; for the given 
values of i and j. 


6. i=1,j=1 7.i=1,j=2 
8. i=1,/=3 9 i=1,j=4 
10. i=2, j=1 ll. i=2, j=2 
12. i=2, j=3 13. i=2,j=4 
14. i=3,j=1 15. i=3, j=2 
16. i=3, j=3 17. i=3,j=4 
18. i=4, j=1 19. i=4, j=2 
20. i= 4, j=3 2. i=4, j=4 


In Exercises 22-27, T2(x, y) is the propositional function “x is 
taller than or the same height as y.” Write each proposition in Ex- 
ercises 22—25 in words. 


22. WxVy Ta(x, y) 23. Wxay T2(x, y) 
24. AxVy T(x, y) 25. Axdy T(x, y) 


26. Write the negation of each proposition in Exercises 22-25 in 
words and symbolically. 


27. Tell whether each proposition in Exercises 22—25 is true or 
false if the domain of discourse is D; x D; for each pair of 
values i,j given in Exercises 6-21. The sets D;,..., D4 are 
defined before Exercise 1. 


In Exercises 28-33, T3(x, y) is the propositional function “if x and 
y are distinct persons, then x is taller than y.” Write each proposi- 
tion in Exercises 28-31 in words. 
28. VxVy T3(x, y) 
30. AxVy T3(x, y) 


29. Vxdy T3(x, y) 
31. Axdy T3(x, y) 


32. Write the negation of each proposition in Exercises 28-31 in 
words and symbolically. 


33. Tell whether each proposition in Exercises 28-31 is true or 
false if the domain of discourse is D; x Dj; for each pair of 
values i,j given in Exercises 6—21. The sets D;,..., D4 are 
defined before Exercise 1. 


Let L(x, y) be the propositional function “x loves y.” The domain 
of discourse is the Cartesian product of the set of all living people 
with itself (i.e., both x and y take on values in the set of all living 
people). Write each proposition in Exercises 34-37 symbolically. 
Which do you think are true? 

34. Someone loves everybody. 

35. Everybody loves everybody. 

36. Somebody loves somebody. 

37. Everybody loves somebody. 

38. Write the negation of each proposition in Exercises 34-37 in 


words and symbolically. 


Let A(x, y) be the propositional function “x attended y’s office 
hours” and let E(x) be the propositional function “x is enrolled 
in a discrete math class.” Let S be the set of students and let T de- 
note the set of teachers—all at Hudson University. The domain of 
discourse of A is Sx T and the domain of discourse of E is S. Write 
each proposition in Exercises 39-42 symbolically. 

39. Brit attended someone’s office hours. 

40. No one attended Professor Sandwich’s office hours. 

41. Every discrete math student attended someone’s office hours. 

42. All teachers had at least one student attend their office hours. 

Let P(x, y) be the propositional function x > y. The domain of 
discourse is Z* x Z*. Tell whether each proposition in Exercises 
4346 is true or false. 

43. VxVy P(x, y) 44. Vxdy P(x, y) 

45. AxVy P(x, y) 46. Axdy PG, y) 

47. Write the negation of each proposition in Exercises 43-46. 


Determine the truth value of each statement in Exercises 48-65. 
The domain of discourse is R x R. Justify your answers. 


48. VxVy(x2 < y+ 1) 49, Vxdy(x? < y+ 1) 
50. AxVy(? < y+ 1) 51. Ardy? < y+ 1) 
52. dyvx(x2 < y+ 1) 53. VWyax(x? <y+1) 
54. WxVy(x? + y? = 9) 55. Wxdy(x? + y* = 9) 
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56. AxVy(x? +y* = 9) 
58. VxVy(x? + y? > 0) 
60. AxVy(x? + y? > 0) 
62. VxVy((x < y) > (x2 < y*)) 
63. Vxdy((x < y) > (* < y*)) 
64. AxVy((x < y) > (0? < y*)) 
65. Axdy((x < y) > (@? < y*)) 


66. Write the negation of each proposition in Exercises 48-65. 


57. Axay(x? + y* = 9) 
59. Wxdy(x? + y* > 0) 
61. Ardy? + y* = 0) 


67. Suppose that P is a propositional function with domain of dis- 
course {d|,..., dn} x {d1,..., d,}. Write pseudocode that de- 
termines whether 


AxVy P(x, y) 
is true or false. 


68. Suppose that P is a propositional function with domain of dis- 
course {d,,..., dn} x {d1,..., dy}. Write pseudocode that de- 
termines whether 


Axay P(x, y) 
is true or false. 


69. Explain how the logic game (Example 1.6.15) determines 
whether each proposition in Exercises 48-65 is true or false. 


70. Use the logic game (Example 1.6.15) to determine whether the 
proposition 
Wav ydz((z > x) A (z < y)) 
is true or false. The domain of discourse is Z x Z x Z. 
71. Use the logic game (Example 1.6.15) to determine whether the 
proposition 
VaVydz((z < x) A (z < y)) 
is true or false. The domain of discourse is Z x Z x Z. 


72. Use the logic game (Example 1.6.15) to determine whether the 


proposition 
VWxVyaz((x < y) > (2 > x) A(Z<y))) 
is true or false. The domain of discourse is Z x Z x Z. 


73. Use the logic game (Example 1.6.15) to determine whether the 
proposition 


VaWyac((x < y) > (2 >) A(z <y))) 
is true or false. The domain of discourse is R x R x R. 


Assume that VxVy P(x, y) is true and that the domain of discourse 
is nonempty. Which of Exercises 74-76 must also be true? Prove 
your answer. 


74. WxAy P(x, y) 75. AxVy P(x, y) 76. Axdy P(x, y) 


Assume that AxVy P(x, y) is true and that the domain of discourse 
is nonempty. Which of Exercises 77-79 must also be true? Prove 
your answer. 


77. VxVy P(x, y) 78. Vxay P(x, y) 79. Axdy P(x, y) 


Assume that Axdy P(x, y) is true and that the domain of discourse 
is nonempty. Which of Exercises 80-82 must also be true? Prove 
your answer. 

80. VxVy P(x, y) 

81. Vxdy P(x, y) 

82. AxVy P(x, y) 

Assume that VxVy P(x, y) is false and that the domain of discourse 


is nonempty. Which of Exercises 83-85 must also be false? Prove 
your answer. 


83. Wxdy PO, y) 84. AxVy P(x, y) 85. Axdy P(x, y) 


Assume that VxAy P(x, y) is false and that the domain of discourse 
is nonempty. Which of Exercises 86-88 must also be false? Prove 
your answer. 

86. VxVy P(x, y) 

87. AxVy P(x, y) 

88. Axdy P(x, y) 

Assume that AxVy P(x, y) is false and that the domain of discourse 


is nonempty. Which of Exercises 89-91 must also be false? Prove 
your answer. 


89. VxVy P(x, y) 90. Vxdy P(x, y) 91. Axdy P(x, y) 


Assume that Axdy P(x, y) is false and that the domain of discourse 
is nonempty. Which of Exercises 92-94 must also be false? Prove 
your answer. 

92. WxVy P(x, y) 

93. Wxay P(x, y) 

94, AxVy P(x, y) 


Which of Exercises 
=(Vxdy P(x, y))? Explain. 


95-98 is logically equivalent to 


95. Ax7(Vy P(x, y)) 96. Wx-(Ay P(x, y)) 
97. AxVy —P(x, y) 98. Axdy =P(x, y) 
99. [Requires calculus] The definition of 


lim f(x) = L 


is: For every ¢ > 0, there exists 6 > O such that for all x if 
0 < |x-—al| < 4, then |f(x) — L| < e. Write this definition 
symbolically using V and 4. 


100. [Requires calculus] Write the negation of the definition of limit 
(see Exercise 99) in words and symbolically using V and J but 


not —. 


*101. [Requires calculus] Write the definition of “lim,—._ f(x) does 
not exist” (see Exercise 99) in words and symbolically using 


V and 4 but not -. 


102. Consider the headline: Every school may not be right for 
every child. What is the literal meaning? What is the intended 
meaning? Clarify the headline by rephrasing it and writing it 


symbolically. 


Problem-Solving Corner 


Problem 


Assume that Vxdy P(x, y) is true and that the domain 
of discourse is nonempty. Which of the following must 
also be true? If the statement is true, explain; otherwise, 
give a counterexample. 


(a) VxVy P(x, y) 
(b) AxVy P(x, y) 
(c) Ardy P(x, y) 


Attacking the Problem 


Let’s begin with part (a). We are given that Vxay P(x, y) 
is true, which says, in words, for every x, there exists 
at least one y for which P(x, y) is true. If (a) is also 
true, then, in words, for every x, for every y, P(x, y) 
is true. Let the words sink in. If for every x, P(x, y) is 
true for at least one y, doesn’t it seem unlikely that it 
would follow that P(x, y) is true for every y? We sus- 
pect that (a) could be false. We’ll need to come up with 
a counterexample. 

Contrasting statement (b) with the given state- 
ment, we see that the quantifiers V and 4 have been 
swapped. There is a difference. In the given true state- 
ment Vxdy P(x, y), given any x, it’s possible to find 
a y, which may depend on x, that makes P(x, y) 
true. For statement (b), 4xVy P(x, y), to be true, for 
some x, P(x, y) would need to be true for every 
y. Again, let the words sink in. These two state- 
ments seem quite different. We suspect that (b) also 
could be false. Again, we’ll need to come up with a 
counterexample. 

Now let’s turn to part (c). We are given that 
Vxdy P(x, y) is true, which says, in words, for every 
x, there exists at least one y for which P(x, y) is true. 
For statement (c), Sxdy P(x, y), to be true, for some x 
and for some y, P(x, y) must be true. But the given 
statement says that for every x, there exists at least 
one y for which P(x, y) is true. So if we pick one x 
(and we know we can since the domain of discourse is 
nonempty), the given statement assures us that there 
exists at least one y for which P(x, y) is true. Thus 
part (c) must be true. In fact, we have just given an 
explanation! 


Problem-Solving Corner: Quantifiers 


Quantifiers 


Finding a Solution 


As noted, we have already solved part (c). We need 
counterexamples for parts (a) and (b). 

For part (a), we need the given statement, 
Vxady P(x, y), to be true and VxVy P(x, y) to be false. In 
order for the given statement to be true, we must find a 
propositional function P(x, y) satisfying 


for every x, there exists y such that P(x, y) is true. 
(1) 


In order for (a) to be false, we must have 


at least one value of x and at least one value of y 
such that P(x, y) is false. (2) 


We can arrange for (1) and (2) to hold simultaneously 
if we choose P(x, y) so that for every x, P(x, y) is true 
for some y, but for at least one x, P(x, y) is also false 
for some other value of y. Upon reflection, many math- 
ematical statements have this property. For example, 
x > y, x,y € R, suffices. For every x, there exists y 
such that x > y is true. Furthermore, for every x (and, 
in particular, for at least one value of x), there exists y 
such that x > y is false. 

For part (b), we again need the given statement, 
Vxdy P(x, y), to be true and 4xVy P(x, y) to be false. In 
order for the given statement to be true, we must find 
a propositional function P(x, y) satisfying (1). In order 
for (b) to be false, we must have 


for every x, there exists at least one value of y such 
that P(x, y) is false. (3) 


We can arrange for (1) and (3) to hold simultaneously if 
we choose P(x, y) so that for every x, P(x, y) is true for 
some y and false for some other value of y. We noted 
in the preceding paragraph that x > y, x,y € R, has 
this property. 


Formal Solution 


(a) We give an example to show that statement 
(a) can be false while the given statement is 
true. Let P(x, y) be the propositional function 
x > y with domain of discourse R x R. Then 
Vxdy P(x, y) is true since for any x, we may 
choose y = x — | to make P(x, y) true. At 
the same time, VxVy P(x, y) is false. A counter- 
example is x = 0,y = 1. 
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(b) We give an example to show that statement (b) 
can be false while the given statement is true. 
Let P(x, y) be the propositional function x > y 
with domain of discourse R x R. As we showed 
in part (a), Vxdy P(x, y) is true. Now we show 
that 4xVy P(x, y) is false. Let x be an arbitrary 
element in R. We may choose y = x + 1 to 
make x > y false. Thus for every x, there exists 
y such that P(x, y) is false. Therefore statement 
(b) is false. 


We show that if the given statement is true, 
statement (c) is necessarily true. 

We are given that for every x, there ex- 
ists y such that P(x, y) is true. We must show 
that there exist x and y such that P(x, y) is true. 
Since the domain of discourse is nonempty, we 
may choose a value for x. For this chosen x, 
there exists y such that P(x, y) is true. We have 
found at least one value for x and at least one 
value for y that make P(x, y) true. Therefore 
Axdy P(x, y) is true. 


(c) 


Summary of Problem-Solving Techniques 


m When dealing with quantified statements, it is 
sometimes useful to write out the statements in 
words. For example, in this problem, it helped to 
write out exactly what Vxdy P(x, y) means. Take 
time to let the words sink in. 


If you have trouble finding examples, look at 
existing examples (e.g., examples in this book). 
To solve problems (a) and (b), we could have 
used the statement in Example 1.6.6. Sometimes, 
an existing example can be modified to solve a 
given problem. 


Exercises 


1. Show that the statement in Example 1.6.6 solves 


problems (a) and (b) in this Problem-Solving 
Corner. 


2. Could examples in Section 1.6 other than Example 


1.6.6 have been used to solve problems (a) and (b) 
in this Problem-Solving Corner? 


Chapter 1 Notes 


General references on discrete mathematics are [Graham, 1994; Liu, 1985; Tucker]. [Knuth, 
1997, 1998a, 1998b] is the classic reference for much of this material. 
{Halmos; Lipschutz; and Stoll] are recommended to the reader wanting to study set 


theory in more detail. 
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For a history of logic, see [Kline]. The role of logic in reasoning about computer programs 


is discussed by [Gries]. 


Chapter 1 Review 


Section 1.1 


ADAM RwWN 


. Set: any collection of objects 
. Notation for sets: {x | x has property P} 
. |X|, the cardinality of X: the number of elements in the set X 


x € X: xis an element of the set X 
x ¢ X: x is not an element of the set X 


. Empty set: © or { } 


X = Y, where X and ¥ are sets: X and Y have the same 
elements 


X C Y,X isa subset of Y: every element in X is also in Y 


. X C Y, X is a proper subset of Y: X C Y andX A Y 
. P(X), the power set of X: set of all subsets of X 
. [P(X)| = 2”! 


12 
13 


. X UY,X union Y: set of elements in X or Y or both 

. Union of a family S of sets: US = {x | x € X for some 
XeES} 

. X11 Y,X intersect Y: set of elements in X and Y 

. Intersection of a family S of sets: NS = {x|x € X for 
all X € S} 

. Disjoint sets X and Y: XN Y= 2 

. Pairwise disjoint family of sets 

. X — Y, difference of X and Y, relative complement: set of 
elements in X but not in Y 

. Universal set, universe 

. X, complement of X: U — X, where U is a universal set 

. Venn diagram 


22. 
23. 


24. 


25. 


26. 


27. 


Properties of sets (see Theorem 1.1.22) 

De Morgan’s laws for sets: (AU B) = ANB, (ANB) = 
AUB 

Partition of X: a collection S of nonempty subsets of X 
such that every element in X belongs to exactly one mem- 
ber of S 

Ordered pair: (x, y) 

Cartesian product of X and Y: X x Y = {(x,y) | x € X, 
ye Y} 

Cartesian product of X;, X2,.. 


ee. Oe 


X, x XX +++ kK Xy = {(Q1, a, ..., yn) | a € Xj} 


Section 1.2 


28. 
29. 
30. 
31. 
32. 
33. 
34. 


Logic 

Proposition 

Conjunction: p and q,p Aq 

Disjunction: p or g,p V q 

Negation: not p, =p 

Truth table 

Exclusive-or of propositions p, g: p or qg, but not both 


Section 1.3 


35. 
36. 
37. 
38. 
39. 
40. 
41. 
42. 
43. 


44. 


Conditional proposition: if p, then g; p > q 
Hypothesis 

Conclusion 

Necessary condition 

Sufficient condition 

Converse of p > g: gq > p 

Biconditional proposition: p if and only if g, p = q 
Logical equivalence: P = Q 

De Morgan’s laws for logic: -(p V q) = 
7(p \ gq) = pv 7q 

Contrapositive of p > q: —q — =p 


—p A 4, 


Section 1.4 


45. 
46. 
47. 
48. 
49. 
50. 
51. 
52. 


Deductive reasoning 

Hypothesis 

Premises 

Conclusion 

Argument 

Valid argument 

Invalid argument 

Rules of inference for propositions: modus ponens, modus 
tollens, addition, simplification, conjunction, hypothetical 
syllogism, disjunctive syllogism 


Section 1.5 


53. 
54. 


Propositional function 
Domain of discourse 


55. 
56. 
57. 
58. 
59. 
60. 


61. 


62. 


63. 


64. 


65. 
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Universal quantifier 

Universally quantified statement 
Counterexample 

Existential quantifier 

Existentially quantified statement 
Generalized De Morgan’s laws for logic: 


-=(VxP(x)) and 4x-P(x) have the same truth values. 
-(AxP(x)) and Vx-P(x) have the same truth values. 


To prove that the universally quantified statement Vx P(x) is 
true, show that for every x in the domain of discourse, the 
proposition P(x) is true. 

To prove that the existentially quantified statement 4x P(x) 
is true, find one value of x in the domain of discourse for 
which P(x) is true. 

To prove that the universally quantified statement Vx P(x) is 
false, find one value of x (a counterexample) in the domain 
of discourse for which P(x) is false. 

To prove that the existentially quantified statement 4x P(x) 
is false, show that for every x in the domain of discourse, 
the proposition P(x) is false. 

Rules of inference for quantified statements: universal in- 
stantiation, universal generalization, existential instantia- 
tion, existential generalization 


Section 1.6 


66. 


67. 


68. 


69. 


70. 


71. 


72. 


73. 


74. 


75. 


To prove that VxVy P(x, y) is true, show that P(x, y) is true 
for all values of x € X and y € Y, where the domain of 
discourse is X x Y. 

To prove that Vxdy P(x, y) is true, show that for all x € X, 
there is at least one y € Y such that P(x, y) is true, where 
the domain of discourse is X x Y. 

To prove that 4xVy P(x, y) is true, show that for at least one 
x € X, P(x, y) is true for every y € Y, where the domain of 
discourse is X x Y. 

To prove that 4xdy P(x, y) is true, find one value of x € X 
and one value of y € Y that make P(x, y) true, where the 
domain of discourse is X x Y. 

To prove that VxVy P(x, y) is false, find one value of x € X 
and one value of y € Y that make P(x, y) false, where the 
domain of discourse is X x Y. 

To prove that Vxdy P(x, y) is false, show that for at least one 
x € X, P(x, y) 1s false for every y € Y, where the domain of 
discourse is X x Y. 

To prove that 4xVy P(x, y) is false, show that for all x € X, 
there is at least one y € Y such that P(x, y) is false, where 
the domain of discourse is X x Y. 

To prove that Sxdy P(x, y) is false, show that P(x, y) is false 
for all values of x € X and y € Y, where the domain of 
discourse is X x Y. 

To negate an expression with nested quantifiers, use the gen- 
eralized De Morgan’s laws for logic. 

The logic game 
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Chapter 1 Self-Test 


1. 


10. 


11. 


12. 


13. 


IfA = {1,3,4,5,6,7}, B = {x | xis an even integer}, 
C = {2, 3, 4,5, 6}, find (AN B) — C. 


. Ifp,q, and rare true, find the truth value of the proposition 


(PV qg) \7((7p Ar)V q). 


. Restate the proposition “A necessary condition for Leah to 


get an A in discrete mathematics is to study hard” in the 
form of a conditional proposition. 


. Write the converse and contrapositive of the proposition of 


Exercise 3. 


. If AUB =B, what relation must hold between A and B? 
. Are the sets 


{3,252} {x | x is an integer and | < x < 3} 


equal? Explain. 


. Determine whether the following argument is valid. 


pomqvr 
pV~q 
rVq 


od 


. If p is true and q and r are false, find the truth value of the 


proposition (p V q) > =r. 


. Write the following argument symbolically and determine 


whether it is valid. If the Skyscrapers win, I’ll eat my hat. 
If I eat my hat, I’1l be quite full. Therefore, if ’'m quite full, 
the Skyscrapers won. 


Is the statement 


The team won the 2006 National Basketball 
Association championship 


a proposition? Explain. 

Is the statement of Exercise 10 a propositional function? 
Explain. 

Let K(x, y) be the propositional function “x knows y.” The 
domain of discourse is the Cartesian product of the set of 
students taking discrete math with itself (i.e., both x and y 
take on values in the set of students taking discrete math). 
Represent the assertion “someone does not know anyone” 
symbolically. 


Write the negation of the assertion of Exercise 12 symboli- 
cally and in words. 


14 
15 
16 


17 


18. 


19. 
20. 


21. 


22. 


23. 


24. 


. IfA = {a, b,c}, how many elements are in P(A) x A? 
. Write the truth table of the proposition =(p A q) V (pV =r). 


. Formulate the proposition p A (—g V r) in words using 


p: I take hotel management. 
q: [take recreation supervision. 
r: [take popular culture. 


. Assume that a,b, and c are real numbers. Represent the 
statement 


a<bor(b<canda>c) 
symbolically, letting 


p:a<b, q: b<c, ri a<c. 
Let P(n) be the statement n and n + 2 are prime. 
In Exercises 18 and 19, write the statement in words and 


tell whether it is true or false. 


Vn P(n) 
dn P(n) 
Which rule of inference is used in the following argument? 
If the Skyscrapers win, [ll eat my hat. If I eat my hat, I'll 


be quite full. Therefore, if the Skyscrapers win, I'll be quite 
full. 


Give an argument using rules of inference to show that the 
conclusion follows from the hypotheses. 


Hypotheses: If the Council approves the funds, then New 
Atlantic will get the Olympic Games. If New Atlantic gets 
the Olympic Games, then New Atlantic will build a new 
stadium. New Atlantic does not build a new stadium. Con- 
clusion: The Council does not approve the funds, or the 
Olympic Games are canceled. 

Determine whether the statement Vxdy(x = y°) is true or 
false. The domain of discourse is R x R. Explain your an- 
swer. Explain, in words, the meaning of the statement. 


Use the generalized De Morgan’s laws for logic to write the 
negation of VxdyVz P(x, y, Z). 


Represent the statement 
If(a>corb <c), thnb>c 


symbolically using the definitions of Exercise 17. 


Chapter 1 Computer Exercises 


In Exercises 1-6, assume that a set X of n elements is represented 
as an array A of size at leastn + 1. The elements of X are listed 
consecutively in A starting in the first position and terminating 
with 0. Assume further that no set contains 0. 


1 


. Write a program to represent the sets X UY, XN Y, X—Y, 
and X A Y, given the arrays representing X and Y. (The sym- 
metric difference is denoted A.) 


. Write a program to determine whether X C Y, given arrays 
representing X and Y. 


. Write a program to determine whether X = Y, given arrays 
representing X and Y. 


. Assuming a universe represented as an array, write a 
program to represent the set X, given the array represent- 
ing X. 

. Given an element E and the array A that represents X, write 
a program that determines whether E € X. 


. Given the array representing X, write a program that lists all 
subsets of X. 


10. 
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. Write a program that reads a logical expression in p and q 


and prints the truth table of the expression. 


. Write a program that reads a logical expression in p, g, and 


r and prints the truth table of the expression. 


. Write a program that tests whether two logical expressions 


in p and q are logically equivalent. 


Write a program that tests whether two logical expressions 
in p, q, and r are logically equivalent. 


2.1 Mathematical Systems, 
Direct Proofs, and 
Counterexamples 

2.2 More Methods of Proof 

12.3. Resolution Proofs 

2.4 Mathematical Induction 


2.5 Strong Form of 
Induction and the 
Well-Ordering Property 


Go Online 


For more on proofs, see 
goo.gl/gHgyey 
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Chapter 2 


PROOFS 


This chapter uses the logic in Chapter | to discuss proofs. Logical methods are used 
in mathematics to prove theorems and in computer science to prove that programs do 
what they are alleged to do. Suppose, for example, that a student is assigned a program to 
compute shortest paths between cities. The program must be able to accept as input an ar- 
bitrary number of cities and the distances between cities directly connected by roads and 
produce as output the shortest paths (routes) between each distinct pair of cities. After 
the student writes the program, it is easy to test it for a small number of cities. Using pa- 
per and pencil, the student could simply list all possible paths between pairs of cities and 
find the shortest paths. This brute-force solution could then be compared with the output 
of the program. However, for a large number of cities, the brute-force technique would 
take too long. How can the student be sure that the program works properly for large 
input—almost surely the kind of input on which the instructor will test the program? 
The student will have to use logic to prove that the program is correct. The proof might 
be informal or formal using the techniques presented in this chapter, but a proof will be 
required. 

After introducing some context and terminology in Section 2.1, we devote the 
remainder of this chapter to various proof techniques. Sections 2.1 and 2.2 introduce 
several proof techniques that build directly on the material in Chapter 1. Resolution, the 
topic of Section 2.3, is a special proof technique that can be automated. Sections 2.4 
and 2.5 are concerned with mathematical induction, a proof technique especially useful 
in discrete mathematics and computer science. 


iThis section can be omitted without loss of continuity. 
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2.1 Mathematical Systems, Direct Proofs, 
and Counterexamples 


A mathematical system consists of axioms, definitions, and undefined terms. Axioms 
are assumed to be true. Definitions are used to create new concepts in terms of existing 
ones. Some terms are not explicitly defined but rather are implicitly defined by the 
axioms. Within a mathematical system we can derive theorems. A theorem is a proposi- 
tion that has been proved to be true. Special kinds of theorems are referred to as lemmas 
and corollaries. A lemma is a theorem that is usually not too interesting in its own right 
but is useful in proving another theorem. A corollary is a theorem that follows easily 
from another theorem. 

An argument that establishes the truth of a theorem is called a proof. Logic is a tool 
for the analysis of proofs. In this section and the next, we introduce some general meth- 
ods of proof. In Sections 2.3—2.5, we discuss resolution and mathematical induction, 
which are special proof techniques. We begin by giving some examples of mathematical 
systems. 


Example 2.1.1 Euclidean geometry furnishes an example of a mathematical system. Among the 
——= axioms are 


= Given two distinct points, there is exactly one line that contains them. 


m Given a line and a point not on the line, there is exactly one line parallel to the line 
through the point. 


The terms point and line are undefined terms that are implicitly defined by the 
axioms that describe their properties. 
Among the definitions are 


= Two triangles are congruent if their vertices can be paired so that the corresponding 
sides and corresponding angles are equal. 


= Two angles are supplementary if the sum of their measures is 180°. < 


Example 2.1.2 The real numbers furnish another example of a mathematical system. Among the axioms 
are 


= For all real numbers x and y, xy = yx. 


= There is a subset P of real numbers satisfying 


(a) If x and y are in P, then x + y and xy are in P. 


(b) If x is a real number, then exactly one of the following statements is true: 
xis in P, x=0, —x isin P. 


Multiplication is implicitly defined by the first axiom and others that describe the prop- 
erties multiplication is assumed to have. 
Among the definitions are 


= The elements in P (of the preceding axiom) are called positive real numbers. 


= The absolute value |x| of a real number x is defined to be x if x is positive or 0 and 
—x otherwise. < 


We give several examples of theorems, corollaries, and lemmas in Euclidean 
geometry and in the system of real numbers. 
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Example 2.1.3 


Example 2.1.4 


Example 2.1.5 


Example 2.1.6 


Example 2.1.8 


Example 2.1.9 


Example 2.1.10 


Examples of theorems in Euclidean geometry are 


= If two sides of a triangle are equal, then the angles opposite them are equal. 


= If the diagonals of a quadrilateral bisect each other, then the quadrilateral is a 
parallelogram. < 


An example of a corollary in Euclidean geometry is 
= Ifa triangle is equilateral, then it is equiangular. 


This corollary follows immediately from the first theorem of Example 2.1.3. < 


Examples of theorems about real numbers are 


m= x - 0 =0 for every real number x. 


= For all real numbers x, y, and z, if x < y and y < z, then x < z. < 


An example of a lemma about real numbers is 
= Ifn is a positive integer, then either n — | is a positive integer or n — 1 = 0. 


Surely this result is not that interesting in its own right, but it can be used to prove 
other results. < 


Direct Proofs 


Theorems are often of the form 
For all x1, 2, ...,Xn, tf p(X, X2,--.,X), then Gg), Xo, ..., Xn). 
This universally quantified statement is true provided that the conditional proposition 
if p(x, X2,.--,Xn), then gv, X2,..., Xn) (2.1.1) 


is true for all x1, .x2,..., 2%, in the domain of discourse. To prove (2.1.1), we assume 
that x1, .X2,...,X, are arbitrary members of the domain of discourse. If p(x), x2, ..., Xn) 
is false, by Definition 1.3.3, (2.1.1) is vacuously true; thus, we need only consider the 
case that p(x1, X2,...,Xn) is true. A direct proof assumes that p(x), x2,..., Xn) is true 
and then, using p(x;, x2, ...,X,) aS well as other axioms, definitions, previously derived 
theorems, and rules of inference, shows directly that q(x), x2, ..., Xn) is true. 

Everyone “knows” what an even or odd integer is, but the following definition 
makes these terms precise and provides a formal way to use the terms “even integer” 
and “odd integer” in proofs. 


Definition 2.1.7 >» An integer n is even if there exists an integer k such that 
n = 2k. An integer n is odd if there exists an integer k such that n = 2k + 1. < 


The integer n = 12 is even because there exists an integer k (namely k = 6) such that 


n = 2k; that is, 12 =2 - 6. < 
The integer n = —21 is odd because there exists an integer k (namely k = —11) such 
that n = 2k + 1; that is, -21 = 2(-11) + 1. < 


Give a direct proof of the following statement. For all integers m and n, if m is odd and 
nis even, then m+ n is odd. 


Example 2.1.11 
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SOLUTION Discussion Ina direct proof, we assume the hypotheses and derive the 
conclusion. A good start is achieved by writing out the hypotheses and conclusion so 
that we are clear where we start and where we are headed. In the case at hand, we have 


mis odd and n is even. (Hypotheses) 


m+ nis odd. (Conclusion) 


The gap (---) represents the part of the proof to be completed that leads from the hy- 
potheses to the conclusion. 

We can begin to fill in the gap by using the definitions of “odd” and “even” to 
obtain 


mis odd and n is even. (Hypotheses) 


There exists an integer, say k,, such that m = 2k; + 1. (Because m is odd) 
There exists an integer, say kz, such that n = 2k. (Because n is even) 


m+ nis odd. (Conclusion) 


(Notice that we cannot assume that kj = kz. For example if m = 15 andn = 4, then 
k, = 7 and ky = 2. That k, is not necessarily equal to ky is the reason that we must 
denote the two integers with different symbols.) 

The missing part of our proof is the argument to show that m+ n is odd. How can 
we reach this conclusion? We can use the definition of “odd” again if we can show that 
m + nis equal to 


2 x some integer + 1. (2.1.2) 


We already know that m = 2k; + 1 and n = 2ky. How can we use these facts to reach 
our goal (2.1.2)? Since the goal involves m + n, we can add the equations m = 2k; + 1 
and n = 2k» to obtain a fact about m + n, namely, 


m+n = (2k; + 1) + 2k». 


Now this expression is supposed to be of the form (2.1.2). We can use a little algebra to 
show that it is of the desired form: 


me n= 2h + 1) +2 = 2h 4a) 1, 


We have our proof. 


Proof Let mand be arbitrary integers, and suppose that m is odd and n is even. We 
prove that m + n is odd. By definition, since m is odd, there exists an integer k, such 
that m = 2k; + 1. Also, by definition, since n is even, there exists an integer kz such that 
n = 2k». Now the sum is 


m+n = (2k; +1) + (2k) = 2k) tho) +1. 


Thus, there exists an integer k (namely k = k; +k) such that m+n = 2k+1. Therefore, 
m-+nis odd. qd < 


Give a direct proof of the following statement. For all sets X, Y, and Z, XN (Y — Z) = 
(XN Y)-—(XNZ). 
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SOLUTION Discussion The outline of the proof is 
X, Y, and Z are sets. (Hypothesis) 


XN (Y¥ —Z) = (XN Y) — (XN Z) (Conclusion) 


The conclusion asserts that the two sets X N (Y — Z) and (XN Y) — (X 9 Z) are equal. 
Recall (see Section 1.1) that to prove from the definition of set equality that these sets 
are equal, we must show that for all x, 


ifxe XN(Y —Z), thenx € (XN Y) — (XNZ) (2.1.3) 
and 


ifx € (XNY)—(XNZ), thenx eXN(Y—Z). (2.1.4) 


Thus our proof outline becomes 


X, Y, and Z are sets. (Hypothesis) 

IfxeXN(Y—Z), thenx € (XN Y)— (XZ). 
Ifx e (XN Y)— (XZ), thenx e XN (Y —Z). 
XN (¥ —Z) = (XN Y) — (XN Z) (Conclusion) 


We should be able to use the definitions of intersection (N) and set difference (—) to 
complete the proof. 

To prove (2.1.3), we begin by assuming that (the arbitrary element) x is in X 1 
(Y —Z). Because this latter set is an intersection, we immediately deduce that x € X and 
x € Y — Z. The proof proceeds in this way. As one constructs the proof, it is essential 
to keep the goal in mind: x € (X 1 Y) — (XZ). To help guide the construction of 
the proof, it may be helpful to translate the goal using the definition of set difference: 
xe (XN Y)— (XZ) meansx e XN Yandx g XN Z. 


Proof Let X, Y, and Z be arbitrary sets. We prove 


XN(Y¥—Z) =(XNY)—(XNZ) 


by proving (2.1.3) and (2.1.4). 

To prove equation (2.1.3), letx ¢ XM (Y — Z). By the definition of intersection, 
x € X andx € Y — Z. By the definition of set difference, since x € Y — Z, x € Y and 
x ¢ Z. By the definition of intersection, since x € X andx € Y,x e XN Y. Again by 
the definition of intersection, since x ¢ Z, x ¢ X 1 Z. By the definition of set difference, 
since x € XM Y, butx ¢ XNZ, x € (XN Y)— (XNMZ). We have proved equation (2.1.3). 

To prove equation (2.1.4), let x € (X MY) — (XM Z). By the definition of set 
difference, x € XM Y and x ¢ XM Z. By the definition of intersection, since x € XN Y, 
x € X and x € Y. Again, by the definition of intersection, since x ¢ XM Z and x € X, 
x ¢ Z. By the definition of set difference, since x € Y andx ¢ Z,x € Y — Z. Finally, 
by the definition of intersection, since x € X andx € Y—Z,x € XM (Y — Z). We have 
proved equation (2.1.4). 

Since we have proved both equations (2.1.3) and (2.1.4), it follows that 


XN(Y¥—-Z) =(XN¥Y)—(XNZ). d< 


Our next example shows that in constructing a proof, we may find that we need 
some auxiliary results, at which point we pause, go off and prove these auxiliary results, 
and then return to the main proof. We call the proofs of auxiliary results subproofs. (For 
those familiar with programming, a subproof is similar to a subroutine.) 
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Example 2.1.12 Ifa and b are real numbers, we define min{a, b} to be the minimum of a and b or the 


Example 2.1.13 


common value if they are equal. More precisely, 


a ifa<b 
min{a,b}=<a ifa=b 
b ifb<a. 


Give a direct proof of the following statement. For all real numbers d, d1, d2, x, 


if d = min{d,, d>} and x < d, then x < d, and x < d. 


SOLUTION Discussion The outline of the proof is 
d = min{d, dy} and x < d (Hypotheses) 


x < d; and x < dy» (Conclusion) 


To help understand what is being asserted, let us look at a specific example. As we 
have remarked previously, when we are asked to prove a universally quantified statement, 
a specific example does not prove the statement. It may, however, help us to understand 
the statement. 

Let us set dj = 2 and d) = 4. Then d = min{d), d)} = 2. The statement to be 
proved says that if x < d (= 2), then x < d; (= 2) and x < dz (= 4). Why is this true in 
general? The minimum d of two numbers, d; and dp, is equal to one of the two numbers 
(namely, the smallest) and less than or equal to the other one (namely, the largest)—in 
symbols, d < d; andd < d). If x < d, then from x < dandd < d,, we may deduce 
x < d,. Similarly, from x < d and d < dz, we may deduce x < d,. Thus the outline of 
our proof becomes 


d = min{d, d>} and x < d (Hypotheses) 
Subproof: Show that d < d; andd < dp. 
From x < dandd < dj, deduce x < d. 
From x < dandd < dp, deduce x < dp. 


x < d, and x < dy (Conclusion—uses the conjunction inference rule) 


At this point, the only part of the proof that is missing is the subproof to show 
that d < d, and d < db. Let us look at the definition of “minimum.” If dj < do, then 
d = min{d,, d>} = d, andd = d, < d). If d> < d,, thend = min{d), dy} = d» and 
d =d) < d\. Ineither case, d < d; andd < d). 


Proof Let d, d,, dy, and x be arbitrary real numbers, and suppose that 
d = min{d,, d>} and x < d. 
We prove that x < d; and x < dp. 

We first show that d < d, and d < d. From the definition of “minimum,” if d; < 
d>, then d = min{d), do} = d; andd = d, < d. If dy < d), thend = min{d), dh} = dh 
and d = dy < d,. In either case, d < d; andd < d). From x < dandd < d,, it follows 
that x < d, from a previous theorem (the second theorem of Example 2.1.5). Fromx < d 


and d < dj, we may derive x < d, from the same previous theorem. Therefore, x < d; 
and x < do. <q < 


There are frequently many different ways to prove a statement. We illustrate by giving 
two proofs of the statement 


XU(Y—-—X)=XUY forall sets X and Y. 
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Example 2.1.14 


Example 2.1.15 


SOLUTION Discussion We first give a direct proof like the proof in Example 2.1.11. 
We show that for all x, ifx ¢ X U(Y — X), thenx € X UY, andifx € X UY, then 
xeXU(Y—-X). 

Our second proof uses Theorem 1.1.22, which gives laws of sets. The idea is to 
begin with X U (Y — X) and use the laws of sets, which here we think of as rules to 
manipulate set equations, to obtain X U Y. 


Proof [First proof] We show that for all x, if x e¢ X U (Y — X), then x € X UY, and 
ifxe XUY,thenx € XU(Y—-X). 

Let x € XU(Y¥Y —X). Thenx € Xorx ¢ Y—X.Ifx € X, thenx € X UY. If 
xe Y—X, then x € Y, so againx € X UY. Ineither case,x e XU Y. 

Letx e XUY. Thenx € X orx € Y. Ifx € X, thenx e XU(Y —X). Ifx ¢ X, then 
x € Y.Inthis case, x € Y—X. Therefore, x € XU(Y—X). In either case, x € XU(Y—X). 
The proof is complete. < 


Proof [Second proof] We use Theorem 1.1.22, which gives laws of sets, and the fact 
that Y — X¥ = YX, which follows immediately from the definition of set difference. 
Letting U denote the universal set, we obtain 


XU(Y-—X) =XU(YNX) [Y-X=YnNX] 
= (XU Y)N (X UX) [Distributive law; Theorem 1.1.22, part (c)] 
=(XUYNU {Complement law; Theorem 1.1.22, part (e)] 
=XUY {Identity law; Theorem 1.1.22, part (d)]. <q < 


Disproving a Universally Quantified Statement 


Recall (see Section 1.5) that to disprove VxP(x) we simply need to find one mem- 
ber x in the domain of discourse that makes P(x) false. Such a value for x is called a 
counterexample. 


The statement Vn € Z* (2” + 1 is prime) is false. A counterexample is n = 3 since 
23 + 1 = 9, which is not prime. < 
If the statement 


(AN B)UC=AN (BUC), forall sets A, B, and C 


is true, prove it; otherwise, give a counterexample. 


SOLUTION Let us begin by trying to prove the statement. We will first try to show that 
ifx € (AN B)UC, thenx € AN (BUC). Ifx € (ANB) UC, then 


xXxE€ANB or xEC. (2.1.5) 


We have to show that x € AM (BU C), that is, 


x€A and xe Buc. (2.1.6) 


Statement (2.1.5) is true if x is in C, and statement (2.1.6) is false if x ¢ A. Thus the given 
statement is false; there is no direct proof (or any other proof!). If we choose sets A and 
C so that there is an element that is in C, but not in A, we will have a counterexample. 
Let A = {1,2,3}, B = {2,3,4}, and C = {3,4, 5} so that there is an element 
that is in C, but not in A. Then (AN B) UC = {2,3,4,5}, AN (BUC) = {2, 3}, and 
(ANB)UC ZAN (BUC). Thus A, B, and C provide a counterexample that shows that 
the given statement is false. < 
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2.1 Problem-Solving Tips 


= To construct a direct proof of a universally quantified statement, first write down 
the hypotheses (so you know what you are assuming), and then write down the 
conclusion (so you know what you must prove). The conclusion is what you will 
work toward—something like the answer in the back of the book to an exercise, 
except here it is essential to know the goal before proceeding. You must now give 
an argument that begins with the hypotheses and ends with the conclusion. To 
construct the argument, remind yourself what you know about the terms (e.g., 
“even,” “odd’), symbols (e.g., X N Y, min{d), d2}), and so on. Look at relevant 
definitions and related results. For example, if a particular hypothesis refers to 
an even integer n, you know that n is of the form 2k for some integer k. If you 
are to prove that two sets X and Y are equal from the definition of set equality, 
you know you must show that for every x, ifx € X thenx € Y, andifx € Y 
then x € X. 


= To understand what is to be proved, look at some specific values in the domain of 
discourse. When we are asked to prove a universally quantified statement, showing 
that the statement is true for specific values does not prove the statement; it may, 
however, help to understand the statement. 


= To disprove a universally quantified statement, find one element in the domain of 
discourse, called a counterexample, that makes the propositional function false. 
Here, your proof consists of presenting the counterexample together with justifi- 
cation that the propositional function is indeed false for your counterexample. 


m= When you write up your proof, begin by writing out the statement to be proved. 
Indicate clearly where your proof begins (e.g., by beginning a new paragraph or by 
writing “Proof.”’). Use complete sentences, which may include symbols. For ex- 
ample, it is perfectly acceptable to write: Thus x € X. In words, this is the complete 
sentence: Thus x is in X. End a direct proof by clearly stating the conclusion, and, 
perhaps, giving a reason to justify the conclusion. For example, Example 2.1.10 
ends with: 


Thus, there exists an integer k (namely k = k; + kz) such that 
m+n = 2k-+ 1. Therefore, m+n is odd. 


Here the conclusion (m+ n is odd) is clearly stated and justified by the statement 
m+n=2k+1. 


m Alert the reader where you are headed. For example, if you are going to prove that 
X = Y, write “We will prove that X = Y” before launching into this part of the 
proof. 


= Justify your steps. For example, if you conclude that x € X or x € Y because it 
is known that x € X U Y, write “Since x € X UY, x € X orx € Y,” or perhaps 
even “Since x € X U Y, by the definition of union x € X or x € Y” if, like Richard 
Nixon, you want to be perfectly clear. 


= If you are asked to prove or disprove a universally quantified statement, you can 
begin by trying to prove it. If you succeed, you are finished—the statement is true 
and you proved it! If your proof breaks down, look carefully at the point where it 
fails. The given statement may be false and your failed proof may give insight into 
how to construct a counterexample (see Example 2.1.15). On the other hand, if you 
have trouble constructing a counterexample, check where your proposed examples 
fail. This insight may show why the statement is true and guide construction of a 
proof. 
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Some Common Errors 


In Example 2.1.10, we pointed out that it is an error to use the same notation for two 
possibly distinct quantities. As an example, here is a faulty “proof” that for all m and 
n, if m and n are even integers then mn is a square (i.e., mn = a” for some integer a): 
Since m and n are even, m = 2k andn = 2k. Now mn = (2k)(2k) = (2k). If we let 
a = 2k, then m = a’. The problem is that we cannot use k for two potentially different 
quantities. If m and n are even, all we can conclude is that m = 2k, and n = 2k) for 
some integers k; and kz. The integers k, and kz need not be equal. (In fact, it is false that 
for all m and n, if m and n are even integers then mn is a square. A counterexample is 
m=2andn = 4.) 

Given a universally quantified propositional function, showing that the proposi- 
tional function is true for specific values in the domain of discourse is not a proof that 
the propositional function is true for all values in the domain of discourse. (Such specific 
values may, however, suggest that the propositional function might be true for all values 
in the domain of discourse.) Example 2.1.10 is to prove that for all integers m and n, 


if mis odd and n is even, then m + n is odd. (2.1.7) 


Letting m = 11 andn = 4 and noting that m+n = 15 is odd does not constitute a proof 
that (2.1.7) is true for all integers m and n, it merely proves that (2.1.7) is true for the 
specific values m = 11 andn = 4. 

In constructing a proof, you cannot assume what you are supposed to prove. As 
an example, consider the erroneous “proof” that for all integers m and n, if m andm-+n 
are even, then n is even: Let m = 2k; andn = 2k. Then m+n = 2k, + 2k. Therefore, 


n= (m+n) —m = (2k; + 2kr) — 2ky = 2(ky + ky — ho). 


Thus n is even. The problem with the preceding “proof” is that we cannot write n = 2k 
since this is true if and only if n is even—which is what we are supposed to prove! This 
error is called begging the question or circular reasoning. [It is true that if m and m-+n 
are even integers, then n is even (see Exercise 12).] 


2.1 Review Exercises 


= 


. What is a mathematical system? 
. What is an axiom? 


. What is a definition? 


What is an undefined term? 


. What is a theorem? 


. What is a proof? 


2.1 Exercises 


hs 


7. What is a lemma? 

8. What is a direct proof? 

9. What is the formal definition of “even integer’’? 
. What is the formal definition of “odd integer’’? 
. What is a subproof? 


. How do you disprove a universally quantified statement? 


Give an example (different from those of Example 2.1.1) of an 
axiom in Euclidean geometry. 


. Give an example (different from those of Example 2.1.2) of an 


axiom in the system of real numbers. 


. Give an example (different from those of Example 2.1.1) of a 


definition in Euclidean geometry. 


. Give an example (different from those of Example 2.1.2) of a 


definition in the system of real numbers. 


. Give an example (different from those of Example 2.1.3) of a 


theorem in Euclidean geometry. 


. Give an example (different from those of Example 2.1.5) of a 


theorem in the system of real numbers. 


. Prove that for all integers m and n, if m and n are even, then 


m + nis even. 


. Prove that for all integers m and n, if m and n are odd, then 


m + nis even. 


16. 


17. 


18. 


19. 


20. 


21. 
22. 
23. 


24. 


25. 


26. 
27. 


28. 
29. 
30. 
31. 
32. 
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. Prove that for all integers m and n, if m and n are even, then 


mn is even. 


. Prove that for all integers m and n, if m and n are odd, then mn 


is odd. 


. Prove that for all integers m and n, if m is odd and n is even, 


then mn is even. 


. Prove that for all integers m and n, if m and m +4 n are even, 


then n is even. 


. Prove that for all rational numbers x and y, x + y is rational. 
. Prove that for all rational numbers x and y, xy is rational. 


. Prove that for every rational number x, if x # 0, then 1/x is 


rational. 


Prove that the product of two integers, one of the form 3k; + 1 
and the other of the form 3k2 +2, where k, and kp are integers, 
is of the form 3k3 + 2 for some integer k3. 


Prove that the product of two integers, one of the form 3k; +2 
and the other of the form 3k2 +2, where k, and kp are integers, 
is of the form 3k3 + | for some integer k3. 


If aand b are real numbers, we define max {a, b} to be the max- 
imum of a and b or the common value if they are equal. Prove 
that for all real numbers d, d, do, x, 


if d = max{d,, do} and x > d, then x > d, and x > do. 


Justify each step of the following direct proof, which shows 
that if x is a real number, then x-0 = 0. Assume that the fol- 
lowing are previous theorems: If a, b, and c are real numbers, 
thenb+0 = band a(b+c) = ab+ac.Ifa+b=a-+c, then 
b= 


Proof +x-0+0=<x-0 =x-(0+0) =x-0+-x-0; there- 
fore, x-0 = 0. < 
If X and Y are nonempty sets and X x Y = Y x X, what can 
we conclude about X and Y? Prove your answer. 


Prove that XM Y C X for all sets X and Y. 
Prove that X C X UY for all sets X and Y. 


Prove that if X C Y, then X UZ C YU Z for all sets X, Y, 
and Z. 


Prove that if X C Y, then XM Z C YQ Z for all sets X, Y, 
and Z. 


Prove that if X C Y, then Z — Y C Z — X for all sets X, Y, 
and Z. 


Prove that if X C Y, then Y—(Y—X) = X for all sets X and Y. 


Prove that ifX N Y= XN Zand X UY =XUZ, then Y =Z 
for all sets X, Y, and Z. 


Prove that P(X) U P(Y) C P(X U Y) for all sets X and Y. 
Prove that P(X N Y) = P(X) N P(Y) for all sets X and Y. 
Prove that if P(X) C P(Y), then X C Y for all sets X and Y. 
Disprove that P(X U Y) C P(X) U P(Y) for all sets X and Y. 


Give a direct proof along the lines of the second proof in Ex- 
ample 2.1.13 of the statement 


XN (¥—Z)=(XNY)— (XZ) forall sets X, Y, and Z. 


(In Example 2.1.11, we gave a direct proof of this statement 
using the definition of set equality.) 


In each of Exercises 33-45, if the statement is true, prove it; other- 
wise, give a counterexample. The sets X, Y, and Z are subsets of a 
universal set U. Assume that the universe for Cartesian products 
isU x U. 


33. For all sets X and Y, either X is a subset of Y or Y is a subset 
of X. 

34. X U(Y — Z) = (X UY) — (X UZ) for all sets X, Y, and Z. 

35. Y—X =X UY forall sets X and Y. 

36. Y-—Z=(XUY) — (X UZ) for all sets X, Y, and Z. 

37. X — (YUZ) = (X — Y) UZ for all sets X, Y, and Z. 

38. X — Y = Y — X forall sets X and Y. 

39. XNY CX for all sets X and Y. 

40. (XN Y)U(Y — X) =Y for all sets X and Y. 

41. X x (YUZ) = (X x Y) U(X x Z) for all sets X, Y, and Z. 

42. Xx Y =X x Y forall sets X and Y. 

43. X x (Y —Z) = (X x Y) — (X x Z) for all sets X, Y, and Z. 

44. X —(Y x Z) = (X — Y) x (X — Z) for all sets X, Y, and Z. 

45. XN (¥ x Z) = (XN Y) x (XZ) for all sets X, Y, and Z. 


46. Prove the associative laws for sets [Theorem 1.1.22, part (a)]. 


47. Prove the commutative laws for sets [Theorem 1.1.22, 
part (b)]. 
48. Prove the distributive laws for sets [Theorem 1.1.22, part (c)]. 
49. Prove the identity laws for sets [Theorem 1.1.22, part (d)]. 
50. Prove the complement laws for sets [Theorem 1.1.22, part (e)]. 
51. Prove the idempotent laws for sets [Theorem 1.1.22, part (f)]. 
52. Prove the bound laws for sets [Theorem 1.1.22, part (g)]. 
53. Prove the absorption laws for sets [Theorem 1.1.22, part (h)]. 
54, Prove the involution law for sets [Theorem 1.1.22, part (i)]. 
55. Prove the 0/1 laws for sets [Theorem 1.1.22, part (j)]. 
56. Prove De Morgan’s laws for sets [Theorem 1.1.22, part (k)]. 
In Exercises 57-65, A denotes the symmetric difference operator 
defined as A A B = (AU B) — (AM B), where A and B are sets. 
57. Prove that A A B = (A — B) U(B—A) for all sets A and B. 
58. Prove that (A A B) AA = B for all sets A and B. 


x59. Prove or disprove: If A, B, and C are sets satisfying A A C = 
BAC,thenA=B. 
60. Prove or disprove: A A (BU C) = (A A B) U(AA C) for all 
sets A, B, and C. 
61. Prove or disprove: A A (BNC) = (AA B)N (AAC) for all 
sets A, B, and C. 
62. Prove or disprove: A U (B A C) = (AUB) A (AUC) for all 
sets A, B, and C. 
63. Prove or disprove: AN (B A C) = (AN B) A (ANC) for all 
sets A, B, and C. 


64. Is A commutative? If so, prove it; otherwise, give a counter- 
example. 


x65. Is A associative? If so, prove it; otherwise, give a counter- 
example. 
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2.2 


Example 2.2.1 


More Methods of Proof 


In this section, we discuss several more methods of proof: proof by contradiction, proof 
by contrapositive, proof by cases, proofs of equivalence, and existence proofs. We will 
find these proof techniques of use throughout this book. 


Proof by Contradiction 


A proof by contradiction establishes p + q by assuming that the hypothesis p is true 
and that the conclusion q is false and then, using p and —q as well as other axioms, 
definitions, previously derived theorems, and rules of inference, derives a contradiction. 
A contradiction is a proposition of the form r A —r (r may be any proposition whatever). 
A proof by contradiction is sometimes called an indirect proof since to establish p > q 
using proof by contradiction, we follow an indirect route: We derive r A —r and then 
conclude that gq is true. 

The only difference between the assumptions in a direct proof and a proof by con- 
tradiction is the negated conclusion. In a direct proof the negated conclusion is not as- 
sumed, whereas in a proof by contradiction the negated conclusion is assumed. 

Proof by contradiction may be justified by noting that the propositions p > q and 
(p A 7q) > (r A =r) are equivalent. The equivalence is immediate from a truth table: 


Pp qd r|p>q pPA7q rA7r (p A 7q) > (rA nr) 
T He T Ak F F T 
aL T F T F F T 
T F T F T F F 
T F F F T F F 
F T T Tr F F T 
F T F T F F — 
F F T T F F T 
F F F T F F T 


Give a proof by contradiction of the following statement: 


For every n € Z, if n2 is even, then n is even. 


SOLUTION Discussion First, let us consider giving a direct proof of this statement. 
We would assume the hypothesis, that is, that n? is even. Then there exists an integer ky 
such that n? = 2k,. To prove that n is even, we must find an integer ky such that n = 2k». 
It is not clear how to get from n? = 2k; ton = 2k). (Taking the square root certainly 
does not work!) When a proof technique seems unpromising, try a different one. 

In a proof by contradiction, we assume the hypothesis (n? is even) and the negation 
of the conclusion (n is not even, i.e., m is odd). Since n is odd, there exists an integer k 
such that n = 2k + 1. If we square both sides of this last equation, we obtain 


w= (2k+ 1) =4? 4+4k4+1=22P +2441. 


But this last equation tells us that n* is odd. We have our contradiction: n* is even 
(hypothesis) and n? is odd. Formally, if r is the statement “n? is even,” we have de- 
duced r A =r. 


Proof We give a proof by contradiction. Thus we assume the hypothesis n? is even 


and that the conclusion is false n is odd. Since n is odd, there exists an integer k such 
that n = 2k + 1. Now 
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w= (2k4+ 1% =47 4+4k4+1=22P +2K 41. 
Thus n? is odd, which contradicts the hypothesis n” is even. The proof by contradiction 
is complete. We have proved that for every n € Z, if n? is even, thenniseven. <4 <4 


Give a proof by contradiction of the following statement: 


For all real numbers x and y, ifx + y > 2, then either x > 1 ory > 1. 


SOLUTION Discussion As in the previous example, a direct proof seems unpromis- 
ing—assuming only that x + y > 2 appears to be too little to get us started. We turn to a 
proof by contradiction. 


Proof We begin by letting x and y be arbitrary real numbers. We then suppose that 
the conclusion is false, that is, that ~(v > 1 V y > 1) is true. By De Morgan’s laws of 
logic (see Example 1.3.11), 


aAx>lvy> D)=r7AH>S>VY)AAOW z= D=HA0<)Y)AO<). 


In words, we are assuming that x < 1 and y < 1. Using a previous theorem, we may 
add these inequalities to obtain x + y < 1+ 1 = 2. At this point, we have derived a 
contradiction: x + y > 2 and x + y < 2. Thus we conclude that for all real numbers x 
and y, ifx + y > 2, then either x > lory> 1. <q < 


Prove that /2 is irrational using proof by contradiction. 


SOLUTION Discussion Here a direct proof seems particularly bleak. It seems we 
have a blank slate with which to begin. However, if we use proof by contradiction, we 
may assume that /2 is rational. In this case, we know that there exist integers p and 
q such that /2 = p/q. Now we have an entry on our slate. We can manipulate this 
equation and hope to obtain a contradiction. 


Proof We use proof by contradiction and assume that /2 is rational. Then there 
exist integers p and g such that /2 = p/q. We assume that the fraction p/gq is in lowest 
terms so that p and q are not both even. Squaring /2 = p/q gives 2 = p?/q?, and 
multiplying by q* gives 2q? = p’. It follows that p* is even. Example 2.2.1 tells us that 
p is even. Therefore, there exists an integer k such that p = 2k. Substituting p = 2k into 
2q° = p’ gives 2q* = (2k)? = 4k*. Canceling 2 gives q* = 2k’. Therefore q” is even, 
and Example 2.2.1 tells us the g is even. Thus p and q are both even, which contradicts 
our assumption that p and q are not both even. Therefore, V2 is irrational. qd 


Proof by Contrapositive 


Suppose that we give a proof by contradiction of p — q in which, as in Examples 2.2.1 
and 2.2.2, we deduce —p. In effect, we have proved —q — —p. [Recall (see Theorem 
1.3.18) that p — g and -q — —p are equivalent.] This special case of proof by contra- 
diction is called proof by contrapositive. 


Give a proof by contradiction that for any subset S of 26 cards from an ordinary 52-card 
deck (composed of four suits of 13 cards each), there is a suit such that S has at least 
7 cards of that suit. (In the card game bridge, this result says that two partners, who each 
hold 13-card hands, will between them have a suit of at least 7 cards.) 
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SOLUTION Discussion The conclusion is: There is a suit such that S has at least 7 
cards of that suit. If the conclusion is false, it must be that S has at most 6 cards of every 
suit. But, since there are four suits, we can account for at most 6-4 = 24 cards in S. But 
the hypothesis is that S contains 26 cards. Contradiction! We have a proof. 

We conclude the discussion by formally showing how De Morgan’s laws of logic 
(see Example 1.3.1) are used to yield the negated conclusion. We take as our domain of 
discourse, the set of all subsets of cards from an ordinary 52-card deck. One piece of 
notation will be useful. If Su is a suit, we let S(Su) denote the number of cards of suit Su 
in S. For example, the value of $(Club) is the number of clubs in S. 

We are to prove that 


VS, if |S| = 26, then 3 a suit Su such that S(Su) > 7. 


By De Morgan’s laws of logic, 


-=(4 a suit Su such that $(Su) > 7) =V suits Su, S(Su) < 7; 


that is, the negation of the conclusion can be written: S has less than 7 (1.e., at most 6) 
cards of every suit. 

In Example 2.2.8, we will discuss an alternative approach to finding a proof of this 
result. 


Proof Weare given a subset S of 26 cards from an ordinary 52-card deck. Suppose 
by way of contradiction that the conclusion is false; that is, suppose that S has at most 
6 cards of every suit. Since there are four suits, S has at most 6 - 4 = 24 cards. This 
contradicts the fact that S has 26 cards. Therefore, there is a suit such that S has at least 
7 cards of that suit. qa < 


Give a proof by contrapositive to prove that 


for all x € R, if x is irrational, then x is irrational. 


SOLUTION Discussion For much the same reasons that a direct proof seemed un- 
promising in Examples 2.2.1 and 2.2.2, a direct proof in which we assume only that x? is 
irrational seems to be too little to get us started. A proof by contradiction can be devised 
(see Exercise 1), but here a proof by contrapositive is requested. 


Proof We begin by letting x be an arbitrary real number. We prove the contrapositive 
of the given statement, which is 


. . . . ba . . 
if x is not irrational, then x~ is not irrational 
or, equivalently, 


. . . Os . 
if x is rational, then x“ is rational. 


So suppose that x is rational. Then x = p/q for some integers p and g. Now x* = p”/q’. 
Since x? is the quotient of integers, x? is rational. The proof is complete. | 


Proof by Cases 


Proof by cases is used when the original hypothesis naturally divides itself into various 
cases. For example, the hypothesis “x is a real number” can be divided into cases: (a) x 
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is a nonnegative real number and (b) x is a negative real number. Suppose that the task 
is to prove p — q and that p is equivalent to pi V p2 V-+-V Pn (P1, ---, Pn are the cases). 
Instead of proving 


(Pi V p2V+++V Pn) > 4, (2.2.1) 


we prove 


(Pi > DA (P2 > DA ADn > QD: (2.2.2) 


As we will show, proof by cases is justified because the two statements are equivalent. 

First suppose that q is true. Then all the implications in (2.2.1) and (2.2.2) are true 
(regardless of the truth value of the hypotheses). Thus (2.2.1) and (2.2.2) are true. 

Now suppose that q is false. If all the p; are false, then all the implications in 
(2.2.1) and (2.2.2) are true, so (2.2.1) and (2.2.2) are true. If for some j, p; is true, then 
Pi V +++ Pn is true, so (2.2.1) is false. Since pj > q is false, (2.2.2) is false. Thus (2.2.1) 
and (2.2.2) are false. Therefore, (2.2.1) and (2.2.2) are equivalent. 

Sometimes the number of cases to prove is finite and not too large, so we can check 
them all one by one. We call this type of proof exhaustive proof. 


Prove that 2m? + 3n’ = 40 has no solution in positive integers, that is, that 2m? + 
3n* = 40 is false for all positive integers m and n. 


SOLUTION Discussion We certainly cannot check 2m?+3n’ for all positive integers 
mand n, but we can rule out most positive integers because, if 2m? +3n? = 4O, the sizes 
of m and n are restricted. In particular, we must have 2m? < 40 and 3n? < 40. (If, for 
example, 2m? > 40, when we add 3n? to 2m”, the sum 2m? + 3n? will exceed 40.) If 
2m? < 40, then m? < 20 and mcan be at most 4. Similarly, if 3n? < 40, then n? < 40/3 
and n can be at most 3. Thus it suffices to check the cases m = 1, 2,3, 4andn = 1, 2,3. 


Proof If 2m? + 3n? = 40, we must have 2m? < 40. Thus m? < 20 andm < 4. 
Similarly, we must have 3n? < 40. Thus rn? < 40/3 and n < 3. Therefore it suffices to 
check the cases m = 1, 2,3, 4 andn = 1, 2, 3. 

The entries in the table give the value of 2m? + 3n? for the indicated values of m 
and n. 


Since 2m? + 3n? 40 form = 1,2,3,4andn = 1, 2, 3, and 2m? +3n? > 40 form > 4 
or n > 3, we conclude that 2m? + 3n? = 40 has no solution in positive integers. <@ <4 


Example 2.2.7 illustrates an important point: An or statement often leads itself to 
a proof by cases. 


We prove that for every real number x, x < |x|. 


SOLUTION Discussion Since x is a real number, either x > 0 or x < 0. We use 
this or statement to divide the proof into cases. We divide the proof into cases because 
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the definition of absolute value is itself divided into cases x > 0 and x < 0 (see Exam- 
ple 2.1.2). Case 1 is x > 0 and case 2 is x < 0. 


Proof If x > 0, by definition |x| = x. Thus |x| > x. If x < 0, by definition |x| = —x. 
Since |x] = —x > Oand0 > x, |x| > x. In either case, |x| > x; so the proof is 
complete. | 


Example 2.2.8 offers an alternative approach to developing a proof of the result in 
Example 2.2.4. There are often many approaches and proofs of a single result. 


We revisit the problem (see Example 2.2.4) of proving that for any subset S of 26 cards 
from an ordinary 52-card deck, there is a suit such that S has at least 7 cards of that suit. 


Discussion Consider trying a direct proof. How about the club suit? If there are 7 or 
more clubs, we have the desired conclusion. What if there are 6 or fewer clubs? Try 
another suit! Let us try diamonds next. If there are 7 or more diamonds, we have the 
desired conclusion. What if there are 6 or fewer diamonds? Try another suit, and so on. 
If each of the four suits consists of 6 or fewer cards, we cannot deduce the conclusion; 
however, if each suit has 6 or fewer cards, we can account for only 24 cards. But there 
are 26 cards; this case cannot occur. 

The preceding discussion shows that we can divide the proof into two cases. Case 1 
is that some suit consists of 7 or more cards. (Proof complete!) Case 2 is that every suit 
consists of 6 or fewer cards. (We have shown that this case cannot occur.) Since only 
case | can occur, the proof is complete. 

We started by thinking along the lines of a direct proof, which led us to a proof 
by cases. The proof by cases is essentially identical to the proof by contradiction in 
Example 2.2.4. 


Proof Let S be a subset of 26 cards from an ordinary 52-card deck. We consider two 
cases. Case | is that there is a suit such that S has at least 7 or more cards of that suit. 
Case 2 is that S has 6 or fewer cards of every suit. 

If case 1 holds, the proof is complete. Suppose that case 2 holds. Since S$ has 6 
or fewer cards of each of the four suits, S has at most 24 cards. But we are given that 
S has 26 cards. Therefore, case 2 cannot hold. Since only case 1 holds, the proof is 
complete. q< 


Proofs of Equivalence 


Some theorems are of the form p if and only if g. Such theorems are proved by using the 
equivalence (see Example 1.3.15) 


Poqd=P>DAG~D); 
that is, to prove “p if and only if g,” prove “if p then g” and “if g then p.” 
Prove that for every integer n, n is odd if and only if n — 1 is even. 
SOLUTION Discussion We let n be an arbitrary integer. We must prove that 
if n is odd then n — | is even (2.2.3) 
and 


if n — | is even then n is odd. (2.2.4) 
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Proof We first prove (2.2.3). If n is odd, then n = 2k + 1 for some integer k. Now 
n—1=(2k+ 1) —1 = 2k. Therefore, n — 1 is even. 

Next we prove (2.2.4). If n — | is even, then n — 1 = 2k for some integer k. Now 
n = 2k + 1. Therefore, n is odd. The proof is complete. qd 


Some proofs of p < q combine the proofs of p — g and q — p. For example, 
the proof in Example 2.2.9 could be written as follows: 


nis odd if and only if nm = 2k + 1 for some integer k if and only if nm — 1 = 2k for 
some integer k if and only if n — 1 is even. 


For such a proof to be correct, it must be the case that each if-and-only-if statement is 
true. If such a proof of p < q is read in one direction, we obtain the proof of p — q and, 
if it is read in the other direction, we obtain a proof of gq — p. Reading the preceding 
proof in the if-then direction, 


if n is odd, then n = 2k + 1 for some integer k; if m = 2k + 1 for some integer k, 
then n — 1 = 2k for some integer k; if n — 1 = 2k for some integer k, then n — 1 
is even, 


proves that if n is odd, then n — | is even. Reversing the order, 


if n — 1 is even, then n — 1 = 2k for some integer k; if n — 1 = 2k for some 
integer k, then n = 2k + | for some integer k; if n = 2k + 1 for some integer k, 
then n is odd, 


proves that if n — | is even, then n is odd. 


Prove that for all real numbers x and all positive real numbers d, 
|x| < dif and only if —d < x <d. 
SOLUTION Discussion We let x be an arbitrary real number and d be an arbitrary 
positive real number. We must show 
if |x| < dthen-d<x<d (2.2.5) 
and 
if —d < x < d then |x| <d. (2.2.6) 


Since |x| is defined by cases, we expect to use proof by cases. 


Proof To show (2.2.5), we use proof by cases. We assume that |x| < d. If x > 0, 
then —d < 0 < x = |x| < d. Ifx < 0, then —d < 0 < —x = |x| < d; that is, 
—d < —x < d. Multiplying by —1, we obtain d > x > —d. In either case, we have 
proved that —d < x < d. 

To show (2.2.6), we also use proof by cases. We assume that —d < x < d.If 


x > 0, then |x| = x < d. If x < 0, then |x| = —x. Since —d < x, we may multiply by 
—1 to obtain d > —x. Combining |x| = —x and d > —x gives |x| = —x < d. In either 
case, we have proved that |x| < d. The proof is complete. qi 


In proving p <> q, we are proving that p and q are logically equivalent, that is, p 
and g are either both true or both false. Some theorems state that three or more statements 
are logically equivalent and, thus, have the form 
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The following are equivalent: 
(a) — 
(b) — 
(c)— 


Such a theorem asserts that (a), (b), (c), and so on are either all true or all false. 
To prove that p1, p2,..., Pn are equivalent, the usual method is to prove 


(P1 a P2) A (p2 —> P3) ae A (Pn-1 > Pn) A (Pn => Pi). (2.2.7) 


We show that proving (2.2.7) shows that p;, p2,..., Pn are equivalent. 

Suppose that we prove (2.2.7). We consider two cases: p is true, p, is false. First, 
suppose that p; is true. Because p; and py — po are true, po is true; because pz and 
P2 — p3 are true, p3 is true; and so on. In this case, p1, p2,..., Pn have the same truth 
value: each is true. 

Now suppose that p; is false. Because p; is false and p, — py, is true, p, is 
false; because p, is false and py_1 — pp is true, pp— is false; and so on. In this case, 
P\;P2;+++sPn have the same truth value; each is false. Therefore, proving (2.2.7) shows 
that pj, p2,..., Py are equivalent. 

Not just any arrangement of implications along the lines of (2.3.7) will make 
Pi.-++,Pn equivalent. For example, consider pj: 2 = 3, po: 4 = 6, and p3: 8 = 8. 
For the arrangement 


Pi P2, P2—>p3, Pi > P3, 


P1 > P2, P2 > p3, and p; — p3 are all true, but pi, p2, p3 are not equivalent. 
Let A, B, and C be sets. Prove that the following are equivalent: 


(ACB (b)ANB=A_ (c)AUB=B. 


SOLUTION Discussion According to the discussion preceding this example, we must 
prove 


[(a) > (b)] A [(b) > ©] A [(e) > @)]. 


Proof We prove (a) — (b), (b) + (c), and (c) > (a). 

{(a) — (b).] We assume that A C B, and prove that AM B = A. Suppose that 
x € AM B. We must show that x € A. But if x € AN B, x € A by the definition of 
intersection. 

Now suppose that x € A. We must show that x ¢ AM B. Since A C B,x € B. 
Therefore x € AM B. We have proved thatA B =A. 

[(b) — (c).] We assume that A 7 B = A, and prove that A U B = B. Suppose that 
x € AUB. We must show that x € B. By assumption, either x € A orx € B. If x € B, we 
have the desired conclusion. If x € A, since AM B = A, again x € B. 

Now suppose that x € B. We must show that x ¢ AU B. But ifx ¢ B,x e AUB 
by the definition of union. We have proved that AU B = B. 

[(c) — (a).] We assume that AUB = B, and prove that A C B. Suppose that x € A. 
We must show that x € B. Since x € A, x € A UB by the definition of union. Since 
AUB=B,x € B. We have proved that A C B. The proof is complete. <q < 


Example 2.2.12 


Example 2.2.13 


Go Online 

The Great Internet 
Mersenne Prime 
Search is at 
goo.gl/gHgyey 


Example 2.2.14 
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Existence Proofs 
A proof of 


Ax P(x) (2.2.8) 


is called an existence proof. In Section 1.5, we showed that one way to prove (2.2.8) is 
to exhibit one member a in the domain of discourse that makes P(a) true. 


Let a and b be real numbers with a < b. Prove that there exists a real number x satisfying 
a<x<b. 


SOLUTION Discussion We will prove the statement by exhibiting a real number x 
between a and b. The point midway between a and b suffices. 


Proof It suffices to find one real number x satisfying a < x < b. The real number 


a+b 
7 


halfway between a and J, surely satisfies a < x < b. <q < 


Prove that there exists a prime p such that 2”? — | is composite (1.e., not prime). 


SOLUTION Discussion By trial and error, we find that 2? — 1 is prime for p = 
2,3,5,7 but not p = 11 since 2'! — 1 = 2048 — 1 = 2047 = 23 - 89. Thus p = 11 
makes the given statement true. 


Proof For the prime p = 11, 2? — 1 is composite: 


2'! — 1] = 2048 — 1 = 2047 = 23 - 89. <q < 


A prime number of the form 2? — 1, where p is prime, is called a Mersenne 
prime [named for Marin Mersenne (1588—1648)]. It is not known whether the number of 
Mersenne primes is finite or infinite. The largest primes known are Mersenne primes. In 
January 2016, the 49th known Mersenne prime was found, 274,207,281 1 a number hav- 
ing 22,338,618 decimal digits. This number was found by the Great Internet Mersenne 
Prime Search (GIMPS). GIMPS is a computer program distributed over many personal 
computers maintained by volunteers. You can participate. Just check the web link. You 
may find the next Mersenne prime! 

An existence proof of (2.2.8) that exhibits an element a of the domain of discourse 
that makes P(a) true is called a constructive proof. The proofs in Examples 2.2.12 and 
2.2.13 are constructive proofs. A proof of (2.2.8) that does not exhibit an element a of 
the domain of discourse that makes P(a) true, but rather proves (2.2.8) some other way 
(e.g., using proof by contradiction), is called a nonconstructive proof. 


Let 


Sp +S. +---+5n 
n 


A= 


be the average of the real numbers 51, ..., 5). Prove that there exists i such that s; > A. 


SOLUTION Discussion It seems hopeless to choose an i and prove that s; > A; 
instead, we use proof by contradiction. 
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Proof We use proof by contradiction and assume the negation of the conclusion 
—3i(s; > A). By the generalized De Morgan’s laws for logic (Theorem 1.5.14), the 
negation of the conclusion is equivalent to Vi-(s; > A) or Vi(s; < A). Thus we assume 
81 <A, 52 <A,...,5, <A. Adding these inequalities yields 


Sp +5. +---+5y < nA. 
Dividing by n gives 


Sp +52 +--+ +38y 
n 


<A, 


which contradicts the hypothesis 
S152 ++ + Sn 
= ae = 


Ax 
Therefore, there exists i such that s; > A. <q < 


The proof in Example 2.2.14 is nonconstructive; it does not exhibit an i for which 
s; => A. It does however prove indirectly using proof by contradiction that there is such 
ani. We could find such an i: We could check whether s; > A and if true, stop. Otherwise, 
we could check whether sz > A and if true, stop. We could continue in this manner until 
we find an i for which 5; > A. Example 2.2.14 guarantees that there is such an i. 


2.2 Problem-Solving Tips 


It is worth reviewing the Problem-Solving Tips of Section 2.1. Tips specific to the present 
section follow. 


= If you are trying to construct a direct proof of a statement of the form p — q and 
you seem to be getting stuck, try a proof by contradiction. You then have more to 
work with: Besides assuming p, you get to assume gq. 


= When writing up a proof by contradiction, alert the reader by stating, “We give 


a proof by contradiction, thus we assume ---,” where --- is the negation of the 
conclusion. Another common introduction is: Assume by way of contradiction 
that ---. 


= Proof by cases is useful if the hypotheses naturally break down into parts. For 
example, if the statement to prove involves the absolute value of x, you may want 
to consider the cases x > 0 and x < 0 because |x| is itself defined by the cases 
x > Oand x < 0. If the number of cases to prove is finite and not too large, the 
cases can be directly checked one by one. 
In writing up a proof by cases, it is sometimes helpful to the reader to indicate 
the cases, for example, 


[Case I: x > 0.] Proof of this case goes here. 
[Case II: x < 0.] Proof of this case goes here. 


= To prove p if and only if g, you must prove two statements: (1) if p then g and 
(2) if g then p. It helps the reader if you state clearly what you are proving. You 
can write up the proof of (1) by beginning a new paragraph with a sentence that 
indicates that you are about to prove “if p then g.” You would then follow with a 
proof of (2) by beginning a new paragraph with a sentence that indicates that you 
are about to prove “if g then p.” Another common technique is to write 


[p — q.] Proof of p — q goes here. 
[q — p.] Proof of g > p goes here. 


2.2 Review Exercises 


on na un FF & DY 


. What is proof by contradiction? 
. Give an example of a proof by contradiction. 
. What is an indirect proof? 

. What is proof by contrapositive? 
. Give an example of a proof by contrapositive. 
. What is proof by cases? 

. Give an example of a proof by cases. 


. What is a proof of equivalence? 


2.2 Exercises 


nan & WwW NY 


i 


then eitherx > lory>lorz> 1. 


. Prove that for all real numbers x and y, if xy < 2, then either 


a 2ory < V2. 


a rational number x satisfying a < x < b. 
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= To prove that several statements, say pj, ..., Dyn, are equivalent, prove py —> Po, 


P2 — P3,+++5Pn—1 —> Pn» Pn — pi. The statements can be ordered in any way and 
the proofs may be easier to construct for one ordering than another. For example, 
you could swap p2 and p3 and prove pj > p3, P3 —> P2, P2 —> P4, Pa > Ps,---s 
Pn—-1 — Pn> Pn — P1. You should indicate clearly what you are about to prove. 
One common form is 


[p1 — p2.] Proof of p; — p2 goes here. 
[p2 — p3.] Proof of p2 — p3 goes here. 
And so forth. 


If the statement is existentially quantified (1.e., there exists x .. .), the proof, called 
an existence proof, consists of showing that there exists at least one x in the domain 
of discourse that makes the statement true. One type of existence proof exhibits a 
value of x that makes the statement true (and proves that the statement is indeed 
true for the specific x). Another type of existence proof indirectly proves (e.g., 
using proof by contradiction) that a value of x exists that makes the statement true 
without specifying any particular value of x for which the statement is true. 


9. Give an example of a proof of equivalence. 


10. How can we show that three statements, say (a), (b), and (c), 
are equivalent? 


11. What is an existence proof? 

12. What is a constructive existence proof? 

13. Give an example of a constructive existence proof. 
14. What is a nonconstructive existence proof? 


15. Give an example of a nonconstructive existence proof. 


. Use proof by contradiction to prove that for all x € R, if x* is 11. Prove that if a and b are real numbers with a < b, there exists 
irrational, then x is irrational. an irrational number x satisfying a < x < b. 
. Is the converse of Exercise | true or false? Prove your answer. 12. Fill in the details of the following proof that there exist irra- 
. b . : 
. Prove that for all x € R, if x3 is irrational, then x is irrational. tional mumbers @'and b such that a” is rational. 
. Prove that for every n € Z, if n? is odd, then n is odd. Proof Let x = y = V2. If x’ is rational, the proof is 
. Prove that for all real numbers x, y, and z, ifx +y+z>3 complete. (Explain.) Otherwise, suppose that x” is irrational. 


(Why?) Let a = » andb = /2. Consider a”. (How does this 
complete the proof?) < 
Is this proof constructive or nonconstructive? 


13. Prove or disprove: There exist rational numbers a and b such 


3, e * . 
- Prove that ¥/2 is irrational. that a? is rational. What kind of proof did you give? 
. Prove that for all uy E R, if x is rational and y is irrational, 14. Prove or disprove: There exist rational numbers a and b such 
then x + y is irrational, that a is irrational. What kind of proof did you give? 
: Prove of disprove: For all x%y € R, if xis rational and y is 45, Let x and y be real numbers. Prove that if x < y + ¢ for every 
irrational, then xy is irrational. positive real number ¢, then x < y. 
- Prove that if a and b are real numbers with a < b, there exists 16. In American football, a safety counts two points, a field goal 


three points, a touchdown six points, a successful kick imme- 
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17. 


18. 


19. 
20. 
21. 


22. 


23. 


24. 


*25, 


26. 
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diately after a touchdown one point, and a successful run or 
pass immediately after a touchdown two points. What are the 
possible points a team can score? Prove your answer. 


Suppose that a real number a has the property that a” is irra- 
tional for every positive integer n. Prove that a” is irrational 
for every positive rational number r. (There are real numbers 
such as a. The real number e = 2.718..., the base of the nat- 
ural logarithm, has the property that e” is irrational for every 
rational number r. An elementary proof, using only calculus, 
may be found in [Aigner].) 

Abby, Bosco, Cary, Dale, and Edie are considering going to 
hear the Flying Squirrels. If Abby goes, Cary will also go. If 
Bosco goes, Cary and Dale will also go. If Cary goes, Edie will 
also go. If Dale goes, Abby will also go. If Edie goes, Abby 
will also go. Exactly four of the five went to the concert. Who 
went? Prove your result. 


Prove or disprove: (X — Y)N(Y—X) = @ for all sets X and Y. 
Prove or disprove: X x © = © for every set X. 


Show, by giving a proof by contradiction, that if 100 balls are 
placed in nine boxes, some box contains 12 or more balls. 
Show, by giving a proof by contradiction, that if 40 coins are 
distributed among nine bags so that each bag contains at least 
1 coin, at least two bags contain the same number of coins. 


Let S be a subset of 26 cards from an ordinary 52-card deck. 
Suppose that there is a suit in which S has exactly 7 cards. 
Prove that there is another suit in which S has at least 7 cards. 


Let S; be a subset of 26 cards from an ordinary 52-card 
deck, and let S: be the remaining 26 cards. Suppose that 
there is a suit in which S; has at least 9 cards. Prove that 
there is a suit in which S$) has at least 8 cards. (In the 
card game bridge, two pairs compete against each other 
and initially hold 13 cards each. This result says a pair, 
who between them have a suit of at least 9 cards, will 
have opponents who between them have a suit of at least 8 


cards.) 
Let 5}, ..., S, be a sequence! satisfying 


(a) 5s is a positive integer and s,, is a negative integer, 


(b) foralli, 1 <i <n, sj4; =s; +1 or sj41 = 5; — 1. 
Prove that there exists i, 1 < i < n, such that s; = 0. 
Calculus students will recognize this exercise as a dis- 
crete version of the calculus theorem: Iff is a continuous func- 
tion on [a, b] and f(a) > 0 and f(b) < 0, then f(c) = 0 for 
some c in (a,b). There are similar proofs of the two state- 
ments. 


Disprove the statement: For every positive integer n, n> < 2”. 


In Exercises 27-31, 


is the average of the real numbers s,... 


SL S28 F Sn 
~ n 


>Sn- 


27. 
28. 


29. 


30. 


31. 


32. 
33. 
34. 


35. 
36. 
37. 


38. 


39. 


40. 


41. 


42. 


43. 


44. 


45. 


46. 


47. 


48. 


Prove that there exists i such that 5; < A. 


Prove or disprove: There exists i such that s; > A. What proof 
technique did you use? 


Suppose that there exists i such that 5; < A. Prove or disprove: 
There exists j such that s; > A. What proof technique did you 
use? 


Suppose that there exist i and j such that s; 4 s;. Prove that 
there exists k such that s, < A. 


Suppose that there exist i and j such that s; 4 s;. Prove that 
there exists k such that 5s, > A. 


Prove that 2m -+ 5n? = 20 has no solution in positive integers. 


Prove that m3 + 2n? = 36 has no solution in positive integers. 


Prove that 2m? + 4n? — 1 = 2(m +n) has no solution in pos- 
itive integers. 


Prove that the product of two consecutive integers is even. 
Prove that for every n € Z, n> + nis even. 

Use proof by cases to prove that |xy| = |x||y| for all real num- 
bers x and y. 

Use proof by cases to prove that |x + y| < |x| + |y| for all real 
numbers x and y. 


Define the sign of the real number x, sgn(x), as 
1 ifx>0 
sgn(x) = 0 ifx=0 
-1 ifx<0. 


Use proof by cases to prove that |x| = sgn(x)x for every real 
number x. 

Use proof by cases to prove that sgn(xy) = sgn(x)sgn(y) for 
all real numbers x and y (sgn is defined in Exercise 39). 

Use Exercises 39 and 40 to give another proof that |xy| = |x|[y| 
for all real numbers x and y. 


Use proof by cases to prove that max{x, y} + min{x, y} = 
x + y for all real numbers x and y. 


Use proof by cases to prove that 
x+ytlk—yl 
max{x, y} = a aa 
for all real numbers x and y. 
Use proof by cases to prove that 
. xty-lx—yl 
min{x, y} = ———__—— 
: 2, 
for all real numbers x and y. 


Use Exercises 43 and 44 to prove that max{x, y}+ min{x, y} = 
x + y for all real numbers x and y. 


Prove that for all n € Z, n is even if and only if n + 2 is 
even. 


Prove that for all n € Z, n is odd if and only if n + 2 is 
odd. 


Prove that for all sets A and B, A C B if and only if B C A. 


‘Informally, a sequence is a list of elements in which order is taken into account, so that s; is the first element, 
s2 is the second element, and so on. We present the formal definition in Section 3.2. 


Problem-Solving Corner: Proving Some Properties of Real Numbers 


49. Prove that for all sets A, B, and C, A C C and B C Cif and 


only ifAUBCC. 


50. Prove that for all sets A, B, and C, C C A and C C Bif and 


only if CCANB. 
x51. The ordered pair (a, b) can be defined in terms of sets as 


(a, b) = {{a}, {a, b}}. 


Taking the preceding equation as the definition of ordered pair, 


prove that (a, b) = (c, d) if and only if a = c and b = d. 
52. Prove that the following are equivalent for the integer n: 


(a) nis odd. (b) There exists k € Z such that n = 2k — 1. 
(c) n* + 1 is even. 


54. 


53. Prove that the following are equivalent for sets A, B, and C: 


(aaANB=@ (b)BCA (c) AAB=AUB, 


where A is the symmetric difference operator (see Exercise 


101, Section 1.1). 


(ay AUB=U (bJ)ANB=2 (c)A CB, 


where U is a universal set. 


Problem-Solving Corner Proving Some Properties of Real Numbers 


Problem 
First some definitions: 


(a) Let X be a nonempty set of real numbers. An 
upper bound for X is a real number a having 
the property that x < a for every x € X. 


(b) Let a be an upper bound for a set X of real 
numbers. If every upper bound b for X satisfies 
b> a, we call aa least upper bound for X. 


A fundamental property of the real numbers is that ev- 
ery nonempty subset of real numbers bounded above 
has a least upper bound. 

Answer the following where R serves as a univer- 
sal set: 


1. Give an example of a set X and three distinct upper 
bounds for X, one of which is a least upper bound 
for X. 

2. Prove that if a and b are least upper bounds for a set 
X, then a = b. We say that the least upper bound 
for a set X is unique. If ais the least upper bound of 
a set X, we sometimes write a = lub X. 

3. Let X be a set with least upper bound a. Prove 
that if « > O, then there exists x € X satisfying 
a-—é<x<a. 

4. Let X be a set with least upper bound a, and suppose 
that t > 0. Prove that fa is the least upper bound of 
the set {tx | x € X}. 


Attacking the Problem 


To better understand the definitions, let’s construct ex- 
amples, write out the definitions in words, look at nega- 
tions of the definitions, and draw pictures. 

We’ ll start with definition (a) and construct a sim- 
ple example—taking X to be a small finite set, say 


X = {1,2, 3,4}. Now an upper bound a for X satis- 
fies x < a for every x in X—here, we must have 


lesa, 23m 3Sa, @]S3@, 


Examples of upper bounds for X are 4, 6.9, 377, 9072. 

In words, definition (a) says that a is an upper 
bound for a set X if every element in X is less than or 
equal to a. We see that upper bounds e, f, and g for a 
set X (shown in color) look like 


x e i & 


Prove that the following are equivalent for sets A, B, and C: 


What would it mean that a is not an upper bound 
for a set X? We would have to negate definition (a): 
=Vx(x < a) or, equivalently, Sx-(x < a) or dx(x > a). 
In words, a is not an upper bound for a set X if there 
exists x in X such that x > a. Looking at the preceding 
picture, we see that any number less than e is not an 
upper bound for X. 

Let’s turn to definition (b), which says, in words, 
that a is a least upper bound for a set X if, among all 
upper bounds for X, a is smallest. Looking ahead, prob- 
lem 2 is to show that there is only one (distinct) upper 
bound for a set X; thus, we usually say the least up- 
per bound rather than a least upper bound. The least 
upper bound of our previous set X = {1, 2,3, 4} 
is 4. We have already noted that 4 is an upper bound 
for X. If a is any upper bound for X, since 4 € X, 
4 < a. Therefore, 4 is the least upper bound for X. 
In the preceding figure, e is a least upper bound for the 
set X. 


Finding a Solution 


Now we consider the problems. 
[Problem 1.] Our previous example, X = 
{1, 2, 3,4}, will suffice. We have noted that 4 is the 
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least upper bound for X. Any values greater than 4 
serve as additional upper bounds. 

{Problem 2.] One way to prove that two numbers 
a and b are equal is to show thata < bandb < a. 
We’ ll try this first. Another possibility is proof by con- 
tradiction and assume that a $ b. 

[Problem 3.] Here we can use the fact that a — « is 
not an upper bound (since it’s less than the least upper 
bound) and, as discussed previously, what it means for 
a value to not be an upper bound. 

[Problem 4.] Here we are given a value, fa, and 
asked to prove that it is the least upper bound of the 
given set, which here we denote as tX. Going directly 
to the definitions, we must show that 


(a) z < ta for every z € 1X (i.e., fa is an upper 
bound for tx), 


(b) if b is an upper bound for 4X, then b > fa (ie., 
ta is the least upper bound for fX). 


For part (a), since z = tx (x € X), we must show 
that 
tx<ta forallxe X. 


We are given that a is a least upper bound for X. In 
particular, a is an upper bound for X so 


x<a forallxe X. 


How do we deduce the first inequality from the sec- 
ond? Multiply by t! We hope that the proof of part (b) 
proceeds in a similar way. 


Formal Solution 


[Problem 1.] Let X = {1, 2, 3, 4}. Upper bounds for X 
are 4, 5, and 6 since x < 4, x < 5, and x < 6 for every 
xe X. 

The least upper bound for X is 4. We have already 
noted that 4 is an upper bound for X. If a is any upper 
bound for X, since 4 € X, 4 < a. Therefore, 4 is the 
least upper bound for X. 

[Problem 2.] Since a is a least upper bound for X 
and b is an upper bound for X, a < b. Since bis a 
least upper bound for X and a is an upper bound for X, 
b < a. Therefore, a = b. 

[Problem 3.] Let ¢ > 0. Since a is the least upper 
bound for X and a—e < a,a—€ is not an upper bound 
for X. Therefore, by definition (a) there exists x € X 
such that a — ¢ < x. Since a is an upper bound for X, 
x < a. We have shown that there exists x € X such that 
G2 SSO 

[Problem 4.] Let tX denote the set {tx | x € X}. 
We must prove that 


(a) z < ta for every z € 1X (i.e., fa is an upper 
bound for tX), 


(b) if b is an upper bound for ¢X, then b > fa (i.e., 
ta is the least upper bound for #X). 


We first prove part (a). Let z € X. Then z = tx for 
some x € X (by the definition of the set tX). Since a is 
an upper bound for X, x < a. Multiplying by ¢ and not- 
ing that t > 0, we have z = tx < ta. Therefore, z < ta 
for every z € 4X and the proof of part (a) is complete. 

Next we prove part (b). Let b be an upper bound 
for tX. Then tx < b for every x € X (since an arbitrary 
element in ¢X is of the form tx for some x € X). Di- 
viding by f and noting that t > 0, we have x < b/t for 
every x € X. Therefore b/t is an upper bound for X. 
Since a is the least upper bound for X, b/t > a. Multi- 
plying by ¢ and noting again that t > 0, we have b > ta. 
Therefore fa is the least upper bound for tX. The proof 
is complete. 


Summary of Problem-Solving Techniques 


= Before beginning a proof, familiarize yourself 
with relevant definitions, theorems, examples, 
and so on. 


= Construct additional examples—especially 
small examples (e.g., for sets look at some small 
finite sets). 


= Write out some of the technical statements in 
words. 


= Look at negations of statements. 
= Draw pictures. 


= If one proof technique seems not to be working, 
try another. For example, if a direct proof seems 
unpromising, try a proof by contradiction. 


™ Review the Problem-Solving Tips sections in 
this chapter and the previous chapter. 


Comments 


The fact that every nonempty set of real numbers that 
is bounded above has a least upper bound is called 
the completeness property of the real numbers. The 
real numbers are complete in the sense that there are 
no “holes” in the number line. Informally, if there was 
a hole in the line, the set of numbers to the left of the 
hole, although bounded above, would not have a least 
upper bound: 


The set of rational numbers is not complete. The 
subset of rational numbers less than V2 is bounded 
above, but does not have a rational least upper bound. 
(The least upper bound of the subset of rational num- 
bers less than 2 is the irrational number ,)) 


Exercises 
1. What is the least upper bound of a nonempty finite 
set of real numbers? 
2. What is the least upper bound of the set 
{1 — 1/n | nis a positive integer}? 
Prove your answer. 


3. Let X and Y be nonempty sets of real numbers such 
that X C Y and Y is bounded above. Prove that X is 
bounded above and lub X < lub Y. 


4. Let X be a nonempty set. What is the least upper 
bound of the set {tx | x € X} if t= 0? 


x5. Let X be a set with least upper bound a, and let Y 
be a set with least upper bound b. Prove that the set 


{x+y|xeXandye ¥} 


is bounded above and its least upper bound is a+ b. 


2.3 @ Resolution Proofs 


Let X be a nonempty set of real numbers. A lower 
bound for X is a real number a having the property 
that x > a for every x € X. Let a be a lower bound 
for a set X of real numbers. If every lower bound 
b for X satisfies b < a, we call a a greatest lower 
bound for X. 


. Prove that if a and b are greatest lower bounds for 


assctexenthenmiai—tps 


. Prove that every nonempty subset of real num- 


bers bounded below has a greatest lower bound. 
Hint: lf X is a nonempty set of real numbers 
bounded below, let Y denote the set of lower 
bounds. Prove that Y has a least upper bound, 
say a. Prove that a is the greatest lower bound 
for X. 


. Let X be a set with greatest lower bound a. Prove 


that if e > O, then there exists x € X satisfying 
O-e == a. 


. Let X be a set with least upper bound a, and let 


t < 0. Prove that fa is the greatest lower bound of 
the set {tx | x € X}. 
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2.3 


Example 2.3.1 
Example 2.3.2 


Example 2.3.3 


Resolution Proofs* 


In this section, we will write a A bas ab. 
Resolution is a proof technique proposed by J. A. Robinson in 1965 (see 
[Robinson]) that depends on a single rule: 


If p V q and —p V r are both true, then g V r is true. (2.3.1) 


Statement (2.3.1) can be verified by writing the truth table (see Exercise 1). Because 
resolution depends on this single, simple rule, it is the basis of many computer programs 
that reason and prove theorems. 

In a proof by resolution, the hypotheses and the conclusion are written as clauses. 
A clause consists of terms separated by or’s, where each term is a variable or the negation 
of a variable. 


The expression a V b V -c V d is a clause since the terms a, b, c, and d are separated 
by or’s and each term is a variable or the negation of a variable. < 


The expression xy V w V —z is not a clause even though the terms are separated by or’s, 
since the term xy consists of two variables—not a single variable. < 


The expression p — q is not a clause since the terms are separated by —. Each term is, 
however, a variable. < 


‘This section can be omitted without loss of continuity. 
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Example 2.3.5 


A direct proof by resolution proceeds by repeatedly applying (2.3.1) to pairs of 
statements to derive new statements until the conclusion is derived. When we apply 
(2.3.1), p must be a single variable, but g and r can be expressions. Notice that when 
(2.3.1) is applied to clauses, the result g V r is a clause. (Since qg and r each consist of 
terms separated by or’s, where each term is a variable or the negation of a variable, g Vr 
also consists of terms separated by or’s, where each term is a variable or the negation of 
a variable.) 


Prove the following using resolution: 
aVb 
saVc 


acVd 


bvd 
SOLUTION Applying (2.3.1) to expressions | and 2, we derive 
4. bve. 
Applying (2.3.1) to expressions 3 and 4, we derive 
5. bvd, 


the desired conclusion. Given the hypotheses 1, 2, and 3, we have proved the conclusion 
bv d. < 


Special cases of (2.3.1) are as follows: 
If p V q and —p are true, then gq is true. 


(2.3.2) 
If p and —p V r are true, then r is true. 


Prove the following using resolution: 


a 

2. maVec 

3. xacVvd 
vd 


SOLUTION Applying (2.3.2) to expressions 1 and 2, we derive 
4. ¢. 


Applying (2.3.2) to expressions 3 and 4, we derive 
3d, 


the desired conclusion. Given the hypotheses 1, 2, and 3, we have proved the conclu- 
sion d. < 


If a hypothesis is not a clause, it must be replaced by an equivalent expression that 
is either a clause or the and of clauses. For example, suppose that one of the hypotheses 
is —(a V b). Since the negation applies to more than one term, we use the first of De 
Morgan’s laws (see Example 1.3.11) 


=a(aV b) =-a-b, a(ab) = -aV —b (2.3.3) 


Example 2.3.6 


Example 2.3.7 
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to obtain an equivalent expression with the negation applying to single variables: 
a(a Vv b) = -a-b. We then replace the original hypothesis —(a Vv b) by the two hy- 
potheses —a and —b. This replacement is justified by recalling that individual hypotheses 
hy, and hz are equivalent to /,h2 (see Definition 1.4.1 and the discussion that precedes 
it). Repeated use of De Morgan’s laws will result in each negation applying to only one 
variable. 

An expression that consists of terms separated by or’s, where each term consists 
of the and of several variables, may be replaced by an equivalent expression that consists 
of the and of clauses by using the equivalence 


aVbc=(av by(aVc). (2.3.4) 


In this case, we may replace the single hypothesis av bc by the two hypotheses av b and 
av c. By using first De Morgan’s laws (2.3.3) and then (2.3.4), we can obtain equivalent 
hypotheses, each of which is a clause. 


Prove the following using resolution: 


l. aV-be 
2. -x(avd) 
J. =b 


SOLUTION We use (2.3.4) to replace hypothesis 1 with the two hypotheses a v —b 
and av c. We use the first of De Morgan’s laws (2.3.3) to replace hypothesis 2 with the 
two hypotheses —a and —d. The argument becomes 


1. av-b 

2. aVec 

3. =a 

4. -d 
Pie! 


Applying (2.3.1) to expressions | and 3, we immediately derive the conclusion —b. < 


In automated reasoning systems, proof by resolution is combined with proof by 
contradiction. We write the negated conclusion as clauses and add the clauses to the 
hypothesis. We then repeatedly apply (2.3.1) until we derive a contradiction. 


Give another proof of Example 2.3.4 by combining resolution with proof by 
contradiction. 


SOLUTION We first negate the conclusion and use the first of De Morgan’s laws (2.3.3) 
to obtain =(b V d) = —b-d. We then add the clauses —b and —d to the hypotheses to 
obtain 


l. avb 

2. maVe 

3. -cVvd 

4. <=b 

5. ad 
Applying (2.3.1) to expressions | and 2, we derive 

6. bVc. 


Applying (2.3.1) to expressions 3 and 6, we derive 
7. bvd. 
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Applying (2.3.1) to expressions 4 and 7, we derive 
8. d. 


Now 5 and 8 combine to give a contradiction, and the proof is complete. < 


It can be shown that resolution is correct and refutation complete. “Resolution 
is correct” means that if resolution derives a contradiction from a set of clauses, the 
clauses are inconsistent (i.e., the clauses are not all true). “Resolution is refutation com- 
plete” means that resolution will be able to derive a contradiction from a set of inconsis- 
tent clauses. Thus, if a conclusion follows from a set of hypotheses, resolution will be 
able to derive a contradiction from the hypotheses and the negation of the conclusion. 
Unfortunately, resolution does not tell us which clauses to combine in order to deduce 
the contradiction. A key challenge in automating a reasoning system is to help guide 
the search for clauses to combine. References on resolution and automated reasoning 
are [Gallier; Genesereth; and Wos]. 


2.3 Problem-Solving Tips 


To construct a resolution proof, first replace any of the hypotheses or conclusion that is 
not a clause with one or more clauses. Then replace pairs of hypotheses of the form pV q 
and —p V r with g Vv r until deriving the conclusion. Remember that resolution can be 
combined with proof by contradiction. 


2.3 Review Exercises 


1. What rule of logic does proof by resolution use? 3. Explain how a proof by resolution proceeds. 


2. What is a clause? 


1. Write a truth table that proves (2.3.1). 4. s=pvt 5. p>q 6 por 
, F ease 8 : ‘ aqVs PY r 
Use resolution to derive each conclusion in Exercises 2-6. Hint: ve 
In Exercises 5 and 6, replace — and < with logically equivalent ae Wg up 
: PpYqvrvVvu 
expressions that use or and and. ee 
2. mpvqvr 3. apvr oe ee 
“| TV q 7. Use resolution and proof by contradiction to re-prove Exer- 
ar cises 2-6. 
2Sp “9 8. Use resolution and proof by contradiction to re-prove Exam- 
ple 2.3.6. 
2.4 Mathematical Induction 
Suppose that a sequence of blocks numbered 1, 2, ... sits on an (infinitely) long table 
Go Online (see Figure 2.4.1) and that some blocks are marked with an “X.” (AI of the blocks visible 


For more on 
mathematical induction, see 


goo.gl/gHgyey 


in Figure 2.4.1 are marked.) Suppose that 


The first block is marked. (2.4.1) 
For all n, if block n is marked, then block n + 1 is also marked. (2.4.2) 


We claim that (2.4.1) and (2.4.2) imply that every block is marked. 


1(2 
Sy = = x 
2(3 
So = a x 
(n-1)n 
Sn-1 = 5 x 
n(n+1 
Sn = = ) x 
(n+1)(n+2) 
So = 


Figure 2.4.2 A sequence of 
statements. True statements are 
marked with x. 
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Va aaa ae 


Figure 2.4.1 Numbered blocks on a table. 


We examine the blocks one by one. Statement (2.4.1) explicitly states that block 1 
is marked. Consider block 2. Since block | is marked, by (2.4.2) (taking n = 1), block 2 
is also marked. Consider block 3. Since block 2 is marked, by (2.4.2) (taking n = 2), 
block 3 is also marked. Continuing in this way, we can show that every block is marked. 
For example, suppose that we have verified that blocks 1-5 are marked, as shown in 
Figure 2.4.1. To show that block 6, which is not shown in Figure 2.4.1, is marked, we 
note that since block 5 is marked, by (2.4.2) (taking n = 5), block 6 is also marked. 

The preceding example illustrates the Principle of Mathematical Induction. To 
show how mathematical induction can be used in a more profound way, let S,, denote 
the sum of the first n positive integers: 


S,=1+2+---+n. (2.4.3) 


Suppose that someone claims that 


1 
Sn = ee for all n > 1. (2.4.4) 
A sequence of statements is really being made, namely, 
1(2) 2(3) 3(4) 
Ss; = —— =1, & = —— =3,. $3 = —— = 6, as 
3 a a 


66 


Suppose that each true equation has an “x” placed beside it (see Figure 2.4.2). 
Since the first equation is true, it is marked. Now suppose we can show that for all n, 
if equation n is marked, then equation n + | is also marked. Then, as in the example 
involving the blocks, all of the equations are marked; that is, all the equations are true 
and the formula (2.4.4) is verified. 

We must show that for all n, if equation n is true, then equation n + | is also true. 
Equation n is 


n(n + 1) 


Sa = 
2 


(2.4.5) 


Assuming that this equation is true, we must show that equation n + 1 


(n+ 1)(n + 2) 
n+1 = a 


is true. According to definition (2.4.3), 
Snyp =1H+24+---+n+(n4+1). 
We note that S,, is contained within S,,,;, in the sense that 


Sn4i = 1 t2+---+n+(n+1)=S,4+ (n+ 1). (2.4.6) 
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Principle of 
Mathematical Induction 


Example 2.4.2 


Example 2.4.3 


Because of (2.4.5) and (2.4.6), we have 
(n+ 1) 


Sn4t =S,+@+D= : 2 +(n+1). 
Since 
n(n + 1) n(n+1)  2(n+1) 
t(n+1)= t 
2 2 2 
_ antl) +2(n+1) 
7 2 
_ (n+ D2) 
= 5 ; 
we have 
(n + 1)(n + 2) 
Sn+l = —— . 


2 


Therefore, assuming that equation n is true, we have proved that equation n + | is true. 
We conclude that all of the equations are true. 

Our proof using mathematical induction consisted of two steps. First, we veri- 
fied that the statement corresponding to n = 1 was true. Second, we assumed that 
statement n was true and then proved that statement n + 1 was also true. In prov- 
ing statement n + 1, we were permitted to make use of statement n; indeed, the trick 
in constructing a proof using mathematical induction is to relate statement n to state- 
ment n+ 1. 

We next formally state the Principle of Mathematical Induction. 


Suppose that we have a propositional function $(n) whose domain of discourse is the 
set of positive integers. Suppose that 


S(1) is true; (2.4.7) 
for alln > 1, if S(n) is true, then S(n + 1) is true. (2.4.8) 


Then S(n) is true for every positive integer n. 


Condition (2.4.7) is sometimes called the Basis Step and condition (2.4.8) is some- 
times called the Inductive Step. Hereafter, “induction” will mean “mathematical induc- 
tion.” 

After defining n factorial, we illustrate the Principle of Mathematical Induction 
with another example. 


Definition 2.4.1 > 1 factorial is defined as 


watt ifn =0 
ta DO) tin 1, 


That is, if nm > 1, n! is equal to the product of all the integers between 1| and n inclusive. 
AS a special case, 0! is defined to be 1. < 


Ol=1!=1, 3!=3-2-1=6, 6!=6-5-4-3-2-1=720 < 


Use induction to show that 


n} > 2"! for alln > 1. (2.4.9) 


Example 2.4.4 
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SOLUTION 


Basis Step (n = 1) 
{Condition (2.4.7)] We must show that (2.4.9) is true if n = 1. This is easily accom- 
plished, since 1! = 1>1= 2, 
Inductive Step 
{Condition (2.4.8)] We assume that the inequality is true for n > 1; that is, we assume 
that 

ni > 2"! (2.4.10) 


is true. We must then prove that the inequality is true for n + 1; that is, we must prove 
that 


(n+ 1)! > 2" (2.4.11) 
is true. We can relate (2.4.10) and (2.4.11) by observing that (n + 1)! = (n+ 1)(v!). 
Now 
(n+ 1)! = (n+ 1)(n!) 
> (n+1)2""! by (2.4.10) 


>2.2"! sincen+1>2 


Therefore, (2.4.11) is true. We have completed the Inductive Step. 
Since the Basis Step and the Inductive Step have been verified, the Principle of 
Mathematical Induction tells us that (2.4.9) is true for every positive integer n. < 


If we want to verify that the statements S(79), S(m9 + 1),..., where no # 1, are 
true, we must change the Basis Step to S(1o) is true. In words, the Basis Step is to prove 
that the propositional function S(m) is true for the smallest value no in the domain of 
discourse. 

The Inductive Step then becomes 


for alln = no, if S(n) is true, then S(n + 1) is true. 


Geometric Sum Use induction to show that if r 4 1, 


a pitt ry | 
atar! +ar+---+tar'= ( i ) (2.4.12) 
yes 


for alln > 0. 
The sum on the left is called the geometric sum. In the geometric sum in which 
a #Oandr #0, the ratio of adjacent terms [(ar't!)/(ar’) = r] is constant. 


SOLUTION 


Basis Step (n = 0) 
Since the smallest value in the domain of discourse {n | n > 0} is n = O, the Basis Step 
is to prove that (2.4.12) is true for n = 0. For n = 0, (2.4.12) becomes 
= a(r'! — 1) 
~ ra] 


’ 


which is true. 
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Inductive Step 


Assume that statement (2.4.12) is true for n. Now 


tl — 
a-+ar! + ar’ Abeer + ar" + art! = SS 4 art! 
r— 
_ art! —1)— ar™!(r-1) 
~ r-1 r-—1 
_ a(r"*? — 1) 
~ r-1 : 


Since the Basis Step and the Inductive Step have been verified, the Principle of Mathe- 
matical Induction tells us that (2.4.12) is true for all n > 0. < 


As an example of the use of the geometric sum, if we take a = | andr = 2 in 
(2.4.12), we obtain the formula 


gntl =] 
Wate OF 9? he cots he = i 

The reader has surely noticed that in order to prove the previous formulas, one has 
to be given the correct formulas in advance. A reasonable question is: How does one 
come up with the formulas? There are many answers to this question. One technique 
to derive a formula is to experiment with small values and try to discover a pattern. 
(Another technique is discussed in Exercises 70-73). For example, consider the sum 
14+3+---+(2n—1). The following table gives the values of this sum for n = 1, 2, 3, 4. 


n 1434+ (@Qn—1) 
1 1 
2 4 
3 9 
4 16 


Since the second column consists of squares, we conjecture that 
1434---+(Qn-N=r for every positive integer n. 


The conjecture is correct and the formula can be proved by mathematical induction (see 
Exercise 1). 

At this point, the reader may want to read the Problem-Solving Corner that follows 
this section. This Problem-Solving Corner gives an extended, detailed exposition of how 
to do proofs by mathematical induction. 

Our final examples show that induction is not limited to proving formulas for sums 
and verifying inequalities. 


Use induction to show that 5” — | is divisible by 4 for all n > 1. 


SOLUTION 


Basis Step (n = 1) 
Ifn =1,5"—1=5! —1 =4, which is divisible by 4. 


Inductive Step 


We assume that 5” — | is divisible by 4. We must then show that 5"+! — | is divisible 
by 4. We use the fact that if p and q are each divisible by k, then p + gq is also divisible 
by k. In our case, k = 4. We leave the proof of this fact to the exercises (see Exercise 74). 


Theorem 2.4.6 


Subsets of X | Subsets of X 
that that do not 
contain a contain a 
{a} ) 
{a, b} {b} 
{a, c} {c} 
{a, b, c} {b, c} 


Figure 2.4.3 Subsets of 

X = {a, b, c} divided into two 
classes: those that contain a and 
those that do not contain a. Each 
subset in the right column is 
obtained from the corresponding 
subset in the left column by 
deleting the element a from it. 
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We relate the (n + 1)st case to the nth case by writing 
5"! __ | = 5" — | + to be determined. 


Now, by the inductive assumption, 5” — 1 is divisible by 4. If “to be determined” is 
also divisible by 4, then the preceding sum, which is equal to 5’*! — 1, will also be 
divisible by 4, and the Inductive Step will be complete. We must find the value of “to be 
determined.” 
Now 
shth_ y= 5.5" 154-5" 41-5" = 1. 


Thus, “to be determined” is 4-5”, which is divisible by 4. Formally, we could write the 
Inductive Step as follows. 

By the inductive assumption, 5” — | is divisible by 4 and, since 4-5” is divisible 
by 4, the sum 


(5" _ 1) +4.5" = stl a5'] 


is divisible by 4. 
Since the Basis Step and the Inductive Step have been verified, the Principle of 
Mathematical Induction tells us that 5” — 1 is divisible by 4 for all n > 1. < 


We next give the proof promised in Section 1.1 that if a set X has n elements, the 
power set of X, P(X), has 2” elements. 


If |X| =n, then 
IP (X)| = 2” (2.4.13) 


for all n > 0. 


Proof The proof is by induction on n. 


Basis Step (n = 0) 
If n = 0, X is the empty set. The only subset of the empty set is the empty set itself; 
thus, 
|P(X)| =1=2° =2". 
Thus, (2.4.13) is true for n = 0. 


Inductive Step 


Assume that (2.4.13) holds for n. Let X be a set with n+ | elements. Choose x € X. We 
claim that exactly half of the subsets of X contain x, and exactly half of the subsets of X 
do not contain x. To see this, notice that each subset S of X that contains x can be paired 
uniquely with the subset obtained by removing x from S (see Figure 2.4.3). Thus exactly 
half of the subsets of X contain x, and exactly half of the subsets of X do not contain x. 
If we let Y be the set obtained from X by removing x, Y has n elements. By the 
inductive assumption, |P(Y)| = 2”. But the subsets of Y are precisely the subsets of X 
that do not contain x. From the argument in the preceding paragraph, we conclude that 


|P(X)| 


IP)| = ) 


Therefore, 
|P(X)| = 2|/P(Y)| =2 - 27 = 271, 


Thus (2.4.13) holds for n + 1 and the inductive step is complete. By the Principle of 
Mathematical Induction, (2.4.13) holds for all n > 0. < 
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Go Online 


For more on trominoes, see 
goo.gl/gHgyey 


Figure 2.4.4 A tromino. 


ATiling Problem A right tromino, hereafter called simply a tromino, is an object made 
up of three squares, as shown in Figure 2.4.4. A tromino is a type of polyomino. Since 
polyominoes were introduced by Solomon W. Golomb in 1954 (see [Golomb, 1954]), 
they have been a favorite topic in recreational mathematics. A polyomino of order s 
consists of s squares joined at the edges. A tromino is a polyomino of order 3. Three 
squares in a row form the only other type of polyomino of order 3. (No one has yet 
found a simple formula for the number of polyominoes of order s.) Numerous problems 
using polyominoes have been devised (see [Martin]). 


We give Golomb’s inductive proof (see [Golomb, 1954]) that if we remove one 
square from an n x n board, where n is a power of 2, we can tile the remaining squares 
with right trominoes (see Figure 2.4.5). By a tiling of a figure by trominoes, we mean 
an exact covering of the figure by trominoes without having any of the trominoes over- 
lap each other or extend outside the figure. We call a board with one square missing a 
deficient board. 

We now use induction on k to prove that we can tile a 2 x 2* deficient board with 
trominoes for all k > 1. 


Basis Step (k = 1) 


If k = 1, the 2 x 2 deficient board is itself a tromino and can therefore be tiled with one 
tromino. 


Inductive Step 


Assume that we can tile a 2 x 2* deficient board. We show that we can tile a 2‘! x 2**! 
deficient board. 

Consider a 2‘*+! x 2'+! deficient board. Divide the board into four 2° x 2* boards, 
as shown in Figure 2.4.6. Rotate the board so that the missing square is in the upper-left 
quadrant. By the inductive assumption, the upper-left 2 x 2" board can be tiled. Place 
one tromino T in the center, as shown in Figure 2.4.6, so that each square of T is in 
each of the other quadrants. If we consider the squares covered by T as missing, each of 
these quadrants is a 2" x 2* deficient board. Again, by the inductive assumption, these 
boards can be tiled. We now have a tiling of the 2+! x 2*+! board. By the Principle 
of Mathematical Induction, it follows that any 2 x 2* deficient board can be tiled with 
trominoes,k = 1,2,.... 

If we can tile ann x n deficient board, where n is not necessarily a power of 2, then 
the number of squares, n* — 1, must be divisible by 3. [Chu] showed that the converse is 
true, except when n is 5. More precisely, ifn 4 5, any n x n deficient board can be tiled 


gkt1 


ex ok 1 Ok x 2k 


gkt1 


Figure 2.4.6 Using mathematical 


Figure 2.4.5 Tiling a4 x 4 induction to tile a 24+! 2+! 
deficient board with trominoes. deficient board with trominoes. 
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with trominoes if and only if 3 divides n? — 1 (see Exercises 28 and 29, Section 2.5). 
[Some 5 x 5 deficient boards can be tiled and some cannot (see Exercises 33-35). ] 
Some real-world problems can be modeled as tiling problems. One example is 
the VLSI layout problen—the problem of packing many components on a computer 
chip (see [Wong]). (VLSI is short for very large scale integration.) The problem is to 
tile a rectangle of minimum area with the desired components. The components are 
sometimes modeled as rectangles and L-shaped figures similar to (right) trominoes. In 
practice, other constraints are imposed such as the proximity of various components that 
must be interconnected and restrictions on the ratios of width to height of the resulting 
rectangle. < 


A loop invariant is a statement about program variables that is true just before a 
loop begins executing and is also true after each iteration of the loop. In particular, a loop 
invariant is true after the loop finishes, at which point the invariant tells us something 
about the state of the variables. Ideally, this statement tells us that the loop produces the 
expected result, that is, that the loop is correct. For example, a loop invariant for a while 
loop 


while (condition) 
// loop body 


is true just before condition is evaluated the first time, and it is also true each time the 
loop body is executed. 

We can use mathematical induction to prove that an invariant has the desired be- 
havior. The Basis Step proves that the invariant is true before the condition that controls 
looping is tested for the first time. The Inductive Step assumes that the invariant is true 
and then proves that if the condition that controls looping is true (so that the loop body 
is executed again), the invariant is true after the loop body executes. Since a loop iter- 
ates a finite number of times, the form of mathematical induction used here proves that 
a finite sequence of statements is true, rather than an infinite sequence of statements as 
in our previous examples. Whether the sequence of statements is finite or infinite, the 
steps needed for the proof by mathematical induction are the same. We illustrate a loop 
invariant with an example. 


Use a loop invariant to prove that when the pseudocode 


i=1 

fact = 1 

while (i < n) { 
i=i+l1 
fact = fact *i 


} 


terminates, fact is equal to n!. 


SOLUTION We prove that fact = i! is an invariant for the while loop. Just before the 
while loop begins executing, i= 1 and fact = 1, so fact = 1!. We have proved the Basis 
Step. 

Assume that fact = i!. If i<n is true (so that the loop body executes again), 
i becomes i + | and fact becomes 


factx G+) =ile G+) =G4D)!. 


We have proved the Inductive Step. Therefore, fact = i! is an invariant for the while 
loop. 
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The while loop terminates when i = n. Because fact = i! is an invariant, at this 
point, fact = n!. < 


2.4 Problem-Solving Tips 


To prove 
a, +a.+---+a, = F(n) for alln > 1, 


where F(n) is the formula for the sum, first verify the equation for n = 1: a; = F(1) 
(Basis Step). This is usually straightforward. 
Now assume that the statement is true for n; that is, assume 


a +2 +++ + dy = F(n). 
Add a,,41 to both sides to get 
Qy + Ay +08) + Gn + Any = F(A) + apt. 
Finally, show that 
F(n) +an)4; = F(n+ 1). 


To verify the preceding equation, use algebra to manipulate the left-hand side of the equa- 
tion [F(7) +441] until you get F(n+ 1). Look at F(n+ 1) so you know where you’re 
headed. (It’s somewhat like looking up the answer in the back of the book!) You’ve 
shown that 


Q +a, +++++4n41 = F(n+ 1), 


which is the Inductive Step. Now the proof is complete. 

Proving an inequality is handled in a similar fashion. The difference is that instead 
of obtaining equality [F(n) + a,.; = F(n + 1) in the preceding discussion], you obtain 
an inequality. 

In general, the key to devising a proof by induction is to find case n “within” case 
n+ 1. Review the tiling problem (Example 2.4.7), which provides a striking example of 
case n “within” case n + 1. 


2.4 Review Exercises 


1. State the Principle of Mathematical Induction. 


3. Give a formula for the sum 1 + 2+---+n7. 


2. Explain how a proof by mathematical induction proceeds. 4. What is the geometric sum? Give a formula for it. 


In Exercises 1-12, using induction, verify that each equation is true 


for every positive integer n. 


1. 
2. 


3. 


24224 324..04 n= 


(-1)"*!n(v +1) 


§. 17-27 43?7--.- 4 (Dr = 


2 
14345+---4(Qn-lD=nr" 2 
ap 
l(n+2 Ae 4 Sony eg | 
(pie tad Saeki le ee Sia 2 
3 
fe se at 1 
1d) +22) +---+a(at) =(a+D!I-1 ie 36 sq "Ge tne 
_ ant 1)Qn+ n 


6 2n+1 


gt tg Se _ 1+3+5++:Qn-1) 
"2.4 2-466 " 2.4.6.8 | 2-4.6---(2n +2) 
1 1+3+5---(Qn+1) 
2 2+4-6--+(2n +2) 
ae ee ee 
es ee on | 
3 1 1 
~ 4 %W+1) 242) 


n(n + 1)(n+2)(3n+ 5) 
12 
cos[(x/2)(n + 1)] sin(nx/2) 
cos nx = 
sin(x/2) 


10. 1-27 4+2-37+---+n(n+ 1)? = 


«11. 


cosx + cos 2x +--+ 4 


provided that sin(x/2) 4 0. 


*12. lsinx+2sin2x+---+nsinnx 
sin [(n + 1)x] (n + 1) cos[(2n + 1)x/2] 
~ 4sin? (x/2) 2 sin (x/2) 


provided that sin(x/2) 4 0. 


In Exercises 13-18, using induction, verify the inequality. 
1 1-3-5---(2n— 1) 


BB. ee ES 
oS, = 3 dahon 


1+3-5---@n-D _ 1 
2+4-+6---(2n) ~ Sn+1 
15. 2n+1 <2", n=3,4,... 


«14, 


n= 1,2, 


#16. 2” > n?, n=4,5,... 
» AY + a7 +--+ + an 
#17. (ayaz +++ am)!/2" < 1+ as = ee n=1,2,..., and 
the a; are positive numbers 
18. (1 +x)" > 1-+nx, forx > —landn> 1 
19. Use the geometric sum to prove that 
rPotri4 frr< ! 
l-r 
for alln > Oand0 <r <1. 
*20. Prove that 
7 
ler 42-P?4---4n7" < 
(a —r)? 


for alln > 1 andO < r < 1. Hint: Using the result of the 
previous exercise, compare the sum of the terms in 


r r2 P r+ a rn 
r2 r r4 cae yr? 

r rt yr” 

rt 

pr-l yr? 

rt 


in the diagonal direction (,“) with the sum of the terms by 
columns. 


21. 
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Prove that 


for alln > 1. 


In Exercises 22-25, use induction to prove the statement. 


22. 
23. 
24. 
«25. 
26. 


27. 


28. 


29. 


30. 


31. 


32. 


33: 


34. 


35. 


36. 


x37. 


7" — 1 is divisible by 6, for alln > 1. 

11” — 6 is divisible by 5, for alln > 1. 
6-7" — 2-3" is divisible by 4, for all n > 1. 
3" + 7" — 2 is divisible by 8, for alln > 1. 
Use induction to prove that if X),.. 
(a) XA(X;UX2U- - -UX,) = (XNX))U(XNX2)U: - -U(XNX,). 
(b) X,N X2N---NX, =X, UX2U-+ UX. 


Use induction to prove that if X;,.. 


., X, and X are sets, then 


., X, are sets, then 
|X, x X2 xX +++ X Xp] = [Xi] + |X2]-- + |Xnl- 
Prove that the number of subsets S of {1,2,...,m}, with |S| 
even, is 2”7!, n =, 
By experimenting with small values of n, guess a formula for 
the given sum, 
1 1 1 
1 fs sg og clk 
1-2 


2-35 0° n(nt+ 1)’ 
then use induction to verify your formula. 


Use induction to show that n straight lines in the plane divide 
the plane into (n? -+n+2)/2 regions. Assume that no two lines 
are parallel and that no three lines have a common point. 


Show that the regions of the preceding exercise can be colored 
red and green so that no two regions that share an edge are the 
same color. 


Given n 0’s and n 1’s distributed in any manner whatsoever 
around a circle (see the following figure), show, using induc- 
tion on n, that it is possible to start at some number and proceed 
clockwise around the circle to the original starting position so 
that, at any point during the cycle, we have seen at least as 
many 0’s as 1’s. In the following figure, a possible starting 
point is marked with an arrow. 


yt 
ot 
ei 1 
1 


Give a tiling of a 5 x 5 board with trominoes in which the 
upper-left square is missing. 

Show a5 x 5 deficient board that is impossible to tile with 
trominoes. Explain why your board cannot be tiled with tro- 
minoes. 

Which 5 x 5 deficient boards can be tiled? 


Show that any (27) x (37) board, where i and j are positive 
integers, with no square missing, can be tiled with trominoes. 


Show that any 7 x 7 deficient board can be tiled with tromi- 
noes. 
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38. 


39. 


40. 


41. 


42. 


43. 


+44, 


45. 


46. 
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Show that any 11x 11 deficient board can be tiled with 
trominoes. Hint: Subdivide the board into overlapping 7 x 7 
and 5 x 5 boards and two 6 x 4 boards. Then, use Exercises 
33, 36, and 37. 


This exercise and the one that follows are due to Anthony 
Quas. A 2” x 2” L-shape, n > 0, is a figure of the form 


2 xe F 


2h 2 | OK OF 


with no missing squares. Show that any 2” x 2” L-shape can 
be tiled with trominoes. 


Use the preceding exercise to give a different proof that any 
2” x 2” deficient board can be tiled with trominoes. 


A straight tromino is an object made up of three squares in a 
row: 


Which 4 x 4 deficient boards can be tiled with straight 
trominoes? Hint: Number the squares of the 4 x 4 board, 
left to right, top to bottom: 1, 2,3, 1, 2,3, and so on. Note that 
if there is a tiling, each straight tromino covers exactly one 2 
and exactly one 3. 


Which 5 x 5 deficient boards can be tiled with straight 
trominoes? 


Which 8 x 8 deficient boards can be tiled with straight 
trominoes? 


A T-tetromino is an object made up of four squares 


Prove that an m x n rectangle can be tiled with T-tetrominoes 
if and only if 4 divides m and 4 divides n. 


Use a loop invariant to prove that when the pseudocode 


i=1 
pow = 
while (i < n) { 
pow = pow *a 
i=i+l1 
} 
terminates, pow is equal to a”. 
Prove that, after the following pseudocode terminates, a[h] = 
val; for all p, i < p < h, a[p] < val; and for all p, h < p <j, 
a(p] = val. In particular, val is in the position in the array 
ai], ..., aj] where it would be if the array were sorted. 


val = a{i] 
h=i 
fork =i+1toj 
if (a[k] < val) { 
h=h+1 
swap(a{h], a[k]) 
} 
swap(a{i], a[h]) 


Hint: Use the loop invariant: h < k; forall p,i < p < h,a[p] < 
val; and, for allp,h < p < k,a[p] = val. (A picture is helpful.) 

This technique is called partitioning. This particular 
version is due to Nico Lomuto. Partitioning can be used to 
find the kth smallest element in an array and to construct a 
sorting algorithm called quicksort. 


A 3D-septomino is a three-dimensional 2 x 2 x 2 cube with one 
1 x 1 x 1 corner cube removed. A deficient cube isak x k x k 
cube with one 1 x 1 x 1 cube removed. 


47. 


48. 


49. 


x50. 


51. 


Prove that a 2” x 2” x 2” deficient cube can be tiled by 3D- 
septominoes. 


Prove that if a k x k x k deficient cube can be tiled by 
3D-septominoes, then 7 divides one of k — 1,k —2,k —4. 


Suppose that S,, = (n + 2)(n — 1) is (incorrectly) proposed as 
a formula for 


Bide Bi ace yd Sed 


(a) Show that the Inductive Step is satisfied but that the Basis 
Step fails. 


*(b) If S/, is an arbitrary expression that satisfies the Inductive 


Step, what form must Se assume? 


What is wrong with the following argument, which allegedly 
shows that any two positive integers are equal? 

We use induction on n to “prove” that if a and b are 
positive integers and n = max{a, b}, then a = b. 


Basis Step (n= 1) 


If a and b are positive integers and 1 = max{a, b}, we must 
havea=b=1. 


Inductive Step 

Assume that if a’ and Db’ are positive integers and 
n= max{a',b’}, then a’ = Db’. Suppose that a and b are 
positive integers and that n + 1 = max{a,b}. Nown = 


max{a— 1, b— 1}. By the inductive hypothesis, a— 1 = b—1. 
Therefore, a = b. 

Since we have verified the Basis Step and the Inductive 
Step, by the Principle of Mathematical Induction, any two 
positive integers are equal! 


What is wrong with the following “proof” that 


2, on n- 


3° n+l? n+l 


for alln > 2? 


Suppose by way of contradiction that 


1.2 n n 

ae ars ae ee Me 
Then also 

1,2, ,.8 , 7+ (n+ 1)? 

2°37 "n+l n+2. n+2— 


We could prove statement (2.4.14) by induction. In particular, 
the Inductive Step would give 


1 a 7 n atl n atl 
273 ew ntl) ont20 ntl nt2’ 


Therefore, 
nr n+1 (n+ 1)? 
n+1 eee ~ n+2 
Multiplying each side of this last equation by (n + 1)(n + 2) 
gives 


m(n+2)+(n+1)? =(n+1). 


This last equation can be rewritten as 


Ww +27? +r +2n+1 =n +3 4+3n4+1 


or 


Ww +3n2 +2n | l=n7 | 3n? + 3n+1, 


1 2 n rv 


1 Le  sciacavadl 
T T Pat aed 


as claimed. 
52. Use mathematical induction to prove that 
Ls. 2 Lon nr 


| < 
n+1 n+1 


for all n > 2. This inequality gives a correct proof of the 
statement of the preceding exercise. 


In Exercises 53-57, suppose that n> 1 people are positioned in a 
field (Euclidean plane) so that each has a unique nearest neighbor. 
Suppose further that each person has a pie that is hurled at the 
nearest neighbor. A survivor is a person that is not hit by a pie. 


53. Give an example to show that if n is even, there might be no 
survivor. 


54. Give an example to show that there might be more than one 
survivor. 


x55. [Carmony] Use induction on n to show that if n is odd, there 
is always at least one survivor. 


56. Prove or disprove: If n is odd, one of two persons farthest 
apart is a survivor. 


57. Prove or disprove: If n is odd, a person who throws a pie the 
greatest distance is a survivor. 


Exercises 58-61 deal with plane convex sets. A plane convex set, 
subsequently abbreviated to “convex set,” is a nonempty set X in 
the plane having the property that if x and y are any two points in 
X, the straight-line segment from x to y is also in X. The following 
figures illustrate. 
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convex set nonconvex set 
58. Prove that if X and Y are convex sets and XM Y is nonempty, 
X11 Y is aconvex set. 


x59. Suppose that X), X2, X3,X4 are convex sets, each three of 
which have a common point. Prove that all four sets have a 
common point. 


x60. Prove Helly’s Theorem: Suppose that X1, X2,...,Xn,n = 4, 
are convex sets, each three of which have a common point. 
Prove that all n sets have a common point. 


61. Suppose that n > 3 points in the plane have the property that 
each three of them are contained in a circle of radius 1. Prove 
that there is a circle of radius | that contains all of the points. 


62. Ifa and Db are real numbers with a < b, an open interval (a, b) 
is the set of all real numbers x such that a < x < b. Prove that 
iff;,..., J, is aset of n > 2 open intervals such that each pair 
has a nonempty intersection, then 


QAbA:::Nh, 


is nonempty. 


Flavius Josephus was a Jewish soldier and historian who lived in 
the first century (see [Graham, 1994; Schumer]). He was one of 
the leaders of a Jewish revolt against Rome in the year 66. The 
following year, he was among a group of trapped soldiers who 
decided to commit suicide rather than be captured. One version of 
the story is that, rather than being captured, they formed a circle 
and proceeded around the circle killing every third person. Jose- 
phus, being proficient in discrete math, figured out where he and a 
buddy should stand so they could avoid being killed. 

Exercises 63-69 concern a variant of the Josephus Problem in 
which every second person is eliminated. We assume that n people 
are arranged in a circle and numbered 1, 2, ..., n clockwise. Then, 
proceeding clockwise, 2 is eliminated, 4 is eliminated, and so on, 
until there is one survivor, denoted J(n). 


63. Compute J(4). 

64. Compute /(6). 

65. Compute J(10). 

66. Use induction to show that J/(2') = 1 for alli > 1. 


67. Given a value of n > 2, let 2! be the greatest power of 2 with 
<n. (Examples: If nm = 10, i = 3. Ifn = 16,7 = 4.) Let 
j =n-— 2. (After subtracting 2', the greatest power of 2 less 
than or equal to n, from n, j is what is left over.) By using the 
result of Exercise 66 or otherwise, prove that 


Jn) =2j7+1. 


68. Use the result of Exercise 67 to compute J(1000). 
69. Use the result of Exercise 67 to compute J(100,000). 
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If aj, a2,... is a sequence, we define the difference operator A 
to be 


Aan = an+1 — Gn- 


The formula of Exercise 70 can sometimes be used to find a for- 
mula for a sum as opposed to using induction to prove a formula 
for a sum (see Exercises 71-73). 


70. Suppose that Aa, = b,. Show that 
by + bo +++++ by = any — a1. 


This formula is analogous to the calculus formula 
[2 F) dx = g(d) — g(c), where Dg = f (D is the deriva- 
tive operator). In the calculus formula, sum is replaced by 
integral, and A is replaced by derivative. 


Problem-Solving Corner 


71. Let a, = n’, and compute Aa,. Use Exercise 70 to find a 


formula for 
14+2+3+4---+n. 
72. Use Exercise 70 to find a formula for 
1d!) + 2(2!) +--+ +n(n!). 


(Compare with Exercise 3.) 


73. Use Exercise 70 to find a formula for 
: + : pies : 
1-2. 2-3 n(n+1)° 


(Compare with Exercise 29.) 


74. Prove that if p and q are divisible by k, then p + g is divisible 


by k. 


Mathematical Induction 


Problem 


Define 


i 1 1 
A,p=14+24+2-4+-:-4- {| 
k ig + (1) 
for all k > 1. The numbers Hj, Ho, ... are called the 


harmonic numbers. Prove that 
n 
Hy > 1+ = (2) 
2 
for alln > 0. 


Attacking the Problem 


It’s often a good idea to begin attacking a problem by 
looking at some concrete examples of the expressions 
under consideration. Let’s look at H; for some small 
values of k. The smallest value of k for which H, is 
defined is k = 1. In this case, the last term 1/k in the 
definition of H; equals 1/1 = 1. Since the first and last 
terms coincide, H; = 1. For k = 2, the last term 1/k 
in the definition of H; equals 1/2, so 


1 
Similarly, we find that 
ee 
i 2 3) 
Hee eae 
2 4: 


We observe that H; appears as the first term of 
A, H3, and H4, that H> appears as the first two terms of 
13 and H4, and that H3 appears as the first three terms 
of H4. In general, H,, appears as the first m terms of 


H, ifm < k. This observation will help us later because 
the Inductive Step in a proof by induction must relate 
smaller instances of a problem to larger instances of 
the problem. 

In general, it’s a good strategy to delay combining 
terms and simplifying until as late as possible, which is 
why, for example, we left H4 as the sum of four terms 
rather than writing H4 = 25/12. Since we left H4 as the 
sum of four terms, we were able to see that each of 
H, Hp, and H; appears in the expression for Hy. 


Finding a Solution 


The Basis Step is to prove the given statement for the 
smallest value of n, which here is n= 0. For n = 0, in- 
equality (2) that we must prove becomes 


Hy = 14521. 


We have already observed that H; = 1. Thus inequal- 
ity (2) is true when n = 0; in fact, the inequality is an 
equality. (Recall that by definition, if x = y is true, then 
x > y is also true.) 

Let’s move to the Inductive Step. It’s a good idea 
to write down what is assumed (here the case n), 


Hy > 1+, (3) 
and what needs to be proved (here the case n + 1), 
1 
eae at (4) 


It’s also a good idea to write the formulas for any ex- 
pressions that occur. Using equation (1), we may write 
1 


1 
Jeb Slee Seale oe 5 
Bt ete, (5) 


and 


1 1 
Hons Si Gaaarer s 
It’s not so evident from the last equation that Ho ap- 
pears as the first 2” terms of H>n+1. Let’s rewrite the last 


equation as 


1 1 1 1 
Ayn —all = cS see 
2 +5 + on + aq + Qnt+l 
(6) 


to make it clear that H2» appears as the first 2” terms of 
Ans. 

For clarity, we have written the term that follows 
1/2”. Notice that the denominators increase by one, so 
the term that follows 1/2” is 1/(2”+1). Also notice that 
there is a big difference between 1/(2” + 1), the term 
that follows 1/2”, and 1/2”*!, the last term in equa- 
tion (6). 

Using equations (5) and (6), we may relate Hn to 
Hon+1 explicitly by writing 


1 
He et ae th eel (7) 
Combining (3) and (7), we obtain 
1 1 
Hes Slee ae +---4 (8) 


2 2+1 Qn+1” 
This inequality shows that H>.+: is greater than or 
equal to 
n 1 1 
21 ep tt peat 


but our goal (4) is to show that H>,+: is greater than or 
equal to 1 + (n+ 1)/2. We will achieve our goal if we 
show that 


n 1 1 n+1 

hs t ee 2 ils : 
Nl Berl 
In general, to prove an inequality, we replace terms 
in the larger expression with smaller terms so that the 
resulting expression equals the smaller expression; or 
we replace terms in the smaller expression with larger 
terms so that the resulting expression equals the larger 
expression. Here let’s replace each of the terms in the 
sum 


1 ae i! 
We AL, jl " gn+i 
by the smallest term 1/2”*! in the sum. We obtain 
1 1 1 
DE A jl ee atl = gett + °° pati” 


Since there are 2” terms in the latter sum, each equal 
to 1/2”*!, we may rewrite the preceding inequality as 
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1 1 Il 1 
ad + ere + Ft = art | oe all aH 
ical 1 
=2 gn+l = Du (9) 
Combining (8) and (9), 
ig . il n+1 
He sat ia 


We have the desired result, and the Inductive Step is 
complete. 


Formal Solution 


The formal solution could be written as follows. 


Basis Step (n = 0) 


0 
Jah SI & lS Map = 


2 = ) 
Inductive Step 
We assume (2). Now 
H. _ 1 i l Ih i 1 i l a 1 
ae a : 5) ' : gn : Panel gnt+l 
1 1 
= Ho see 
2 als Qn ak il als Qnt+l 
rr n 1 1 
+ 2° Qnt1 ae Qn+1 
n 1 
— 1 n 
) Qnt+l 
id 1 n+1 
= il =14 , 
2? D 


Summary of Problem-Solving Techniques 


= Look at concrete examples of the expressions un- 
der consideration, typically for small values of 
the variables. 


= Look for expressions for small values of n to ap- 
pear within expressions for larger values of n. In 
particular, the Inductive Step depends on relating 
case n to casen+ 1. 


= Delay combining terms and simplifying until as 
late as possible to help discover relationships 
among the expressions. 


= Write out in full the specific cases to prove, 
specifically, the smallest value of n for the Basis 
Step, the case n that is assumed in the Inductive 
Step, and the case n + | to prove in the Induc- 
tive Step. Write out the formulas for the various 
expressions that appear. 
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= To prove an inequality, replace terms in the larger 2. Prove that 
expression with smaller terms so that the re- [eid oboe Mefeh (EU NVEh ars 
sulting expression equals the smaller expres- 2 i i 


: : j > 
sion, or replace terms in the smaller expression for alln > 1. 


with larger terms so that the resulting expression 3. Prove that 
equals the larger expression. iG) SB an - ' 
n 
Comments for alln > 1. 
mineieenes 4. Prove that 
eae lo fil 4D 0 18h thoes 2hiplat. 
oy 3} ; n(n + 1) n(n + 1) 
which surfaces in calculus, is called the harmonic se- as 2 An+1 4 
ries. Inequality (2) shows that the harmonic numbers for alln > 1. 
pene heen bound. In calculus terminology, the BR eadien 
armonic series diverges. Hy fy Ay | He Aq 1 1 
Exercises Ten ys ag fe epee et 
1. Prove that Hy» < 1+ nforalln > 0. for alln > 1. 


2.5 Strong Form of Induction and 
the Well-Ordering Property 


In the Inductive Step of mathematical induction presented in Section 2.4, we assume 
that statement n is true, and then prove that statement n + | is true. In other words, to 
prove that a statement is true (statement n + 1), we assume the truth of its immediate 
predecessor (statement 7). In some cases in the Inductive Step, to prove a statement is 
true, it is helpful to assume the truth of all of the preceding statements (not just the 
immediate predecessor). The Strong Form of Mathematical Induction allows us to 
assume the truth of all of the preceding statements. Following the usual convention, the 
statement to prove is denoted n rather than n+ 1. We next formally state the Strong Form 
of Mathematical Induction. 


Strong Form of Suppose that we have a propositional function S(n) whose domain of discourse is the 
Mathematical Induction set of integers greater than or equal to ng. Suppose that 
S(no) is true; 


for alln > no, if S(k) is true for all k, nop < k <n, then S(n) is true. 


Then S(7) is true for every integer n > no. 


In the Inductive Step of the Strong Form of Mathematical Induction, we let n 
denote an arbitrary integer, n > no. Then, assuming that S(x) is true for all k satisfying 


no <k<n, (2.5.1) 


we prove that S(7) is true. In inequality (2.5.1), k indexes a statement S(k) that is an 
arbitrary predecessor of the statement S(7) (thus k < n), which we are to prove true. In 
inequality (2.5.1), m9 < k ensures that k is in the domain of discourse 


{n9, No + 1,9 + 2,...}. 


Example 2.5.1 


Example 2.5.2 
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The two forms of mathematical induction are logically equivalent (see Exercise 38). 
We present several examples that illustrate the use of the Strong Form of Mathe- 
matical Induction. 


Use mathematical induction to show that postage of 4 cents or more can be achieved by 
using only 2-cent and 5-cent stamps. 


SOLUTION Discussion Consider the Inductive Step, where we want to prove that 
n-cents postage can be achieved using only 2-cent and 5-cent stamps. It would be par- 
ticularly easy to prove this statement if we could assume that we can make postage of 
n — 2 cents. We could then simply add a 2-cent stamp to make n-cents postage. How 
simple! If we use the Strong Form of Mathematical Induction, we can assume the truth 
of the statement for all k <n. In particular, we can assume the truth of the statement for 
k =n-—2. Thus the Strong Form of Mathematical Induction allows us to give a correct 
proof based on our informal reasoning. 

In this example, no in inequality (2.5.1) is equal to 4. When we take k = n — 2, to 
ensure that mg < k, that is 4 < n— 2, we must have 6 < n. Nown = 4 is the Basis Step. 
What about n = 5? We explicitly prove this case. By convention, we add the case n = 5 
to the Basis Step; thus the cases n = 4 and n = 5 become the Basis Steps. In general, 
if the Inductive Step assumes that the case n — p is true (in this example, p = 2), there 
will be p Basis Steps: n = n9,n =n +1,...,n=n9+p-— 1. 


Basis Steps (n = 4, n= 5) 


We can make 4-cents postage by using two 2-cent stamps. We can make 5-cents postage 
by using one 5-cent stamp. The Basis Steps are verified. 


Inductive Step 


We assume that n > 6 and that postage of k cents or more can be achieved by using 
only 2-cent and 5-cent stamps for 4 < k <n. 

By the inductive assumption, we can make postage of n — 2 cents. We add a 2-cent 
stamp to make n-cents postage. The Inductive Step is complete. < 


When an element of a sequence is defined in terms of some of its predecessors, the 
Strong Form of Mathematical Induction is sometimes useful to prove a property of the 
sequence. For example, suppose that the sequence c,, cz, . . . is defined by the equations! 


cy = 0; Cn = Cinj2) +n foralln > 1. 


As examples, 
C2 = Cpa) +2 =e, +2 =e, +2=042=2, 
C3 =¢37) +3 =¢u5; +3 =e, +3=04+3 53, 
C4 = C42) +4 =cyp) +4 =02+4=24+4=6, 
C5 = C52) +5 = C25; +S = CQ +5 =2+5=7. 


Use strong induction to prove that c, < 2n, for alln > 1. 


SOLUTION Discussion In this example, no in inequality (2.5.1) is equal to 1 and 
(2.5.1) becomes 1 < k < n. In particular, since c, is defined in terms of ¢\,/2), in the 
Inductive Step we assume the truth of the statement for k = [n/2]. Inequality (2.5.1) 


iThe floor of x, |x], is the greatest integer less than or equal to x (see Section 3.1). Informally, we are “rounding 
down.” Examples: [2.3] = 2, [5] =5, |—2.7] = —3. 
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Example 2.5.3 


Example 2.5.4 


then becomes | < |[n/2| < n. Because n/2 < n, it follows that [n/2] <n. Ifn > 2, 
then 1 < n/2 and so 1 < |[n/2]. Therefore if n > 2 and k = [n/2], inequality (2.5.1) is 
satisfied. Thus the Basis Step isn = 1. 


Basis Step (n = 1) 
Since c; = 0 < 2 =2 - 1, the Basis Step is verified. 


Inductive Step 


We assume that c, < 2k, for all k, 1 < k <n, and prove that c, < 2n,n > 1. Since 
1 <n,2 <n. Thus 1 <n/2 <n. Therefore 1 < [n/2] < n and taking k = [n/2], we 
see that k satisfies inequality (2.5.1). By the inductive assumption 


Cin/2| = Ck < 2k = 2(|n/2]. 


Now 

Cn = CInj2) +n < 2|[n/2| +n < 2(n/2) +n = 2n. 
The Inductive Step is complete. < 
Define the sequence cj, cz, ... by the equations 


cy = 1, Cn = C|n/2| +n? foralln> 1. 


Suppose that we want to prove a statement for all n > 2 involving c,,. The Inductive Step 
will assume the truth of the statement involving cjn/2;. What are the Basis Steps? 


SOLUTION In this example, o in inequality (2.5.1) is equal to 2 and (2.5.1) becomes 
2 < k <n. In the Inductive Step, we assume the truth of the statement for k = |n/2]. 
Inequality (2.5.1) then becomes 2 < |n/2| < n. Because n/2 < n, it follows that 
[n/2| <n. 

Ifn = 3, then 2 > [n/2]|. Thus we must add n = 3 to the Basis Step (n = 2). If 
n> 4, then 2 < n/2 and so 2 < |n/2]. Therefore ifn > 4 and k = |n/2], inequality 
(2.5.1) is satisfied. Thus the Basis Steps are n = 2 andn = 3. < 


Suppose that we insert parentheses and then multiply the n numbers a,az---+d,. For 
example, if n = 4, we might insert the parentheses as shown: 


(a a2)(a3a4). (2.5.2) 


Here we would first multiply a; by az to obtain a;a2 and a3 by ag to obtain a3a4. We 
would then multiply aja2 by a3a4 to obtain (a\a2)(a3a4). Notice that the number of 
multiplications is three. Use strong induction to prove that if we insert parentheses in 
any manner whatsoever and then multiply the n numbers a,a2--- dn, we perform n — 1 
multiplications. 


SOLUTION 
Basis Step (n = 1) 


We need 0 multiplications to compute a;. The Basis Step is verified. 


Inductive Step 


We assume that for all k, 1 < k <n, it takes k— 1 multiplications to compute the product 
of k numbers if parentheses are inserted in any manner whatsoever. We must prove that 
it takes n multiplications to compute the product aja2 --- a, if parentheses are inserted 
in any manner whatsoever. 


Example 2.5.5 


Theorem 2.5.6 


2.5 ® Strong Form of Induction and the Well-Ordering Property 105 


Suppose that parentheses are inserted in the product a) dp - - - a,. Consider the final 
multiplication, which looks like (a, ---d;)(@i41-+++@n), for some t, 1 < t < n. [For 
example, in equation (2.5.2), tf = 2.] There are rf terms in the first set of parentheses, 
1 < t < n, and n — ¢ terms in the second set of parentheses, | < n — t < n. By the 
inductive assumption, it takes ¢ — 1 multiplications to compute a, ---a, andn —t— 1 
multiplications to compute a,+) --+d,, regardless of how the parentheses are inserted. 
It takes one additional multiplication to multiply a, --- a, by a;41--+d,. Thus the total 
number of multiplications is 


(t-—l)+(—-t—-l1)4+1l=n-1. 


The Inductive Step is complete. < 


Well-Ordering Property 


The Well-Ordering Property for nonnegative integers states that every nonempty set 
of nonnegative integers has a least element. This property is equivalent to the two forms 
of induction (see Exercises 36-38). We use the Well-Ordering Property to prove some- 
thing familiar from long division: When we divide an integer n by a positive integer d, 
we obtain a quotient g and a remainder r satisfying 0 < r < dso thatn = dq +r. 


When we divide n = 74 by d = 13 


5 
13)74 
65 

9 


we obtain the quotient g = 5 and the remainder r = 9. Notice that r satisfies 0 < r < d; 
that is, 0 < 9 < 13. We have 


n=74=13-54+9=dq+r. < 


Quotient-Remainder Theorem 


If d and n are integers, d > 0, there exist integers g (quotient) and r (remainder) 
satisfying 


i =Car ir Wsr<eol 
Furthermore, g and r are unique; that is, if 
n=dq+n Os7 <a 
and 
n=dq.+1ra OS ds 


then qi = q2 andr; = ro. 


Discussion We can devise a proof of Theorem 2.5.6 by looking carefully at the tech- 
nique used in long division. Why is 5 the quotient in Example 2.5.5? Because qg=5 
makes the remainder n — dg nonnegative and as small as possible. If, for example, g = 3, 
the remainder would be n— dq =74— 13 - 3=35, which is too large. As another exam- 
ple, if g=6, the remainder would be n— dq = 74 — 13 - 6= —4, which is negative. The 
existence of a smallest, nonnegative remainder n—dq is guaranteed by the Well-Ordering 
Property. 
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Proof Let 
X = {n—dk|n—dk>0, ke Z}. 


We show that X is nonempty using proof by cases. If n > 0, thenn —d-0 =n > 080 
nis in X. Suppose that n < 0. Since d is a positive integer, 1 — d < 0. Thus n — dn = 
n(1 — d) > 0. In this case, n — dn is in X. Therefore X is nonempty. 

Since X is a nonempty set of nonnegative integers, by the Well-Ordering Property, 
X has a smallest element, which we denote r. We let g denote the specific value of k for 
which r = n — dq. Thenn = dq +r. 

Since r is in X, r> 0. We use proof by contradiction to show that r < d. Suppose 
that r > d. Then 


n—d(qt+1)=n-dq-—d=r-—d=0. 


Thus n — d(q + 1) is in X. Also,n — d(q+1) =r-—d <r. But ris the smallest integer 
in X. This contradiction shows that r < d. 

We have shown that if d and n are integers, d > 0, there exist integers g and r 
satisfying 


n=dq+r O<r<d. 
We turn now to the uniqueness of g and r. Suppose that 

n=dqat+n 0O<rj<d 
and 

n=dqat+hn 0O<1nm <d. 
We must show that g; = qg2 and r; = rz. Subtracting the previous equations, we obtain 

0=n-—n= (dq +r) — (da +r) =d(qi — @) — (2-11), 
which can be rewritten 
d(qi-@) =r -"N. 


The preceding equation shows that d divides r) — r;. However, because 0 < r; < d and 
0<m <d, 


-—d<1rm—-r, <d. 


But the only integer strictly between —d and d divisible by d is 0. Therefore, 7) = ro. 
Thus, d(q, — gz) = 0; hence, g; = q2. The proof is complete. < 


Notice that in Theorem 2.5.6 the remainder r is zero if and only if d divides n. 


2.5 Problem-Solving Tips 


In the Inductive Step of the Strong Form of Mathematical Induction, your goal is to prove 
case n. To do so, you can assume all preceding cases (not just the immediately preceding 
case as in Section 2.4). You could always use the Strong Form of Mathematical Induc- 
tion. If it happens that you needed only the immediately preceding case in the Inductive 
Step, you merely used the form of mathematical induction of Section 2.4. However, 
assuming all previous cases potentially gives you more to work with in proving case n. 
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In the Inductive Step of the Strong Form of Mathematical Induction, when you 
assume that the statement S(k) is true, you must be sure that k is in the domain of dis- 
course of the propositional function S(n). In the terminology of this section, you must 
be sure that no < k (see Examples 2.5.1 and 2.5.2). 

In the Inductive Step of the Strong Form of Mathematical Induction, if you assume 
that case n—p is true, there will be p Basis Steps: n = nop,n =not+1,...,2 =no+p—l. 

In general, the key to devising a proof using the Strong Form of Mathematical 
Induction is to find smaller cases “within” case n. For example, the smaller cases in 
Example 2.5.4 are the parenthesized products (a ---a;) and (a41--+d,) forl <t <n. 


2.5 Review Exercises 


1. State the Strong Form of Mathematical Induction. 


2. State the Well-Ordering Property. 


3. State the Quotient-Remainder Theorem. 


1. Show that postage of 6 cents or more can be achieved by using 
only 2-cent and 7-cent stamps. 


2. Show that postage of 24 cents or more can be achieved by us- 
ing only 5-cent and 7-cent stamps. 


3. Show that postage of 12 cents or more can be achieved by us- 
ing only 3-cent and 7-cent stamps. 


x4. Use the 
If S(n) is true, then S(n + 1) is true 
form of the Inductive Step to prove the statement in Exam- 
ple 2.5.1. 
x5. Use the 
If S(n) is true, then S(n + 1) is true 


form of the Inductive Step to prove the statement in Exercise 1. 
x6. Use the 
If S(n) is true, then S(n + 1) is true 
form of the Inductive Step to prove the statement in Exercise 2. 
Exercises 7 and 8 refer to the sequence c\, C2, ... defined by the 
equations 


ce; = 0, Cn = C{n/2| +r foralln > 1. 

7. Suppose that we want to prove a statement for all n > 3 in- 
volving c,. The Inductive Step will assume the truth of the 
statement involving c{n/2|. What are the Basis Steps? 


8. Suppose that we want to prove a statement for all n > 4 in- 
volving c,. The Inductive Step will assume the truth of the 


statement involving cjn/2). What are the Basis Steps? 
9. Define the sequence c), c2,... by the equations 
cpHen = 0, Cn = Cin/3| +n foralln > 2. 


Suppose that we want to prove a statement for all n > 2 in- 
volving c,. The Inductive Step will assume the truth of the 
statement involving cjn/3;. What are the Basis Steps? 


Exercises 10 and 11 refer to the sequence c,, 2, ... defined by the 
equations 


aes Cn = C|n/2| +n foralln > 1. 


10. Compute c2, c3, c4, and cs. 


11. Prove that c, < 4n? for alln > 1. 


Exercises 12-14 refer to the sequence Cc, ¢2, 
equations 


... defined by the 


cy = 0, Cn = 4e{n/2) +n foralln > 1. 


12. Compute c2, c3, c4, and cs. 
13. Prove that c, < 4(n — 1)? foralln > 1. 


14. Prove that (n + 1)?/8 < cy, for all n > 2. Hint: |n/2| > 
(n — 1)/2 for all n. 


15. Define the sequence co, c1, ... by the equations 
co = 0, Cn = C\n/2} +3 forall n > 0. 


What is wrong with the following “proof” that c, < 2n for all 
n > 3? (You should verify that it is false that c, < 2n for all 
n> 3.) 

We use the Strong Form of Mathematical Induction. 


Basis Step (n = 3) 
We have 
c3 =cy +3 = (co +3) +3 =6<2:-3. 
The Basis Step is verified. 
Inductive Step 
Assume that cy < 2k for all k < n. Then 
Cn = C{n/2) +3 < 2[n/2|4+3 < 2(n/2)+3 = n+3 < ntn = 2n. 


(Since 3 < n,n +3 <n-+n.) The Inductive Step is complete. 
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16. Suppose that we have two piles of cards each containing 
n cards. Two players play a game as follows. Each player, in 
turn, chooses one pile and then removes any number of cards, 
but at least one, from the chosen pile. The player who removes 
the last card wins the game. Show that the second player can 
always win the game. 


In Exercises 17-22, find the quotient q and remainder r as in 
Theorem 2.5.6 when n is divided by d. 

WY 2=47,d=9 18. n=—47,d=9 

19. n=7,d=9 20. n=-7,d=9 

21. n=0,d=9 22. n=47,d=47 

The Egyptians of antiquity expressed a fraction as a sum of frac- 


tions whose numerators were 1. For example, 5/6 might be ex- 
pressed as 


We say that a fraction p/q, where p and q are positive integers, is 
in Egyptian form if 


(2.5.3) 


= T | ncaa ’ 


q ny n2 Nk 


where nj,n2,.. 
see <M. 


., Nk are positive integers satisfying ny < no < 


23. Show that the representation (2.5.3) need not be unique by rep- 
resenting 5/6 in two different ways. 


«24, 


25. By completing the following steps, give a proof by induction 
on p to show that every fraction p/q with 0 < p/q < 1 may be 
expressed in Egyptian form. 


(a) Verify the Basis Step (p = 1). 
(b) Suppose that 0 < p/g <1 and that all fractions i/g’, with 
1<i<p and q arbitrary, can be expressed in Egyptian 


form. Choose the smallest positive integer n with 1/n < 
p/q. Show that 


Show that the representation (2.5.3) is never unique. 


1 
and a 


n=l < : 
q n-l 


(c) Show that if p/g = 1/n, the proof is complete. 
(d) Assume that 1/n < p/q. Let 


Pi=np—q and qi = ng. 
Show that 
1 
Fla? ; ee and p, <p. 
1 q sn on 
Conclude that 
Pi 1. ae 1 
val no m2 Nk 


with m1, 12, ..., nx distinct. 


26. 


«27, 


«28. 


x29, 


x30. 


31. 


32. 


*33. 


34. 


35. 


«36. 


(e) Show that pj/qi < 1/n. 
(f) Show that 


qn n Nk 


and n,n, ..., nx are distinct. 


Use the method of the preceding exercise to find Egyptian 
forms of 3/8, 5/7, and 13/19. 


Show that any fraction p/q, where p and gq are positive inte- 
gers, can be written in Egyptian form. (We are not assuming 
that p/q < 1.) 


Show that any n x n deficient board can be tiled with tromi- 
noes if n is odd, n > 5, and 3 divides n2 — 1. Hint: Use the 
ideas suggested in the hint for Exercise 38, Section 2.4. 


Show that any n x n deficient board can be tiled with tromi- 
noes if n is even, n > 8, and 3 divides n2 — 1. Hint: Use the 
fact that a 4 x 4 deficient board can be tiled with trominoes, 
Exercise 28, and Exercise 36, Section 2.4. 


Show that any m x n deficient rectangle, 2 < m < n, can be 
tiled with trominoes if 3 divides mn — 1, neither side has length 
2 unless both of them do, and m 4 5. 


Give an example of an m x n rectangle with two squares miss- 
ing, where 3 divides mn — 2, that can be tiled with trominoes. 


Give an example of an m x n rectangle with two squares miss- 
ing, where 3 divides mn — 2, that cannot be tiled with tromi- 
noes. 


Which m x n rectangles with two squares missing, where 3 
divides mn — 2, can be tiled with trominoes? 


Give an alternative proof of the existence of g and r in 
Theorem 2.5.6 for the case n > 0 by first showing that 
the set X consisting of all integers k where dk > nisa 
nonempty set of nonnegative integers, then showing that X 
has a least element, and finally analyzing the least element 
of X. 


Give an alternative proof of the existence of qg and r in The- 
orem 2.5.6 using the form of mathematical induction where 
the Inductive Step is “if S(m) is true, then S(n + 1) is true.” 
Hint: First assume that n > 0. Treat the case n = 0 separately. 
Reduce the case n < 0 to the case n > 0. 


Assume the form of mathematical induction where the Induc- 
tive Step is “if S(m) is true, then S(m + 1) is true.” Prove the 
Well-Ordering Property. 


. Assume the Well-Ordering Property. Prove the Strong Form 


of Mathematical Induction. 


. Show that the Strong Form of Mathematical Induction and the 


form of mathematical induction where the Inductive Step is 
“if S(n) is true, then S(n + 1) is true” are equivalent. That 
is, assume the Strong Form of Mathematical Induction and 
prove the alternative form; then assume the alternative form 
and prove the Strong Form of Mathematical Induction. 
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Chapter 2 Notes 


[D’ Angelo; Solow] address the problem of how to construct proofs. Tiling with polyominoes 
is the subject of the book by [Martin]. 

The “Fallacies, Flaws, and Flimflam” section of The College Mathematics Journal, 
published by the Mathematical Association of America, contains examples of mathematical 
mistakes, fallacious proofs, and faulty reasoning. 
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Section 2.1 


COIAMERwWNE 


. Mathematical system 


Axiom 


. Definition 
. Undefined term 
. Theorem 


Proof 
Lemma 


. Direct proof 

. Even integer 

. Odd integer 

. Subproof 

. Disproving a universally quantified statement 
. Begging the question 

14. 


Circular reasoning 


Section 2.2 


. Proof by contradiction 

. Indirect proof 

. Proof by contrapositive 

. Proof by cases 

. Exhaustive proof 

. Proving an if-and-only-if statement 

. Proving several statements are equivalent 
. Existence proof 

. Constructive existence proof 

. Nonconstructive existence proof 


Section 2.3 


25. 


Resolution proof; uses: if p V g and —p V r are both true, 
then q V ris true. 


Chapter 2 Self-Test 


. Distinguish between the terms axiom and definition. 


. What is the difference between a direct proof and a proof 


by contradiction? 


. Show, by giving a proof by contradiction, that if four teams 


play seven games, some pair of teams plays at least two 
times. 


26. 


Clause: consists of terms separated by or’s, where each term 
is a variable or a negation of a variable. 


Section 2.4 


. Principle of Mathematical Induction 
. Basis Step: prove true for the first instance. 
. Inductive Step: assume true for instance n; then prove true 


for instance n + 1. 


. n factorial: n! = n(n —1)---1,0!=1 
. Formula for the sum of the first 1 positive integers: 


n(n + 1) 


Vaasa = 
ep Lap issn 5) 


. Formula for the geometric sum: 


a(r"*! — 1) 


ar +ar'+..-+ar= 
r—1 


7 r#l 


Section 2.5 


33. 
34, 


35. 


4. 


5: 


Strong Form of Mathematical Induction 

Basis Step for the Strong Form of Mathematical Induction: 
prove true for the first instance. 

Inductive Step for the Strong Form of Mathematical Induc- 
tion: assume true for all instances less than n; then prove 
true for instance n. 


. Well-Ordering Property: every nonempty set of nonnegative 


integers has a least element. 


- Quotient-Remainder Theorem: If d and n are integers, 


d>O0, there exist integers g (quotient) and r (remainder) 
satisfying n = dq+r,0 < r<d. Furthermore, g and r 
are unique. 


Prove that for all rational numbers x and y, y # 0, x/y is 
rational. 
Use proof by cases to prove that 


min{min{a, b}, c} = min{a, min{b, c}} 


for all real numbers a, b, and c. 
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Use mathematical induction to prove that the statements in 
Exercises 6—9 are true for every positive integer n. 


6. 


a 


10. 


11. 


12. 


13. 


2 2+44---4+2n=n(n+ 1) 
2n(n + 1)(2n + 1) 


P4+a4t...4+(2n)? = 5 
Vi 20 n 1 
21° 3! a+D! a+! 


tl ep 4(nt 12" 


Prove that for all integers m and n, if m and m — n are odd, 
then n is even. 


Prove that the following are equivalent for sets A and B: 
(gaACB 


(b)ANB=S (c)AUB=B 


Prove that for all sets X, Y, and Z, if X C Y and Y C Z, then 


XCZ. 


Find the quotient g and remainder r as in Theorem 2.5.6 
when n = 101 is divided by d = 11. 


Exercises 14 and 15 refer to the sequence c, C2, . 


.. defined by 


the equations 


14. 
15. 
16. 


17. 


18. 


19. 


20. 


c) = 0, Cn = 2C\nj2) +n foralln > 1. 


Compute 2, c3, c4, and cs. 
Prove that c, < nlgnforalln > 1. 


Use the Well-Ordering Property to show that any nonempty 
set X of nonnegative integers that has an upper bound con- 
tains a largest element. Hint: Consider the set of integer up- 
per bounds for X. 


Find an expression, which is the and of clauses, equivalent 
to(pvg)>r. 


Find an expression, which is the and of clauses, equivalent 
to (p V 7=q) > 71s. 


Use resolution to prove 

PVG 

aq V 7r 

pYv-r 

Ea g 
Reprove Exercise 19 using resolution and proof by 
contradiction. 
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1. 


Implement proof by resolution as a program. 


2. Write a program that gives an Egyptian form of a fraction. 


Functions 

Sequences and Strings 
Relations 

Equivalence Relations 
Matrices of Relations 
Relational Databases 
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Chapter 3 


FUNCTIONS, 
SEQUENCES, 
AND RELATIONS 


All of mathematics, as well as subjects that rely on mathematics, such as computer sci- 
ence and engineering, make use of functions, sequences, and relations. 

A function assigns to each member of a set X exactly one member of a set Y. 
Functions are used extensively in discrete mathematics; for example, functions are used 
to analyze the time needed to execute algorithms. 

A sequence is a special kind of function. A list of the letters as they appear in a 
word is an example of a sequence. Unlike a set, a sequence takes order into account. 
(Order is obviously important since, for example, form and from are different words.) 

Relations generalize the notion of functions. A relation is a set of ordered pairs. 
The presence of the ordered pair (a, b) ina relation is interpreted as indicating a relation- 
ship from a to b. The relational database model that helps users access information in a 
database (a collection of records manipulated by a computer) is based on the concept of 
relation. 


Functions 


Credit card numbers typically consist of 13, 15, or 16 digits. For example, 
4690 3582 1375 4657 (3.1.1) 


is a hypothetical credit card number. The first digit designates the system. In (3.1.1), 
the first digit, 4, shows that the card would be a Visa card. The following digits spec- 
ify other information such as the account number and the bank number. (The precise 
meaning depends on the type of card.) The last digit is special; it is computed from the 
preceding digits and is called a check digit. In (3.1.1), the check digit is 7 and is com- 
puted from the preceding digits 4690 3582 1375 465. Credit card check digits are used 
to identify certain erroneous card numbers. It is not a security measure, but rather it is 
used to help detect errors such as giving a credit card number over the phone and having 
it transcribed improperly or detecting an error in entering a credit card number while 
ordering a product online. 


i This section can be omitted without loss of continuity. 
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For more on 
functions, see 
goo.gl/V3y4pS 


The check digit is computed as follows. Starting from the right and skipping the 
check digit, double every other number. If the result of doubling is a two-digit num- 
ber, add the digits; otherwise, use the original digit. The other digits are not modified. 


469035 8213 7546 5 

Yl lb bbl lb v + YL YY Double every other digit. 

8 6 18 0 65 162 2 3 14 5 8 6 10 

Ylivbilbv vil bv +b + ¥) Add digits of two-digit numbers. 
8 69 065 7223 558 6 1 


Sum the resulting digits 


8+64+9+04+64+54+74+24+24+34+54+548+4+64+1=73. 


If the last digit of the sum is 0, the check digit is 0. Otherwise, subtract the last digit of 
the sum from 10 to get the check digit, 1O—3 = 7. Verify the check digit on your favorite 
Visa, MasterCard, American Express, or Diners Club card. This method of calculating 
a check digit is called the Luhn algorithm. It is named after Hans Peter Luhn (1896— 
1964), who invented it while at IBM. Although originally patented, it is now in the 
public domain and is widely used. 

One common error in copying a number is to change one digit. Each undoubled 
digit contributes a unique value to the sum (0 — 0, 1 — 1, etc.). Each doubled digit 
also contributes a unique value to the sum (0 > 0, 1 > 2,..., 4 > 8, 5 > 1, 
6 — 3,..., 9 — 9). Thus if a single digit is changed in a credit card number, the 
sum used in the Luhn algorithm will change by an absolute amount less than 10, and 
the check digit will change. In the preceding example if 1 is changed to 7, the Luhn 
algorithm calculation becomes 


469 035 8 273 7546 5 

Yi) LY + LY LY YY YL 4 Double every other digit 

8 6 18 0 6 5 16 2 14 3 14 5 8 6 10 

YY LLY tb LY LY bY YL 4) Add digits of two-digit numbers. 
8 69 065 72 5 3 558 6 1 


and the sum becomes 
8+6+9+04+6454+7424+5434+54+54+8+46+1=76. 


Therefore the check digit changes to 4. Thus, if 1 is inadvertently transcribed as 7, the 
error will be detected. 

Another common error is transposition of adjacent digits. For example, if 82 is 
inadvertently written as 28, the error will be detected by the Luhn algorithm because 
the check digit will change (check this). In fact, the Luhn algorithm will detect every 
transposition of adjacent digits except for 90 and 09 (see Computer Exercise 4). 

The Luhn algorithm gives an example of a function. A function assigns to each 
member of a set X exactly one member of a set Y. (The sets X and Y may or may not 
be the same.) The Luhn algorithm assigns to each integer 10 or greater (so there is a 
number available to compute a check digit) a single-digit integer, the check digit. In 
the preceding example, the integer 469035821375465 is assigned the value 7, and the 
integer 469035827375465 is assigned the value 4. We can represent these assignments 
as ordered pairs: 


(469035821375465,7) and (469035827375465, 4). 


Formally, we define a function to be a particular kind of set of ordered pairs. 


Example 3.1.2 


Example 3.1.3 


Figure 3.1.1 The arrow diagram 
of the function of Example 3.1.3. 
There is exactly one arrow from 
each element in X. 


Example 3.1.4 


Example 3.1.5 
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Definition 3.1.1 > Let X and Y be sets. A function f from X to Y is a subset of 
the Cartesian product X x Y having the property that for each x € X, there is exactly one 
y € Y with (x, y) € f. We sometimes denote a function f from X to Y as f: X > Y. 

The set X is called the domain of f and the set Y is called the codomain of f. 
The set 


fy | @y) €f} 


(which is a subset of the codomain Y) is called the range of f. < 


For the check digit function, the domain is the set of positive integers 10 or greater 
and the range is the set of single-digit integers. We can take the codomain to be any set 
containing the set of single-digit integers, for example, the set of nonnegative integers. <4 


The set f = {(1, a), (2, d), (3, a)} is a function from X = {1, 2,3} to Y = {a,b,c}. 
Each element of X is assigned a unique value in Y: 1 is assigned the unique value a; 2 is 
assigned the unique value b; and 3 is assigned the unique value a. We can depict the 
situation as shown in Figure 3.1.1, where an arrow from j to x means that we assign the 
letter x to the integer j. We call a picture such as Figure 3.1.1 an arrow diagram. For an 
arrow diagram to be a function, Definition 3.1.1 requires that there is exactly one arrow 
from each element in the domain. Notice that Figure 3.1.1 has this property. 

Definition 3.1.1 allows us to reuse elements in Y. For the function f, the element a 
in Y is used twice. Further, Definition 3.1.1 does not require us to use all the elements in 
Y. No element in X is assigned to the element c in Y. The domain of f is X, the codomain 
of f is Y, and the range of f is {a, b}. < 


The set 


{(1, a), (2, a), (3, b)} (3.1.2) 


is not a function from X = {1, 2, 3, 4} to Y = {a, b, c} because the element 4 in X is not 
assigned to an element in Y. It is also apparent from the arrow diagram (see Figure 3.1.2) 
that this set is not a function because there is no arrow from 4. The set (3.1.2) is a function 
from X’ = {1, 2, 3} to Y = {a, b, c}. 


oe 
xXx Bi 


Figure 3.1.2 The arrow diagram 

of the set in Example 3.1.4, which 

is not a function because there is 

no arrow from 4. 4 


The set {(1, a), (2, b), (3, c), C1, b)} is not a function from X = {1, 2,3} to Y = {a, b, c} 
because | is not assigned a unique element in Y (1 is assigned the values a and b). It is 
also apparent from the arrow diagram (see Figure 3.1.3) that this set is not a function 
because there are two arrows from 1. 
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Example 3.1.6 


Example 3.1.7 


Example 3.1.8 


Example 3.1.9 


.-™ 
} | 
x Y 


Figure 3.1.3 The arrow diagram 
of the set in Example 3.1.5, which 
is not a function because there are 
two arrows from 1. 4 


Given a function f from X to Y, according to Definition 3.1.1, for each element x 
in the domain X, there is exactly one y in the codomain Y with (x, y) € f. This unique 
value y is denoted f(x). In other words, f(x) = y is another way to write (x, y) € f. 


For the function f of Example 3.1.3, we may write f(1) = a, f(2) = b, and f(3) =a. 4 


If we call the check digit function L, we may write 


L(469035821375465) = 7 and L(469035827375465) = 4. 4 


The next example shows how we sometimes use the f(x) notation to define a 
function. 


Let f be the function defined by the rule f(x) = x”. For example, f(2) = 4, f(—3.5) = 
12.25, and f(0) = 0. Although we frequently find functions defined in this way, the 
definition is incomplete since the domain and codomain are not specified. If we are told 
that the domain is the set of all real numbers and the codomain is the set of all nonnegative 
real numbers, in ordered-pair notation, we would have 


FHzaIe x’) |x is areal number}. 


The range of f is the set of all nonnegative real numbers. < 


Most calculators have a 1 /x key. If you enter a number and hit the 1 /x key, the reciprocal 
of the number entered (or an approximation to it) is displayed. This function can be 
defined by the rule 


1 


The domain is the set of all numbers that can be entered into the calculator and whose 
reciprocals can be computed and displayed by the calculator. The range is the set of all 
the reciprocals that can be computed and displayed. We could define the codomain also 
to be the set of all the reciprocals that can be computed and displayed. Notice that by 
the nature of the calculator, the domain and range are finite sets. < 


Another way to visualize a function is to draw its graph. The graph of a function 
Ff whose domain and codomain are subsets of the real numbers is obtained by plotting 
points in the plane that correspond to the elements in f. The domain is contained in the 
horizontal axis and the codomain is contained in the vertical axis. 


Example 3.1.12 


Example 3.1.13 


Example 3.1.14 
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Example 3.1.10 The graph of the function f(x) = x? is shown in Figure 3.1.4. < 
y 
4 
fix) 
4 
(1, 3) 
(2,2) 
d, 1) 
(0, 0) —_ en 
t— +— ~Xx rt 
2-1 1 2 
Figure 3.1.4 The graph of Figure 3.1.5 A set that is not a 
f(x) =x. function. The vertical line x = | 


intersects two points in the set. 


We note that a set S of points in the plane defines a function precisely when each 
vertical line intersects at most one point of S. If some vertical line contains two or more 
points of some set, the domain point does not assign a unique codomain point and the 
set does not define a function (see Figure 3.1.5). 

Functions involving the modulus operator play an important role in mathematics 
and computer science. 


Definition 3.1.11 >  Ifxisaninteger and yisa positive integer, we define x mod 


y to be the remainder when x is divided by y. < 
We have 
6mod2=0, 5mod1=0, 8mod12=8, 199673 mod2=1. < 


The check digit calculated by the Luhn algorithm can be written 
[10 — (S mod 10)] mod 10, 


where S is the sum used in the intermediate step of the calculation. The last digit in S 
is given by S mod 10. It this digit is 1 through 9, inclusive, 10 — (S mod 10) gives the 
check digit and the last “mod 10” is unnecessary, but harmless. However, if the last digit 
in S is 0, 10 — (S mod 10) = 10. In this case, adding the last “mod 10” gives the check 
digit as 0. < 


What day of the week will it be 365 days from Wednesday? 


SOLUTION Seven days after Wednesday, it is Wednesday again; 14 days after Wednes- 
day, it is Wednesday again; and in general, if n is a positive integer, 7n days after Wednes- 
day, it is Wednesday again. Thus we need to subtract as many 7’s as possible from 365 
and see how many days are left, which is the same as computing 365 mod 7. Since 
365 mod 7 = 1, 365 days from Wednesday, it will be one day later, namely Thurs- 
day. This explains why, except for leap year, when an extra day is added to Febru- 
ary, the identical month and date in consecutive years move forward one day of the 
week. < 
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Example 3.1.15 


Go Online 
For more on hash 
functions, see 
goo.gl/V3y4pS 


Example 3.1.16 


Hash Functions Suppose that we have cells in a computer memory indexed from 0 to 
10 (see Figure 3.1.6). We wish to store and retrieve arbitrary nonnegative integers in 
these cells. One approach is to use a hash function. A hash function takes a data item 
to be stored or retrieved and computes the first choice for a location for the item. For 
example, for our problem, to store or retrieve the number n, we might take as the first 
choice for a location, n mod 11. Our hash function becomes h(n) = n mod 11. Figure 
3.1.6 shows the result of storing 15, 558, 32, 132, 102, and 5, in this order, in initially 
empty cells. 


132 102 15 5 257 558 32 


0 1 2 3 4 5 6 7 8 9 10 


Figure 3.1.6 Cells in a computer memory. 


Now suppose that we want to store 257. Since h(257) = 4, then 257 should be 
stored at location 4; however, this position is already occupied. In this case we say that 
a collision has occurred. More precisely, a collision occurs for a hash function H if 
H(x) = H(y), but x # y. To handle collisions, a collision resolution policy is required. 
One simple collision resolution policy is to find the next highest (with 0 assumed to 
follow 10) unoccupied cell. If we use this collision resolution policy, we would store 
257 at location 6 (see Figure 3.1.6). 

If we want to locate a stored value n, we compute m = h(n) and begin looking at 
location m. If n is not at this position, we look in the next-highest position (again, 0 is 
assumed to follow 10); if7 is not in this position, we proceed to the next-highest position, 
and so on. If we reach an empty cell or return to our original position, we conclude that 
n is not present; otherwise, we obtain the position of n. 

If collisions occur infrequently, and if when one does occur it is resolved quickly, 
then hashing provides a very fast method of storing and retrieving data. As an example, 
personnel data are frequently stored and retrieved by hashing on employee identification 
numbers. < 


Pseudorandom Numbers Computers are often used to simulate random behavior. A 
game program might simulate rolling dice, and a client service program might simu- 
late the arrival of customers at a bank. Such programs generate numbers that appear 
random and are called pseudorandom numbers. For example, the dice-rolling pro- 
gram would need pairs of pseudorandom numbers, each between | and 6, to simu- 
late the outcome of rolling dice. Pseudorandom numbers are not truly random; if one 
knows the program that generates the numbers, one could predict what numbers would 
occur. 

The method usually used to generate pseudorandom numbers is called the linear 
congruential method. This method requires four integers: the modulus m, the multiplier 
a, the increment c, and a seed s satisfying 2 <a <m, 0<c<m, and0O <s <m. We 
then set xy) = s. The sequence of pseudorandom numbers generated, x), x2,..., is given 
by the formula 


Xp = (AX,»—| +c) mod m. 


The formula computes the next pseudorandom number using its immediate predecessor. 
For example, ifm = 11, a=7, c=5, ands = 3, then 


xX, = (axp +c) mod m= (7 -3 +5) mod 11 =4 


Example 3.1.18 


Example 3.1.19 
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and 
X2 = (ax; +c) mod m= (7 -4+5) mod 11 = 0. 


Similar computations show that the sequence continues: 


x3 = 5, x4 = 7, X5 = 10, % = 9, 7 = 2, xg = 8, XH = 6, X19 = 3. 


Since x;9 = 3, which is the value of the seed, the sequence now repeats: 3, 4, 0, 5, 
Teta 

Much effort has been invested in finding good values for a linear congruential 
method. Critical simulations such as those involving aircraft and nuclear research require 
“good” random numbers. In practice, large values are used for m and a. Commonly used 
values are m = 23! — 1 = 2,147,483,647; a = 7° = 16,807; and c = 0, which generate 
a sequence of 2°! — | integers before repeating a value. < 


In the 1990s, Daniel Corriveau of Quebec won three straight games of a com- 
puter keno game in Montreal, each time choosing 19 of 20 numbers correctly. The odds 
against this feat are 6 billion to 1. Suspicious officials at first refused to pay him. Although 
Corriveau attributed his success to chaos theory, what in fact happened was that when- 
ever power was cut, the random number generator started with the same seed, thus gen- 
erating the same sequence of numbers. The embarrassed casino finally paid Corriveau 
the $600,000 due him. 

We next define the floor and ceiling of a real number. 


Definition 3.1.17 >» The floor of x, denoted Lx}, is the greatest integer less than 
or equal to x. The ceiling of x, denoted [x], is the least integer greater than or equal 
tox. < 


[8.3] =8, [9.1] = 10, [-8.7] =—9, [-11.3] =—11, [6] =6,[-8]=-8 < 


The floor of x “rounds x down” while the ceiling of x “rounds x up.” We will use 
the floor and ceiling functions throughout the book. 


Figure 3.1.7 shows the graphs of the floor and ceiling functions. A bracket, [ or ], indi- 
cates that the point is to be included in the graph; a parenthesis, ( or ), indicates that the 
point is to be excluded from the graph. 


4 A 
- i i+ 
i = 
+} ++ t—{—}—_}— ++ 
-2 -1 1 2 3 -2 -1 i. 2 3 
[7-1 yj. al 
——-P 72 772 


Figure 3.1.7 The graphs of the floor (left graph) and ceiling (right graph) 
functions. 


Example 3.1.20 The first-class postage rate for mail up to 13 ounces is 92 cents for the first ounce or 


fraction thereof and 20 cents for each additional ounce or fraction thereof. The postage 
P(w) as a function of weight w is given by the equation 
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P(w) 
A 

332 + Ce] 

132 + (J 

112+ €-4 

92 
— }—> w 
12 3 13 


Figure 3.1.8 The graph of the 
postage function P(w) = 92+ 


20fw — 11. 


Example 3.1.21 


Example 3.1.23 


Example 3.1.24 


Example 3.1.25 


Pw) = 92 + 20[w — 1] 13>w>0. 


The expression [w— 1] counts the number of additional ounces beyond 1, with a fraction 
counting as one additional ounce. As examples, 


P(3.7) = 92 + 20[3.7 — 1] = 92 + 20[2.7] = 92 +20 - 3 = 152, 
P(2) = 92 + 20[2 — 1] = 92 + 20/1] = 92+. 20-1=112. 


The graph of the function P is shown in Figure 3.1.8. < 


The Quotient-Remainder Theorem (Theorem 2.5.6) states that if d and n are inte- 
gers, d > 0, there exist integers q (quotient) and r (remainder) satisfying 


n=dq+r O<r<d. 


Dividing by d, we obtain 


Since 0 < r/d < 1, 


l= [+d]-« 


Thus, we may compute the quotient g as |n/d|. Having computed the quotient g, we may 
compute the remainder as r = n — dq. We previously introduced the notation n mod d 
for the remainder. 


We have 36844/2427 = 15.18088...; thus the quotient is g = |36844/2427| = 15. 
Therefore, the remainder 36844 mod 2427 is r = 36844 — 2427 - 15 = 439. We have 
n= dq +r or 36844 = 2427 - 15 + 439. < 


Definition 3.1.22 >» A function f from X to Y is said to be one-to-one (or in- 
jective) if for all x1, x2 € X, if f(x,) =f (x2) then x, = xo. < 


An equivalent way to state Definition 3.1.22 is: If y is an element of the range of 
f, then there is exactly one x in the domain of f such that f(x) = y. If there were two 
distinct elements x; and x2 of the domain of f with f(x1) = y = f(a), then we would 
have f(x) = f(x2) but x, + x.—a counterexample to the claim that f is one-to-one. 

Because the amount of potential data is usually so much larger than the available 
memory, hash functions are usually not one-to-one (see Example 3.1.15). In other words, 
most hash functions produce collisions. 


The function f = {(1, b), (3, a), (2, c)} from X = {1, 2, 3} to Y = {a, b,c, d} is one-to- 
one. < 


The function f = {(1, a), (2, b), (3, a)} is not one-to-one since f(1) = a = f(3). < 


If X is the set of persons who have social security numbers and we assign each person 
x € X his or her social security number SS(x), we obtain a one-to-one function since 
distinct persons are always assigned distinct social security numbers. It is because this 
correspondence is one-to-one that the government uses social security numbers as iden- 
tifiers. < 


Example 3.1.26 


Example 3.1.27 


Example 3.1.28 
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Ifa function from X to Y is one-to-one, each element in Y in its arrow diagram will have 
at most one arrow pointing to it (see Figure 3.1.9). If a function is not one-to-one, some 
element in Y in its arrow diagram will have two or more arrows pointing to it (see Figure 
3.1.10). 


P et 
a 
f f 
xX Y xX Y 
Figure 3.1.9 The function of Figure 3.1.10 A function that is 
Example 3.1.23. This function is not one-to-one. This function is not 
one-to-one because each element in Y one-to-one because a has two arrows 
has at most one arrow pointing to it. pointing to it. This function is not 
This function is not onto Y because onto Y because there is no arrow 
there is no arrow pointing to d. pointing to c. 4 


Prove that the function f(n) = 2n + 1 from the set of positive integers to the set of 
positive integers is one-to-one. 


SOLUTION We must show that for all positive integers ny and np, if f(n,) = f(n2), 
then ny = m. So, suppose that f(m;) = f(n2). Using the definition of f, this latter 
equation translates as 2ny + 1 = 2n2 + 1. Subtracting | from both sides of the equa- 
tion and then dividing both sides of the equation by 2 yields n; = nz. Therefore, f is 
one-to-one. < 


Prove that the function f(n) = 2” — n? from the set of positive integers to the set of 
integers is not one-to-one. 


SOLUTION We must find positive integers ny and m2, nj 4 no, such that f(m,)) = 
J (n2). By checking the graph (see Figure 3.1.11) or otherwise, we find that f(2) = f(4). 
Therefore, f is not one-to-one. 


fm) 
4 


T+ ° 
6+ 
54 
44 
34 
a+ 
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0~—_+_+—_ +++ 
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Figure 3.1.11 The graph of 
fw) =2"- nr. 4 
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Example 3.1.30 
Example 3.1.31 


Example 3.1.32 


Example 3.1.33 


If the range of a function f is equal to its codomain Y, the function is said to be 
onto Y. 


Definition 3.1.29 >» A function from X to Y is said to be onto Y (or surjective) 
if for every y € Y, there exists x € X such that f(x) = y. < 


The function f = {(1, a), (2, c), (3, b)} from X = {1, 2, 3} to Y = {a, b, c} is one-to-one 
and onto Y. < 


The function f = {(1, d), (3,4), (2, c)} from X = {1, 2,3} to Y = {a, b,c, d} is not 
onto Y. < 


Ifa function from X to Y is onto, each element in Y in its arrow diagram will have at least 
one arrow pointing to it (see Figure 3.1.12). If a function from X to Y is not onto, some 
element in Y in its arrow diagram will fail to have an arrow pointing to it (see Figures 
3.1.9 and 3.1.10). 


be 


f 
x Y 


Figure 3.1.12 The function of 

Example 3.1.30. This function is 

one-to-one because each element 

in Y has at most one arrow. This 

function is onto because each 

element in Y has at least one 

arrow pointing to it. 4 


Prove that the function 
fa) = 
Y x2 
from the set X of nonzero real numbers to the set Y of positive real numbers is onto Y. 


SOLUTION We must show that for every y € Y, there exists x € X such that f(x) = y. 
Substituting the formula for f(x), this last equation becomes 


Solving for x, we find 


Notice that 1/,/y is defined because y is a positive real number. If we take x to be the 
positive square root 


then x € X. (We could just as well have taken x = —1/,/y.) Thus, for every y € Y, there 
exists x, namely, x = 1/,/y such that 


Example 3.1.34 


Example 3.1.36 


Example 3.1.37 


Example 3.1.38 


Example 3.1.39 


Example 3.1.40 


3.1 @ Functions 121 


f@) =fA/Vy) = =). 


1 
(1/./y)" 
Therefore, f is onto Y. < 

A function f from X to Y is not onto Y iffor some y € Y, forevery x € X, f(x) # y. 


In other words, y is a counterexample to the claim that for every y € Y, there exists x € X 
such that f(x) = y. 


Prove that the function f(n) = 2n — 1 from the set X of positive integers to the set Y of 
positive integers is not onto Y. 


SOLUTION We must find an element m € Y such that for alln € X,f(n) A m. 
Since f(n) is an odd integer for all n, we may choose for y any positive, even integer, for 
example, y = 2. Then y € Y and f(n) # y for all n € X. Thus f is not onto Y. < 


Definition 3.1.35 » A function that is both one-to-one and onto is called a 


bijection. < 
The function f of Example 3.1.30 is a bijection. < 
If f is a bijection from a finite set X to a finite set Y, then |X| = |Y], that is, the sets 


have the same cardinality and are the same size. For example, f = {(1, a), (2, b), (3, c), 
(4, d)} is a bijection from X = {1,2,3,4} to Y = {a,b,c,d)}. Both sets have four 
elements. In effect, f counts the elements in Y: f(1) = ais the first element in Y; f(2) = b 
is the second element in Y; and so on. < 


Suppose that f is a one-to-one, onto function from X to Y. It can be shown (see 
Exercise 116) that {(y, x) | (x, y) € f} is a one-to-one, onto function from Y to X. This 
new function, denoted f—', is called f inverse. 


For the function f = {(1, a), (2, c), (3, b)}, we have f—! = {(a, 1), (c, 2), (B, 3)}. < 


Given the arrow diagram for a one-to-one, onto function f from X to Y, we can obtain the 
arrow diagram for f—! simply by reversing the direction of each arrow (see Figure 3.1.13, 
which is the arrow diagram for f~!, where f is the function of Figure 3.1.12). 


f = 
Y xX 
Figure 3.1.13 The inverse of the 
function in Figure 3.1.12. The 
inverse is obtained by reversing all 
of the arrows in Figure 3.1.12. 


< 


The function f(x) = 2* is a one-to-one function from the set R of all real numbers onto 
the set R* of all positive real numbers. Derive a formula for f—!(y). 
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Example 3.1.42 


Example 3.1.43 


Example 3.1.44 


Example 3.1.45 


SOLUTION Suppose that (y, x) is in f~!; that is, 

f"0) =x. (3.1.3) 
Then (x, y) € f. Thus, y = 2*. By the definition of logarithm, 

log, y= x. (3.1.4) 


Combining (3.1.3) and (3.1.4), we have f~!(y) = x = log, y. That is, for each y € Rt, 
f(y) is the logarithm to the base 2 of y. We can summarize the situation by saying that 
the inverse of the exponential function is the logarithm function. < 


Let g be a function from X to Y and let f be a function from Y to Z. Given x € X, 
we may apply g to determine a unique element y = g(x) € Y. We may then apply f to 
determine a unique element z = f(y) = f(g(x)) € Z. This compound action is called 
composition. 


Definition 3.1.41 > Let g be a function from X to Y and let f be a function 
from Y to Z. The composition of f with g, denoted f 0 g, is the function 


(fo g)(x) =f(g(x)) 


from X to Z. < 


Given g = {(1, a), (2, a), (3, c)}, a function from X = {1, 2,3} to Y = {a, b, c}, and 
f = {(a, y), (b, x), (c, z)}, a function from Y to Z = {x, y, z}, the composition function 
from X to Z is the function f o g = {(1, y), (2, y), (3, z)}. < 


Given the arrow diagram for a function g from X to Y and the arrow diagram for a 
function f from Y to Z, we can obtain the arrow diagram for the composition f o g 
simply by “following the arrows” (see Figure 3.1.14). 


| Nite 

ret rt 
y Z 

§ 


xX x Z 


f fog 


Figure 3.1.14 The composition of the functions of 

Example 3.1.42. The composition is obtained by 

drawing an arrow from x in X to z in Z provided that 

there are arrows from x to some y in Y and from y to z. 4 


If f(x) = logs x and g(x) = x*, then f(g(x)) = log,(x), and g(f(x)) = (log;x). 


A store offers 15% off the price of certain items. A coupon is also available that offers 
$20 off the price of the same items. The store will honor both discounts. The function 
D(p) = 0.85p gives the cost with 15% off the price p. The function C(p) = p— 20 gives 
the cost using the $20 coupon. The composition 


(Do C)(p) = 0.85(p — 20) = 0.85p — 17 


gives the cost using first the coupon and then the 15% discount. The composition 
(CoD)(p) = 0.85p— 20 gives the cost using first the 15% discount and then the coupon. 


Example 3.1.46 


Example 3.1.48 


Example 3.1.49 


Example 3.1.51 


Example 3.1.52 
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We see that regardless of the price of an item, it is always cheapest to use the discount 
first. < 


Composition sometimes allows us to decompose complicated functions into simpler 
functions. For example, the function f(x) = sin 2x can be decomposed into the func- 
tions 

g(x) = wee h(x) =sinx, w(x) = 2x. 


We can then write f(x) = g(h(w(x))). This decomposition technique is important in 
differential calculus since there are rules for differentiating simple functions such as g, h, 
and w and also rules about how to differentiate the composition of functions. Combining 
these rules, we can differentiate more complicated functions. < 


A binary operator on a set X associates with each ordered pair of elements in X 
one element in X. 


Definition 3.1.47 >» A function from X x X to X is called a binary operator 
on X. < 


Let X = {1,2,...}. If we define f(x, y) = x + y, where x,y € X, then f is a binary 
operator on X. < 


If X is a set of propositions, \, V, +, and < are binary operators on X. < 


A unary operator on a set X associates with each single element of X one element 
in X. 


Definition 3.1.50 >» A function from X to X is called a unary operator on X. @ 


Let U be a universal set. If we define f(X) = X, where X € P(U), then f is a unary 
operator on P(U). < 


If X is a set of propositions, — is a unary operator on X. < 


3.1 Problem-Solving Tips 


The key to solving problems involving functions is clearly understanding the definition 
of function. A function f from X to Y can be thought of in many ways. Formally, f is a 
subset of X x Y having the property that for every x € X, there is a unique y € Y such 
that (x, y) € X x Y. Informally, f can be thought of as a mapping of elements from X to 
Y. The arrow diagram emphasizes this view of a function. For an arrow diagram to be a 
function, there must be exactly one arrow from each element in X to some element in Y. 

A function is a very general concept. Any subset of X x Y having the property that 
for every x € X, there is a unique y € Y such that (x, y) € X x Y is a function. A function 
may be defined by listing its members; for example, {(a, 1), (b, 3), (c, 2), (d, 1)} is a 
function from {a, b, c, d} to {1, 2, 3}. Here, there is apparently no formula for member- 
ship; the definition just tells us which pairs make up the function. 

On the other hand, a function may be defined by a formula. For example, 


{(n,n + 2) | nis a positive integer} 


defines a function from the set of positive integers to the set of positive integers. The 
“formula” for the mapping is “add 2.” 
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The f(x) notation may be used to indicate which element in the codomain is as- 
sociated with an element x in the domain or to define a function. For example, for the 
functionf = {(a, 1), (b, 3), (c, 2), (d, 1)}, we could write f(a) = 1, f(b) = 3, and so on. 
Assuming that the domain of definition is the positive integers, the equation g(n) = n+2 
defines the function {(n, n + 2) | n is a positive integer} from the set of positive integers 
to the set of positive integers. 

To prove that a function f from X to Y is one-to-one, show that for all x1, x2 € X, 
if f(x1) = f(x), then x; = x. 

To prove that a function f from X to Y is not one-to-one, find x1, x. € X,x1 4 xX, 


such that f(x,) = f(x). 


To prove that a function f from X to Y is onto, show that for all y € Y, there exists 


x € X such that f(x) = y. 


To prove that a function f from X to Y is not onto, find y € Y such that f(x) # y 


for allx € X. 


3.1 Review Exercises 


CS mr ntn wm 


10. 


11. 
12. 
13. 


. What is a function from X to Y? 
. Explain how to use an arrow diagram to depict a function. 
. What is the graph of a function? 


. Given a set of points in the plane, how can we tell whether it 
is a function? 


. What is the value of x mod y? 

. What is a hash function? 

. What is a collision for a hash function? 
. What is a collision resolution policy? 

. What are pseudorandom numbers? 


Explain how a linear congruential random number generator 
works, and give an example of a linear congruential random 
number generator. 


What is the floor of x? How is the floor denoted? 
What is the ceiling of x? How is the ceiling denoted? 


Define one-to-one function. Give an example of a one-to-one 
function. Explain how to use an arrow diagram to determine 
whether a function is one-to-one. 
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15. 


16. 


17. 


18. 


19. 


. Define onto function. Give an example of an onto function. 
Explain how to use an arrow diagram to determine whether a 
function is onto. 


What is a bijection? Give an example of a bijection. Explain 
how to use an arrow diagram to determine whether a function 
is a bijection. 


Define inverse function. Give an example of a function and its 
inverse. Given the arrow diagram of a function, how can we 
find the arrow diagram of the inverse function? 


Define composition of functions. How is the composition of f 
and g denoted? Give an example of functions f and g and their 
composition. Given the arrow diagrams of two functions, how 
can we find the arrow diagram of the composition of the func- 
tions? 


What is a binary operator? Give an example of a binary 
operator. 


What is a unary operator? Give an example of a unary 
operator. 


In 


Exercises 1-6, determine which credit card numbers have cor- 


rect check digits. 


1. 5366-2806-9965-4138 
2. 5194-1132-8860-3905 
3. 4004-6067-3429-0019 
4. 3419-6888-7169-444 


5. 3016-4773-7532-21 
6. 4629-952 1-3698-0203 


7. Show that when 82 in the valid credit card number 4690- 
3582-1375-4657 is transposed to 28, the check digit changes. 


Determine whether each set in Exercises 8-12 is a function from 
X = {1,2,3,4} to Y = {a, b,c, d}. If it is a function, find its do- 


main and range, draw its arrow diagram, and determine if it is 
one-to-one, onto, or both. If it is both one-to-one and onto, give 
the description of the inverse function as a set of ordered pairs, 
draw its arrow diagram, and give the domain and range of the in- 
verse function. 


8. {C, a), (2, a), (3, ¢), (4, 5)} 
9. {(,c), (2, a), (3, b), (4, ¢), (2, a} 
10. {(1,c), (2, d), (3, a), (4, b)} 
iL. {(1, d), (2, d), (4, a)} 
12. {(1, b), (2, b), (3, b), (4, d)} 


Draw the graphs of the functions in Exercises 13-16. The domain 
of each function is the set of real numbers. The codomain of each 
function is also the set of real numbers. 


13. f@) = [21 - La] 
15. f(x) = [27] 


14. fx) =x- Lr] 
16. f(x) = |x? — x] 


Determine whether each function in Exercises 17-22 is one-to-one, 
onto, or both. Prove your answers. The domain of each function is 
the set of all integers. The codomain of each function is also the set 
of all integers. 


17. fn) =n+1 


19. f(n) = [n/2] 
21. f(n) = 2n 


18. f(n) =n? -1 
20. f(n) = Inl 
22. f(n) =n? 


Determine whether each function in Exercises 23-28 is one-to-one, 
onto, or both. Prove your answers. The domain of each function is 
Z x Z. The codomain of each function is Z. 


23. f(m,n) =m—n 
25. f(m,n) = mn 
27. f(m,n) =n? +1 


24. f(m,n) =m 
26. f(m,n) = m? +n? 
28. f(m,n) =m+n+2 


29. Prove that the function f from Z* x Z* to Z* defined by 
f(m, n) = 2'"3” is one-to-one but not onto. 


Determine whether each function in Exercises 30-35 is one-to-one, 
onto, or both. Prove your answers. The domain of each function is 
the set of all real numbers. The codomain of each function is also 
the set of all real numbers. 


30. f(x) = 6x—9 31. f(x) = 3x? —3x+1 
32. f(x) = sinx 33. f(x) =2° -4 


34, f(x) = 3" 2 38. (@) = > 


36. Give an example of a function different from those presented 
in the text that is one-to-one but not onto, and prove that your 
function has the required properties. 


37. Give an example of a function different from those presented 
in the text that is onto but not one-to-one, and prove that your 
function has the required properties. 
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38. Give an example of a function different from those presented 
in the text that is neither one-to-one nor onto, and prove that 
your function has the required properties. 


39. Write the definition of “one-to-one” using logical notation 
(i.e., use V, J, etc.). 


40. Use De Morgan’s laws of logic to negate the definition of 
“one-to-one.” 


41. Write the definition of “onto” using logical notation (i.e., use 
V, di ete.). 


42. Use De Morgan’s laws of logic to negate the definition of 
“onto.” 


Each function in Exercises 43-48 is one-to-one on the specified do- 
main X. By letting Y = range of f, we obtain a bijection from X to 
Y. Find each inverse function. 


43. f(x) = 4x4 2, X = set of real numbers 
44, f(x) = 3°, X = set of real numbers 
45. f(x) = 3 log, x, X = set of positive real numbers 


1 
46. f(x) = 3+ -, X = set of nonzero real numbers 
¥ 


47. fx) = 4x3 — 5, X = set of real numbers 
48. f(x) =6+ 27-1 X = set of real numbers 
49. Given 


g ={(1,d), 2,0), 3, a}, 
a function from X = {1, 2,3} to Y = {a, b, c, d}, and 


f = {(a,x), (b, x), (c, 2), (d, w)}, 


a function from Y to Z = {w, x, y, z}, write f o g as a set of 
ordered pairs and draw the arrow diagram of f o g. 


50. Let and g be functions from the positive integers to the pos- 
itive integers defined by the equations 


f(n) =2n+1, g(n) = 3n—1. 


Find the compositions f of, go g,fog,and gof. 


51. Letf and g be functions from the positive integers to the pos- 
itive integers defined by the equations 


fm= r, 
Find the compositions f of, go g,fog,and gof. 


g(n) = 2". 


52. Let f and g be functions from the nonnegative real numbers 
to the nonnegative real numbers defined by the equations 


fo) =[2xJ, ga) =x. 


Find the compositions f of, go g,fog,and gof. 

53. A store offers a (fixed, nonzero) percentage off the price 
of certain items. A coupon is also available that offers a 
(fixed, nonzero) amount off the price of the same items. 
The store will honor both discounts. Show that regardless 
of the price of an item, the percentage off the price, and 


amount off the price, it is always cheapest to use the coupon 
first. 
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In Exercises 54-59, decompose the function into simpler functions 
as in Example 3.1.46. 


54. 
56. 
58. 
60. 


61. 


62. 


63. 


64. 


65. 


: 1 
f(x) = log, (x? + 2) 55. f(x) = 55 


f(x) = sin 2x 57. f(x) = 2sinx 
Cy) — +4 = 
f(x) = B+ sinx) 59. f(x) = ore? 
Given 
f ={@,2) |x X), 
a function from X = {—5, —4, ..., 4, 5} to the set of integers, 


write f as a set of ordered pairs and draw the arrow diagram 
of f. Is f one-to-one or onto? 


How many functions are there from {1,2} to {a, b}?) Which 
are one-to-one? Which are onto? 


Given 
f = {(a, b), (b, a), (c, b)}, 


a function from X = {a, b, c} to X: 
(a) Write fof and f of of as sets of ordered pairs. 
(b) Define 


fl =fofo--of 


to be the n-fold composition of f with itself. Write f? and 
f°” as sets of ordered pairs. 


Let f be the function from X = {0, 1, 2, 3, 4} to X defined by 
f(x) = 4x mod 5. 


Write f as a set of ordered pairs and draw the arrow diagram 
of f. Is f one-to-one? Is f onto? 

Let f be the function from X = {0, 1, 2, 3, 4, 5} to X defined 
by 


f(x) = 4x mod 6. 


Write f as a set of ordered pairs and draw the arrow diagram 
of f. Is f one-to-one? Is f onto? 
An International Standard Book Number (ISBN) is a 
code of 13 characters separated by dashes, such as 
978-1-59448-950-1. An ISBN consists of five parts: a prod- 
uct code, a group code, a publisher code, a code that uniquely 
identifies the book among those published by the particu- 
lar publisher, and a check digit. For 978-1-59448-950-1, the 
product code 978 identifies the product as a book (the same 
scheme is used for other products). The group code is 1, 
which identifies the book as one from an English-speaking 
country. The publisher code 59448 identifies the book as one 
published by Riverhead Books, Penguin Group. The code 
950 uniquely identifies the book among those published by 
Riverhead Books, Penguin Group (Hosseini: A Thousand 
Splendid Suns, in this case). The check digit is 1. 

Let S equal the sum of the first digit, plus three times 
the second digit, plus the third digit, plus three times the 
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fourth digit, ... 
digit is equal to 


, plus three times the twelfth digit. The check 


[10 — (S mod 10)] mod 10. 


Universal Product Codes (UPC), the bar codes that are 
scanned at the grocery store for example, use a similar 
method to compute the check digit. 

Verify the check digit for this book. 


In Exercises 66—71, determine which ISBNs (see Exercise 65) have 
correct check digits. 


. 978-1-61374-376-9 
- 978-0-8108-8139-2 
- 978-0-939460-9 1-5 
» 978-0-8174-3593-6 
. 978-1-4354-6028-7 
. 978-0-684-87018-0 
. Show that if a single digit of an ISBN is changed, the 


check digit will change. Thus, any single-digit error will be 
detected. 


For each hash function in Exercises 73-76, show how the data 
would be inserted in the order given in initially empty cells. Use 
the collision resolution policy of Example 3.1.15. 


73. 


74. 


75. 
76. 
77. 


78. 


79. 


h(x) = x mod 11; cells indexed 0 to 10; data: 53, 13, 281, 
743, 377, 20, 10, 796 


h(x) = x mod 17; cells indexed 0 to 16; data: 714, 631, 26, 
373, 775, 906, 509, 2032, 42, 4, 136, 1028 

h(x) = x2 mod 11; cells and data as in Exercise 73 

h(x) = (x2 + x) mod 17; cells and data as in Exercise 74 


Suppose that we store and retrieve data as described in Exam- 
ple 3.1.15. Will any problem arise if we delete data? Explain. 


Suppose that we store data as described in Example 3.1.15 
and that we never store more than 10 items. Will any prob- 
lem arise when retrieving data if we stop searching when we 
encounter an empty cell? Explain. 


Suppose that we store data as described in Example 3.1.15 
and retrieve data as described in Exercise 78. Will any prob- 
lem arise if we delete data? Explain. 


Let g be a function from X to Y and let f be a function from Y to 
Z. For each statement in Exercises 80-87, if the statement is true, 
prove it; otherwise, give a counterexample. 


80. 
81. 
82. 
83. 
84. 


85. 
86. 
87. 


If g is one-to-one, then f o g is one-to-one. 
If f is onto, then f o g is onto. 

If g is onto, then f 0 g is onto. 

If f and g are onto, then f o g is onto. 


If f and g are one-to-one and onto, then f o g is one-to-one 
and onto. 


If f o g is one-to-one, then f is one-to-one. 
If f o g is one-to-one, then g is one-to-one. 


If f o g is onto, then f is onto. 


If f is a function from X to Y and A C X and B C Y, we define 
fA) =(f@)|xeA}  f-'B) = tre X | f@) € Bh. 
We call f—'(B) the inverse image of B under f. 
88. Let 
g ={C,4), 2,c), GB, ¢)} 


be a function from X = {1,2,3} to Y = {a,b,c, d}. Let 
S = {1}, 7 = {1,3}, U = {a}, and V = {a, c}. Find g(S), 
g(T), g~'(U), and g~!(V). 

x89. Let f be a function from X to Y. Prove that f is one-to-one if 
and only if 

FAN B) = f(A) Of) 

for all subsets A and B of X. [When S is a set, we define 
f(S) = {f@) | x € S}.] 

x90. Let f be a function from X to Y. Prove that f is one-to-one if 


and only if whenever g is a one-to-one function from any set 
A to X,f o g is one-to-one. 


x91. Let f be a function from X to Y. Prove that f is onto Y if and 
only if whenever g is a function from Y onto any set Z, go f 
is onto Z. 


92. Let f be a function from X onto Y. Let 
S=(f'(b) lye ¥}. 
Show that S is a partition of X. 


Let R¥ denote the set of functions from R to R. We define the eval- 
uation function Eg, where a € R, from RE to Ras 


Ea(f) = f(a). 


93. Is E; one-to-one? Prove your answer. 
94. Is E; onto? Prove your answer. 


95. Let f be a function from R to R such that for some r € R, 
(rx) = rf(x) for all x € R. Prove that f(r"x) = r"f(x) for 
allxe R,ne Zt. 


Exercises 96-100 use the following definitions. Let X = {a, b, c}. 
Define a function S from P(X) to the set of bit strings of length 3 
as follows. Let Y C X. Ifa € Y, sets; = 1; ifa € Y, sets; =0. 
Ifb €Y, set sy = 1; ifb € Y, sets. = 0. Ifc € Y, set s3 = 1; if 
c ZY, set s3 = 0. Define S(Y) = 515283. 

96. What is the value of S({a, c})? 

97. What is the value of S(@)? 

98. What is the value of S(X)? 

99. Prove that S is one-to-one. 
100. Prove that S is onto. 


Exercises 101-107 use the following definitions. Let U be a uni- 
versal set and let X C U. Define 


com af Hex 
m=) Gy rege 
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We call Cx the characteristic function of X (in U). (A look ahead 

at the next Problem-Solving Corner may help in understanding the 

following exercises.) 

101. Prove that Cyny (x) = Cx(x)Cy (x) for all x € U. 

102. Prove that Cyuy(x) = Cy(x) + Cy(x) — Cx(x)Cy (a) for all 
xeUu. 

103. Prove that Cy(x) = 1 — Cx(x) forall x € U. 

104. Prove that Cy_y(x) = Cx(x)[1 — Cy(x)] for all x € U. 

105. Prove that if X C Y, then Cy(x) < Cy(x) for all x € U. 

106. Find a formula for Cy,ay. (X A Y is the symmetric difference 


of X and Y. The definition is given before Exercise 101, Sec- 
tion 1.1.) 


107. Prove that the function f from P(U) to the set of characteris- 
tic functions in U defined by 


f(X) = Cx 


is one-to-one and onto. 


108. Let X and Y be sets. Prove that there is a one-to-one func- 
tion from X to Y if and only if there is a function from Y 
onto X. 


A binary operator f on a set X is commutative if f(x, y) = f(y, x) 
for all x, y € X. In Exercises 109-113, state whether the given 
function f is a binary operator on the set X. If f is not a binary 
operator, state why. State whether or not each binary operator is 
commutative. 


109. f@,y)=xt+y, X={1,2,...} 
110. fX,y)=x-y, X= {l,2,...} 
111. fy) =xUy, X= P({1, 2,3, 4) 
112. f(y) =x/y, ®%={0,1,2,...} 


113. f(x,y) =?+y—xy, X={I,2,...} 


In Exercises 114 and 115, give an example of a unary operator 
[different from f (x) = x, for all x] on the given set. 


114, {..., —2,—1,0,1,2,...} 
115. The set of all finite subsets of {1, 2, 3, ...} 


116. Prove that if f is a one-to-one, onto function from X to Y, 
then 


{0 x) | @ y) ef} 


is a one-to-one, onto function from Y to X. 


In Exercises 117-119, if the statement is true for all real numbers, 
prove it; otherwise, give a counterexample. 


117. [x+3] = [x] +3 

118. [x+y] = [x] + ly] 

119. [x+y] = LxJ + Ty] 

120. Prove that if n is an odd integer, 


l-C5)8) 


128 


121. Prove that if n is an odd integer, 


122. Find a value for x for which [2x] = 2[x] — 1. 
123. Prove that 2[x] — 1 < [2x] < 2[x] for all real numbers x. 


124. Prove that for all real numbers x and integers n, [x] = 
n if and only if there exists ¢, 0 < e€ <_ 1, such that 
xt+ée=n. 

125. State and prove a result analogous to Exercise 124 
for |x]. 


The months with Friday the 13th in year x are found in row 


v= (e+ [FF] [Fr] + [00 |) me” 


in the appropriate column: 


Problem-Solving Corner 
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y Non-Leap Year Leap Year 

0 January, January, 
October April, July 

1 April, September, 
July December 

2 September, June 
December 

3 June March, 

November 

4 February, February, 
March, August 
November 

2 August May 

6 May October 


126. Find the months with Friday the 13th in 1945. 
127. Find the months with Friday the 13th in the current year. 
128. Find the months with Friday the 13th in 2040. 
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Problem 
Let U be a universal set and let X C U. Define 


1 atxe x 
cu = { 4 ifr ¢ X. 


[We call Cy the characteristic function of X (in U)]. 
Assume that X and Y are arbitrary subsets of the uni- 
versal set U. Prove that Cyyy(x) = Cy(x) + Cy(x) for 
allx € U if and only if XN Y=2. 


Attacking the Problem 


First, let’s be clear what we must do. Since the state- 
ment is of the form p if and only if g, we have two tasks: 
(1) Prove if p then q. (2) Prove if q then p. It’s a good 
idea to write out exactly what must be proved: 


If Cxur(@) = Cxy(x) + Cy() for all x € U, 


then XN Y= 2. (1) 
If XN Y =, then Cyyy(x) = Cx(x) + Cy) 
for all x € U. (2) 


Consider the first statement in which we assume 
that Cyuy (x) = Cx(x) + Cy(a) for all x € U and prove 
that X M Y = ©. How do we prove that a set, XN Y 
in this case, is the empty set? We have to show that 
X 1 Y has no elements. How do we do that? There are 


several possibilities, but one thing that comes to mind 
is another question: What if X 9 Y had an element? 
This suggests that we might prove the first statement 
by contradiction or by proving its contrapositive. If we 
let 


DP: Cxyuy(x) = Cx(x) + Cy(x) for all x e U 
OR X(N = @, 
the contrapositive is ~q — —p. Now the negation of g 
is 
S17] 3 SCI s2 Bi, 
and, using De Morgan’s law (roughly, negating V 
results in 4), the negation of p is 
ap : Cxuy(x) 4 Cx(x) + Cy(a) for at least one x € U. 
Thus, the contrapositive is 
If XN Y € ©, then Cyuy(x) # Cy(x) + Cy) 
for at least one x € U. (3) 


For the second statement, we assume that XN Y = 
© and prove that Cyuy(x) = Cy(x) + Cy(x) for all 
x € U. Presumably, we can just use the definition of Cy 
to compute both sides of the equation for all x « U and 
verify that the two sides are equal. The definition of Cy 
suggests that we use proof by cases: x € X U Y (when 
Cyuy(x) = 1) and x ¢ X UY (when Cyuy(x) = 0). 


Finding a Solution 


We first consider proving the contrapositive (3) of 
statement (1). Since we assume that X 1 Y # @, there 
exists an element x € XM Y. Now let’s compare the 
values of the expressions Cyuy(x) and Cx(x) + Cy(x). 
Since x € X UY, Cyyy(x) = 1. Sincex € XNY, x € X 
and x € Y. Therefore 


Cx@0) sp GAeo) = Iap ll = 2, 
We have proved that 
Cyuy(x) 4 Cx(x) + Cy(x) for at least one x € U. 
Now consider proving the statement (2). This time 
we assume that X 1 Y = ©. Let’s compute each side 
of the equation 
Cyuy (x) = Cx) + Cy (x) (4) 


for each x € U. As suggested earlier, we consider the 
cases:x € X UY andx ¢ XUY. Ifx € X UY, then 
Cxuy(x) = 1. Since XN Y = @, either x € X orx Ee Y 
but not both. Therefore, 


Cx) + Cy) = 14+0=1 = Cyr) 
or 

Cx) + Cy) = 0+ 1 =1 = CyuyQ). 
Equation (4) is true ifx e X UY. 

Ifx ¢ X UY, then Cyuy(x) = 0. But ifx ¢ XUY, 

then x ¢ X and x ¢ Y. Therefore, 

Cy@) + Cy) =0+0=0 = CyurQ). 
Equation (4) is true if x ¢ X U Y. Thus it is true for all 
xe U. 


Formal Solution 


The formal proof could be written as follows. 
CASE —: If Cyuy(x) = Cx(x) + Cy(x) for all x € U, 
then XN Y=. 


3.2 ® Sequences and Strings 129 


We prove the equivalent contrapositive 


If XNY £ @, then Cyuy(x) # Cx(x) + Cy(x) 


for at least one x € U. 


Since XM Y # @, there exists x € XM Y. Since 
xe XUY, Cyuy() = 1. Since x € XN Y,x € X and 
x € Y. Therefore 


Cy(x) + Cy(x#) = 14+1=2. 
Thus, 


Cxuy (x) # Cx(x) + Cy(x). 


CASE <: If XNY = @, then Cyyy(x) = Cy(x)+Cy(x) 
for allx € U. 

Suppose that x « X UY. Then Cyyy(x) = 1. Since 
XY = @, either x € X or x € Y but not both. There- 
fore, Cy (x) + Cy(x) = 1 and (4) holds. 

Ifx ¢ X UY, then Cyuyy(@®) = 0. Ifx ¢ XUY, 
then x ¢ X and x ¢ Y. Therefore, Cy(x) + Cy(x) = 0. 
Again, (4) holds. Therefore (4) holds for all x € U. 


Summary of Problem-Solving Techniques 


= Write out exactly what must be proved. 


= Instead of proving p — q directly, consider 
proving its contrapositive ~q — —p or a proof 
by contradiction. 

= For statements involving negation, De Morgan’s 
laws can be very helpful. 

= Look for definitions and theorems relevant to the 
expressions mentioned in the statements to be 
proved. 

= A definition that involves cases suggests a proof 
by cases. 


3.2 Sequences and Strings 


Blue Taxi Inc. charges $1 for the first mile and 50 cents for each additional mile. The 
following table shows the cost of traveling from | to 10 miles. In general, the cost C,, of 
traveling n miles is 1.00 (the cost of traveling the first mile) plus 0.50 times the number 
(n — 1) of additional miles. That is, C, = 1 + 0.5(n — 1). As examples, 


Cc, =14+050—-1)=14+05-0=1, 
C5 =1+0.5(5-1)=14+05-4=142=3. 
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Go Online 

For more on 
sequences, see 
goo.gl/V3y4psS 


Example 3.2.2 


Example 3.2.3 


Example 3.2.4 


Mileage Cost 
1 $1.00 
2 1.50 
3 2.00 
+ 2.50 
5 3.00 
6 3.50 
7 4.00 
8 4.50 
9 5.00 

10 5.50 


The list of fares 


C; = 1.00, Cy=1.50, C3; =2.00, Cy=2.50, Cs = 3.00, 
Co = 3.50, C7 =4.00, Cg = 4.50, Cy =5.00, Cio = 5.50 


furnishes an example of a sequence, which is a special type of function in which the 
domain consists of integers. 


Definition 3.2.1 >» A sequence s is a function whose domain D is a subset of 
integers. The notation s,, is typically used instead of the more general function notation 
s(n). The term n is called the index of the sequence. If D is a finite set, we call s a finite 
sequence; otherwise, s is an infinite sequence. < 


For the sequence of fares for Blue Taxi, Inc., the domain of the sequence C is the 
subset of integers {1, 2,3, 4,5, 6, 7, 8, 9, 10}. The sequence C is a finite sequence. 

A sequence s is denoted s or {s,} if n is the index of the sequence. If, for exam- 
ple, the domain is the set of positive integers Z*, s or {s,} denotes the entire sequence 
51, 82, 83,.... We use the notation s, to denote the single element of the sequence s at 
index n. In this book, we will frequently use Z* or Z"°""“8 as the domain of a sequence. 


Consider the sequence s: 2,4,6,...,2n,.... The first element of the sequence is 2, the 
second element of the sequence is 4, and so on. The nth element of the sequence is 2n. If 
the domain of s is Z*, we have s; = 2, 52 =4, 53 =6,..., 5, = 2n,.... The sequence 
s is an infinite sequence. < 


Consider the sequence ft: a, a, b, a, b. The first element of the sequence is a, the second 
element of the sequence is a, and so on. If the domain of ¢ is {1, 2,3, 4,5}, we have 
=a, h =a, th =b, ty =a, and ts = b. The sequence ¢ is a finite sequence. < 


If the domain of a sequence s is the (infinite) set of consecutive integers {k, 
k+1,k+2,...} and the index of s is n, we can denote the sequence s as {s,}°°,. For 
example, a sequence s whose domain is Z"°""*S can be denoted {5,,}0°.9. If the domain of 
a sequence s is the set of (finite) consecutive integers {i,i+1,...,j}, we can denote the 
sequence s as {s,})_,. For example, a sequence s whose domain is {—1, 0, 1, 2, 3} can 


be denoted {s,}? % 


n=-1° 


The sequence {u,} defined by the rule u, = nr — 1, foralln > 0, can be denoted 
{un}p-o- The name of the index can be chosen in any convenient way. For example, the 
sequence u can also be denoted {u,}°° o: Fhe formula for the term having index m is 


m= 


Um =m —1, for allm > 0. < 
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Example 3.2.5 Define a sequence b by the rule b, is the nth letter in the word digital. If the domain of 
Dis {1,2,..., 7}, then b) = d, bp = by =i, and b7 = 1. < 


Example 3.2.6 _ Ifx is the sequence defined by 


1 
An = 5, —Ils<n<4, 
Qn 
the elements of x are 2, 1, 1/2, 1/4, 1/8, 1/16. < 
Example 3.2.7 Define a sequence s as 
Sp = 2" +4. 3" n> 0. (3.2.1) 
(a) Find so. 
(b) Find s;. 
(c) Find a formula for s;. 
(d) Find a formula for s,_1. 
(e) Find a formula for s,_>. 
(f) Prove that {s,,} satisfies 
Sn = 5Sn—1 — OSn—2 for alln > 2. (3.2.2) 


SOLUTION 
(a) Replacing n by 0 in Definition 3.2.1, we obtain 
so = 2944-39 =5. 
(b) Replacing n by 1 in Definition 3.2.1, we obtain 
5 =2'44.3'=14, 
(c) Replacing n by / in Definition 3.2.1, we obtain 
s = 2i+4-3!, 
(d) Replacing n by n — | in Definition 3.2.1, we obtain 
Spay =I 4 I, 
(e) Replacing n by n — 2 in Definition 3.2.1, we obtain 
Sy = 27 4.4.3", 


(f) To prove equation (3.2.2), we will replace s,,_; and s,_> in the right side of equa- 
tion (3.2.2) by the formulas of parts (d) and (e). We will then use algebra to show 
that the result is equal to s,. We obtain 


58-1 — OSp—2 = 5(2"7 1 +4 - 37!) — 62" 7 4.4 - 377) 
= (5-2—6)2"7 4 (5-4-3-6-4)3"7 
=4.2"7 436.3%? 
— 229"-2 4 (4. 323"? 
= 2" 44-3" =. 
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Example 3.2.8 


Example 3.2.9 


Example 3.2.10 


Example 3.2.11 


Example 3.2.13 


The techniques shown in the example will be useful in checking solutions of re- 
currence relations in Chapter 7. < 


Two important types of sequences are increasing sequences and decreasing 
sequences and their relatives, nonincreasing sequences and nondecreasing sequences. 
A sequence s is increasing if for all i and j in the domain of s, if i < j, then s; < sj. A 
sequence s is decreasing if for all i and j in the domain of s, if i < j, then s; > 5;. 
A sequence s is nondecreasing if for all i and j in the domain of s, if i < j, then 
Sj < s;. (A nondecreasing sequence is like an increasing sequence except that, in the 
condition on s, “<” is replaced by “<.”) A sequence s is nonincreasing if for all i and j 
in the domain of s, if i < j, then s; > s;. (A nonincreasing sequence is like a decreasing 
sequence except that, in the condition on s, “>” is replaced by “>.”) Notice that if the 
domain of a sequence s is a set of consecutive integers, if 5; < sj;41 for all i for which i 
and i+ 1 are in the domain of s, then s is increasing. Similar remarks apply to decreasing, 
nondecreasing, and nonincreasing sequences. (We leave formal justification for this last 
comment to Exercise 4.) 


The sequence 2, 5, 13, 104, 300 is increasing and nondecreasing. < 


The sequence 


is decreasing and nonincreasing. < 
The sequence 100, 90, 90, 74, 74, 74, 30 is nonincreasing, but it is not decreasing. <4 


The sequence 100 (consisting of a single element) is increasing, decreasing, nonincreas- 
ing, and nondecreasing since there are no distinct values of i and j for which both 7 and 


j are indexes. < 


One way to form a new sequence from a given sequence is to retain only certain 
terms of the original sequence, maintaining the order of terms in the given sequence. 
The resulting sequence is called a subsequence of the original sequence. 


Definition 3.2.12 >» Let s be a sequence. A subsequence of s is a sequence 
obtained from s by choosing certain terms of s in the same order in which they appear 
ins. < 


The sequence 


b,c (3.2.3) 
is a subsequence of the sequence 
a,a,b,c,q. (3.2.4) 


Subsequence (3.2.3) is obtained by choosing the third and fourth terms from 
sequence (3.2.4). 

Notice that the sequence c, b is not a subsequence of sequence (3.2.4) since the 
order of terms in the sequence (3.2.4) is not maintained. < 


Example 3.2.14 


Example 3.2.15 


Example 3.2.16 


Go Online 

For more on the sigma 
notation, see 
goo.gl/V3y4ps 
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The sequence 
PAR NG: 2g (3.2.5) 
is a subsequence of the sequence 
2,4, 6,8, 10, 12, 14, 16,...,2n,.... (3.2.6) 


Subsequence (3.2.5) is obtained by choosing the first, second, fourth, eighth, and so on, 
terms from sequence (3.2.6). < 


We turn to the notation for a subsequence. A subsequence of a sequence s is 
obtained by choosing certain terms from s. We let 1), 72,3, ... denote the indexes (in 
order) of s of the terms that are selected to obtain the subsequence. Thus the subsequence 
can be denoted {s,, }. 


Let us return to the sequence (3.2.4) in Example 3.2.13, which we now call r: 


h=a h=da b=b, tt=c, t=4¢. 


The subsequence J, c is obtained from ¢ by selecting the third and fourth terms of ¢. Thus 
n, = 3,n2 = 4, and the subsequence J, c is £3, t4, OF ty, , fn. < 


Let us return to the sequence (3.2.6) in Example 3.2.14. We let s,, = 2n, so the sequence 
(3.2.6) can be denoted {s,,}°° ,. The subsequence (3.2.5) is obtained from s by selecting 
the first, second, fourth, eighth, and so on, terms from s; thus 


n=1, m=2, m=4, m=8,.... 
We see that n, = 24-', k = 1,2,.... The subsequence (3.2.5) {s,,}?2, is defined by 


Sng = Soe = 2-2! = 2K, 


In the notation for the original sequence {s,}°° ,, is the index of the sequence s. 
In the notation for the subsequence {5,,,}7°, m is used in a totally different way; n is 
itself a sequence, namely, the sequence of indexes selected to form the subsequence. 
Furthermore, k is the index of the sequence n. It would have been nice to avoid hav- 
ing n used in two different ways, but tradition demands that we continue this abuse of 
notation. < 


Two important operations on numerical sequences are adding and multiplying 
terms. 


Definition 3.2.17 > If {a;}'L,, is a sequence, we define 


n n 


y di = Am + Angi +++ +n, | | Qi = Gm * Am+1°** Ane 


i=m i=m 


The formalism 


x a; (3.2.7) 
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Example 3.2.18 


Example 3.2.19 


Example 3.2.20 


Example 3.2.21 


is called the sum (or sigma) notation and 


Il ai (3.2.8) 


is called the product notation. 
In (3.2.7) or (3.2.8), iis called the index, m is called the lower limit, and n is called 
the upper limit. < 


Let a be the sequence defined by a, = 2n,n > 1. Then 


3 
So aj = a) +n +a; =24+44+6= 12, 


i=! 


3 
[Ja aa -m-g=2-4-6=48. 
i=l < 


The geometric sum a + ar + ar? +--+ ar" can be rewritten compactly using the sum 
notation as S779 ar". < 


It is sometimes useful to change not only the name of the index, but to change 
its limits as well. (The process is analogous to changing the variable in an integral in 
calculus.) 


Changing the Index and Limits in a Sum Rewrite the sum )>/_, ir’, replacing the 
index i by j, where i = j — 1. 


SOLUTION Since i=j— 1, the term ir”! becomes 
G= pro) _ Gj- pr, 


Since j = i+ 1, wheni = 0,7 = 1. Thus the lower limit forj is 1. Similarly, when i = n, 
j =n-+1, and the upper limit for j is n + 1. Therefore, 


n n+l 


yr as Gir. 
j=l 


i=0 


< 


Let a be the sequence defined by the rule a; = 2(—1)', i > 0. Find a formula for the 
sequence s defined by s, = )>j--9 di. 


SOLUTION We find that 


Sp = 2(—1)° + 2(-1)! + 2(-1)? +. --- + 2(-1)" 


2 ifniseven 
=2-24+2--.:-4£2= : . 
0 ifn is odd. 4 


Sometimes the sum and product notations are modified to denote sums and prod- 
ucts indexed over arbitrary sets of integers. Formally, if S is a finite set of integers and 
a is a sequence, ies a; denotes the sum of the elements {a; | i € S}. Similarly, Tlies dj 
denotes the product of the elements {a; | i € S}. 


Example 3.2.22 


Example 3.2.24 


Example 3.2.25 
Example 3.2.26 


Example 3.2.27 


Example 3.2.28 
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If S denotes the set of prime numbers less than 20, 


a er i i Ce cs ee a | 
Sot asteztetato tat at a =l45s. 
7° 2°3°5°7 11" 13" 17" 19 < 


A string is a finite sequence of characters. In programming languages, strings can 
be used to denote text. For example, in Java 


"Let's read Rolling Stone." 
denotes the string consisting of the sequence of characters 
Let’s read Rolling Stone. 


(The double quotes " mark the start and end of the string.) 

Within a computer, bit strings (strings of 0’s and 1’s) represent data and instruc- 
tions to execute. As we will see in Section 5.2, the bit string 101111 represents the 
number 47. 


Definition 3.2.23 >» A string over X, where X is a finite set, is a finite sequence 
of elements from X. < 


Let X = {a, b, c}. If we let 
Bi=b, Ppo=a, fB3=a, Ba=c, 


we obtain a string over X. This string is written baac. < 


Since a string is a sequence, order is taken into account. For example, the string 
baac is different from the string acab. 

Repetitions in a string can be specified by superscripts. For example, the string 
bbaaac may be written ba°c. 

The string with no elements is called the null string and is denoted A. We let X* 
denote the set of all strings over X, including the null string, and we let Xt denote the 
set of all nonnull strings over X. 


Let X = {a, b}. Some elements in X* are A, a, b, abab, and ba? ba. < 


The length of a string a is the number of elements in a. The length of a is de- 
noted ||. 


If « = aabab and B = a*b*a*?, then |a| = 5 and |B| = 39. < 


If a and £ are two strings, the string consisting of a followed by £, written aw, is 
called the concatenation of a and f. 


If y = aab and 6 = cabd, then 
y@ =aabcabd, @y =cabdaab,h yXR=y =aab, hy =y =aab. < 


Let X = {a, b, c}. If we define f(a, 6) = aB, where a and f are strings over X, then f 
is a binary operator on X*. < 


A substring of a string a is obtained by selecting some or all consecutive ele- 
ments of a. The formal definition follows. 
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Example 3.2.30 


Example 3.2.31 


Example 3.2.32 


Definition 3.2.29 >» A string £ is a substring of the string @ if there are strings 
y and 6 witha = yfo. <4 


The string 8 = add is a substring of the string wa = aaaddad since, if we take y = aa 
and 6 = ad, we have a = y #6. Note that if 6 is a substring of a, y is the part of a that 
precedes £ (in a), and 6 is the part of a that follows 6 (in @). < 


Let X = {a, b}. If a € X%, let a® denote a written in reverse. For example, if a = abb, 
a® — bba. Define a function from X* to X* as f(a) = a*. Prove that f is a bijection. 


SOLUTION We must show thatf is one-to-one and onto X*. We first show that f is one- 
to-one. We must show that if f(w) = f(6), then a = B. So suppose that f(a) = f(B). 
Using the definition of f, we have w*® = B®. Reversing each side, we find that a = . 
Therefore, f is one-to-one. 

Next we show that f is onto X*. We must show that if 6 € X*, there exists a € X* 
such that f(a) = 6. So suppose that 6 € X*. If we let a = B*, we have 


R R)R 
f(a) =a" = (B")" = B 
since if we twice reverse a string, we obtain the original string. Therefore, f is onto X*. 


We have proved that f is a bijection. < 


Let X = {a, b}. Define a function from X* x X* to X* as f(a, B) = af. Is f one-to-one? 
Is f onto X*? 


SOLUTION We try to prove that f is one-to-one. If we succeed, this part of the example 
is complete. If we fail, we may learn how to construct a counterexample. So suppose that 
f(a1, Bi) =f (@2, Bo). We have to prove that w; = £; and a2 = f,. Using the definition 


of f, we have 18; = a262. Can we conclude that a} = 6; and a = f? No! It 
is possible to concatenate different strings and produce the same string. For example, 
baa = af, if we set a; = b and f; = aa. Also, baa = a2f> if we set a2 = ba and 


By = a. Therefore f is not one-to-one. We could write up this part of the solution as 
follows. 

If we set a; = b, By = aa, a, = ba, and B2 = a, then f(a, B;) = baa = 
Ff (Q2, Bz). Since a, 4 a, f is not one-to-one. 

The function f is onto X* if given any string y € X*, there exist (a, 8) € X* x X* 
such that f(a, 8) = y. In words, f is onto X* if every string in X* is the concatenation 
of two strings, each in X*. Since concatenating a string @ with the null string A does not 
change a, every string in X* is the concatenation of two strings, each in X*. This part of 
the solution could be written up as follows. 


Let a € X*. Then f(a, A) = aA = a. Therefore f is onto X*. < 

3.2 Problem-Solving Tips 
A sequence is a special type of function; the domain is a set of integers. If a), a2,... is 
a sequence, the numbers 1, 2, ... are called indexes. Index | identifies the first element 


of the sequence a; index 2 identifies the second element of the sequence az; and so on. 

In this book, “increasing sequence” means strictly increasing; that is, the sequence 
a is increasing if for all i and j in the domain of a, if i < j, then a; < a;. We require that 
q; is strictly less than (not less than or equal to) a;. Allowing equality yields what we call 
in this book a “nondecreasing sequence.” That is, the sequence a is nondecreasing if for 
all i and j in the domain of a, if i < j, then a; < a;. Similar remarks apply to decreasing 
sequences and nonincreasing sequences. 
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3.2 Review Exercises 


— 


. Define sequence. 

. What is an index in a sequence? 
Define increasing sequence. 
Define decreasing sequence. 
Define nonincreasing sequence. 
. Define nondecreasing sequence. 
. Define subsequence. 

. What is 0" ai? 


i=m 


. What is [J/_,, ai? 


cnn nun Bw WN 


. Define string. 

. Define null string. 

. If X isa finite set, what is X*? 
. If X is a finite set, what is X+? 


. Define length of a string. How is the length of the string a de- 


noted? 


. Define concatenation of strings. How is the concatenation of 


strings a and 6 denoted? 


. Define substring. 


Answer I-3 for the sequence {Sy yo , defined by 
c, d,.d,¢, 4, 6 
1. Find s;. 2. Find sq. 


3. Write s as a string. 


4. Let s be a sequence whose domain D is a set of consecutive 
integers. Prove that if s; < sj; for all i for which i andi+ 1 
are in D, then s is increasing. Hint: Let i € D. Use induction 
on j to show that s; < s; for all j,i < j andj € D. 


In Exercises 5-9, tell whether the sequence s defined by sy, = 
2" — n? is 

(a) increasing 

(b) decreasing 

(c) nonincreasing 

(d) nondecreasing 


for the given domain D. 


5. D = {0, 1} 
6. D = {0, 1, 2, 3} 
7. D= {1, 2, 3} 


8. D = {1, 2, 3, 4} 
9. D={n|néeZ, n> 3} 


Answer 10-22 for the sequence t defined by 


t, =2n-1, n> 1, 
10. Find t3. 11. Find t. 
12. Find fiq9. 13. Find t2977. 
3 7 
14. Find ye 15. Find yh 
i=1 i=3 


6 
16. Find [+ 17. Find [[« 
i=3 


18. Find a formula that represents this sequence as a sequence 


whose lower index is 0. 


19. Is ¢ increasing? 20. Is t decreasing? 


21. Is ¢ nonincreasing? 22. Is t nondecreasing? 


Answer 23-30 for the sequence v defined by 


Vy =n! +2, n> 1. 


23. Find 13. 24. Find v4. 
4 3} 
25. Find ) > vj. 26. Find yj. 
i=1 i=3 
27. Is v increasing? 28. Is v decreasing? 
29. Is v nonincreasing? 30. Is v nondecreasing? 


Answer 31-36 for the sequence 


a=8 g=12, gs=12, q4=28, gs =33. 


4 4 
31. Find ye Yi 32. Find > dk: 
i=2 k=2 
33. Is q increasing? 34. Is q decreasing? 
35. Is q nonincreasing? 36. Is g nondecreasing? 


Answer 37-40 for the sequence 


tS = 5; 


37. Is t increasing? 38. Is t decreasing? 


39. Is t nonincreasing? 40. Is t nondecreasing? 


Answer 41-44 for the sequence 


TY. =5. 


41. Is Y increasing? 42. Is Y decreasing? 
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43. Is Y nonincreasing? 44, Is Y nondecreasing? 


Answer 45—56 for the sequence a defined by 


dy =n — 3n +3, n>1. 
4 5 
45. Find Sai. 46. Find S © qj. 
i=1 j=3 
4 6 
47. Find ye 48. Find + 
i=4 k=1 
2 3 
49. Find | [ ai. 50. Find | [ aj. 
i=l i=l 
3 4 
51. Find | | an. 52. Find | [ a. 
n=2 x=3 
53. Is a increasing? 54. Is a decreasing? 
55. Is a nonincreasing? 56. Is a nondecreasing? 


Answer 57-64 for the sequence b defined by by, = n(—1)",n > 1. 


4 10 
57. Find ye 58. Find So oi. 
i=1 i=1 


59. Find a formula for the sequence c defined by 


n 
Ch = ) bj. 
i=l 


60. Find a formula for the sequence d defined by 


n 
dn = Il b;. 
i=l 


61. Is b increasing? 62. Is b decreasing? 


63. Is b nonincreasing? 64. Is b nondecreasing? 


Answer 65-72 for the sequence Q defined by Qy = 3 for all n. 


3 10 
65. Find >> 2. 66. Find >> a. 
i=1 i=1 


67. Find a formula for the sequence c defined by 


n 
Ch = ) ern 
i=l 


68. Find a formula for the sequence d defined by 


n 
dy = Il Qj. 
i=l 


69. Is Q increasing? 70. Is Q decreasing? 


71. Is Q nonincreasing? 72. Is Q nondecreasing? 
Answer 73-79 for the sequence x defined by 


Xn =34+Xn-1, n>2. 


10 
74. Find + 
i=l 


x = 2, 


3 
73. Find Se 
i=l 
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75. Find a formula for the sequence c defined by 


n 
Ch = ) Xj. 
i=l 


76. Is x increasing? 77. Is x decreasing? 


78. Is x nonincreasing? 79. Is x nondecreasing? 


Answer 80-87 for the sequence w defined by 


3 
80. Find Sow. 
i=l 


82. Find a formula for the sequence c defined by 


n 
Ch = ) Wj. 
i=1 


83. Find a formula for the sequence d defined by 


n 


dn = | | Wj. 


i=1 
84. Is w increasing? 85. Is w decreasing? 


86. Is w nonincreasing? 87. Is w nondecreasing? 


Answer 88-100 for the sequence a defined by 
n—1 
==, =3 
ae 8 (n — 2)2 oe 

and the sequence z defined by zy, = Dar dj. 

88. Find a3. 

89. Find aq. 

90. Find z3. 

91. Find z4. 
*92. Find zj09. Hint: Show that 


1 1 1 
a= —> 7 ~ 
and use this form in the sum. Write out a3 + a4 + a5 + a6 to 
see what is going on. 
93. Is a increasing? 
x94. Is a decreasing? 
x95. Is a nonincreasing? 
96. Is a nondecreasing? 
97. Is z increasing? 
98. Is z decreasing? 
99. Is z nonincreasing? 
100. Is z nondecreasing? 
Let X be the set of positive integers that are not perfect squares. 
(A perfect square m is an integer of the form m = i where i is 


an integer.) Exercises 101-107 concern the sequence s from X to 
Z defined as follows. If n € X, let Ss, be the least integer ax for 


which there exist integers a,,..., ax withn < ay < dy <+++ < ax 
such that n+a,--- ax is a perfect square. As an example, consider 
sg. None of 2+3,2+4,2+3-+4 is a perfect square, so s, # 3 and 
so # 4. If sy = 5, 2+a)---ax+5 would be a perfect square for 
some a,..., AK, 2 < ay < az < +++ < ag <5. But then one of the 
a; would be a multiple of 5, which is impossible. Therefore, sy # 5. 
However, 2 +3 -6 is a perfect square, so sz = 6. 
101. Show that s, is defined for every n € X, that is, that for any 
n € X, there exist integers aj,...,a, withn < aj < a2 < 
+++ < ax such that n+ a, --- a, is a perfect square. 


102. 
103. 
104. 
105. 
*106. 
107. 
108. 


Show that s,, < 4n for alln € X. 

Find s3. 

Find s5. 

Find s6. 

Prove that if p is a prime, s, = 2p for all p > 5. 
Prove that s is not increasing. 


Let u be the sequence defined by 


uy =3, Up =3+uUn-1, n> 2, 


Find a formula for the sequence d defined by 


n 


 —— Il Uj. 


i=1 
Exercises 109-112 refer to the sequence {s,} defined by the rule 


Sy = 2n—1, n>1. 


109. List the first seven terms of s. 


Answer 110-112 for the subsequence of s obtained by taking the 
Jirst, third, fifth, ... terms. 


110. 
111. 


List the first seven terms of the subsequence. 


Find a formula for the expression ng as described before 
Example 3.2.15. 


112. Find a formula for the kth term of the subsequence. 


Exercises 113-116 refer to the sequence {t,} defined by the rule 


i= 2", n> 1. 


113. List the first seven terms of f. 


Answer 114-116 for the subsequence of t obtained by taking the 
Jirst, second, fourth, seventh, eleventh, ... terms. 

114. 
115. 


List the first seven terms of the subsequence. 


Find a formula for the expression nx as described before Ex- 
ample 3.2.15. 


116. Find a formula for the kth term of the subsequence. 


Answer 117-120 using the sequences y and z defined by 


Yn = 2" —1, Zn = n(n — 1). 


3 3 5 4 
117. Find (> ») (> :} 118. Find (> ») ( 
i=1 i=l i=l i=l 


119. 


Ans 


121 
123 
125 


139 


3 4 4 
. Find §> yizi. 120. Find (> ») (II “} 
i=1 i=3 i=2 


wer 121-128 for the sequence r defined by 
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fH so QP ad 35", n> 0. 
. Find ro. 122. Find r;. 
. Find rp. 124. Find 73. 
. Find a formula for rp. 126. Find a formula for r;,_1. 


127. Find a formula for rj,_2. 
128. Prove that {r;,} satisfies 

Typ = 7-1 — 10rn_2, n> 2. 
Answer 129-136 for the sequence z defined by 

Ln = (2 + n)3", ne 0. 

129. Find zo. 130. Find z;. 
131. Find zo. 132. Find z3. 
133. Find a formula for z;. 134. Find a formula for z,,_|. 
135. Find a formula for z,,_2. 
136. Prove that {z,,} satisfies 

Zn = 6%n—1 — Wn—2; n> 2. 
137. Find b,, n=1,...,6, where 

by =n+ (n— 1)(n — 2)(n — 3)(n — 4)(n — 5). 


138. 


139. 


140. 


141. 


Rewrite the sum 
‘a 2 yt , 
i=l 
replacing the index i by k, where i =k +1. 


Rewrite the sum 


n 
; Cy-1Cn-k, 
k=l 


replacing the index k by i, where k =i+ 1. 


Let a and b be sequences, and let 
k 
s= » dj. 
i=l 


Prove that 


n n 
x aby = as 8K(bq — Devi) + Snbnzi- 
k=l k=l 


This equation, known as the summation-by-parts formula, is 
the discrete analog of the integration-by-parts formula in cal- 
culus. 


Sometimes we generalize the notion of sequence as defined 
in this section by allowing more general indexing. Suppose 


140 


that {a;;} is a sequence indexed over pairs of positive integers. 
Prove that 


n n n J 
¥ (So) =¥5 (dom). 
i=1 \ jai j=l \i=l 

142. Compute the given quantity using the strings 


a = baab, B = caaba, y = bbab. 
(a) ap (b) Ba (c) aa 
(d) BB (e) |ap| (f) |Ba| 
(g) loa| (h) |BA| (i) adr 
Gj) AB (k) apy () BBya 


143. List all strings over X = {0, 1} 
144. List all strings over X = {0, 1} of length 2 or less. 
145. List all strings over X = {0, 1} of length 3. 

146. List all strings over X = {0, 1} of length 3 or less. 
147. Find all substrings of the string babc. 

148. Find all substrings of the string aabaabb. 


of length 2. 


149. Use induction to prove that 


1 
——— =n, 
Ny +2 Nk 


for all n > 1, where the sum is taken over all nonempty sub- 
sets {n,,12,..., mg} of {1,2,...,n}. 


150. Suppose that the sequence {a,,} satisfies a; = 0, az = 1, and 

An = (n— 1)(Qn-1 + Qy—-2) for all n > 3. 
Use induction to prove that 

n 
—1)* 
ma onl for alln > 1. 
k! 
k=0 


nt 


In Exercises 151-153, x1, X2,...,Xn, n => 2, are real numbers sat- 
isfying X1 <.X2 < +++ <X,, and x is an arbitrary real number. 


151. Prove that if x; < x < x,, then 


for alln > 3. 
152. Prove that if x < x; or x > X,, then 


n-1 


Yon > a ae. 


for alln > 3. 


153. A median of x1,...,X, is the middle value of x1,..., xp 
when n is odd, and any value between the two middle val- 
ues of x1, ...,X, when n is even. For example, if xj < x2 < 

- < x5, the median is x3. If x) < x2 < x3 < x4, a median 
is any value between x2 and x3, including x2 and x3. 
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Use Exercises 151 and 152 and mathematical induc- 
tion to prove that the sum 


n 
|x — xil, 
i=l 


n > 1, is minimized when x is equal to a median of 
hy cay Bae 

If we repeat an experiment n times and observe the val- 
ues X1,...,X,, the sum (3.2.9) can be interpreted as a mea- 
sure of the error in assuming that the correct value is x. This 
exercise shows that this error is minimized by choosing x to 
be a median of the values x1, ..., X,. The requested inductive 
argument is attributed to J. Lancaster. 


154. Prove that 


(3.2.9) 


n- n(n? — 1) — 


vere 


i=l j=l 


155. Let X = {a,b}. Define a function from X* to X* as 
f(a) = aab. Is f one-to-one? Is f onto X*? Prove your 
answers. 


156. Let X = {a, b}. Define a function from X* to X* as f(a) = 
aa. Is f one-to-one? Is f onto X*? Prove your answers. 


157. Let X = {a, b}. A palindrome over X is a string @ for which 
a =a (i.e, a string that reads the same forward and back- 
ward). An example of a palindrome over X is bbaabb. De- 
fine a function from X* to the set of palindromes over X 
as f(a) = aa®. Is f one-to-one? Is f onto? Prove your 
answers. 


Let L be the set of all strings, including the null string, that can be 
constructed by repeated application of the following rules: 


@ Ifa €L, then aab € Land baa € L. 
@ Ifa €Land B €L, thenap € L. 


For example, ab is in L, for if we take a = i, thena € Land the 
first rule states that ab = aab € L. Similarly, ba € L. As another 
example, aabb is in L, for if we take a = ab, then a € L; by the 
first rule, aabb = aab € L. As a final example, aabbba is in L, for 
if we take a = aabb and B = ba, thena € Land B € L; by the 
second rule, aabbba = af € L. 


158. Show that aaabbb is in L. 

159. Show that baabab is in L. 

160. Show that aab is not in L. 

161. Prove that if a € L, a has equal numbers of a’s and b’s. 
*162. Prove that if a has equal numbers of a’s and b’s, then a € L. 


163. Let {a,}°° , be a nondecreasing sequence, which is bounded 
above, and let L be the least upper bound of the set 
{d, | n = 1,2, ...}. Prove that for every real number ¢ > 0, 
there exists a positive integer N such that L — ¢ < a, < L 
for every n > N. In calculus terminology, a nondecreasing 
sequence, which is bounded above, converges to the limit L, 
where L is the least upper bound of the set of elements of the 
sequence. 


3.3 


Go Online 


For more on relations, see 
goo.gl/V3y4pS 


Example 3.3.2 


Example 3.3.3 
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Relations 


A relation from one set to another can be thought of as a table that lists which elements 
of the first set relate to which elements of the second set (see Table 3.3.1). Table 3.3.1 
shows which students are taking which courses. For example, Bill is taking Computer 
Science and Art, and Mary is taking Mathematics. In the terminology of relations, we 
would say that Bill is related to Computer Science and Art, and that Mary is related to 
Mathematics. 

Of course, Table 3.3.1 is really just a set of ordered pairs. Abstractly, we define a 
relation to be a set of ordered pairs. In this setting, we consider the first element of the 
ordered pair to be related to the second element of the ordered pair. 


TABLE 3.3.1 m Relation of 
Students to Courses 


Student Course 
Bill CompSci 
Mary Math 
Bill Art 

Beth History 
Beth CompSci 
Dave Math 


Definition 3.3.1 > A (binary) relation R from a set X to a set Y is a subset of 
the Cartesian product X x Y. If (x, y) € R, we write x Ry and say that x is related to y. 
If X = Y, we call R a (binary) relation on X. < 


A function (see Section 3.1) is a special type of relation. A function f from X to Y 
is a relation from X to Y having the properties: 


(a) The domain of f is equal to X. 
(b) For each x € X, there is exactly one y € Y such that (x, y) € f. 


If we let X = {Bill, Mary, Beth, Dave} and Y = {CompSci, Math, Art, History}, our 
relation R of Table 3.3.1 can be written 
R = {(Bill, CompSci), (Mary, Math), (Bill, Art), (Beth, History), 
(Beth, CompSci), (Dave, Math)}. 


Since (Beth, History) € R, we may write Beth R History. < 


Example 3.3.2 shows that a relation can be given by simply specifying which or- 
dered pairs belong to the relation. Our next example shows that sometimes it is possible 
to define a relation by giving a rule for membership in the relation. 

Let X = {2,3, 4} and Y = {3, 4, 5, 6, 7}. If we define a relation R from X to Y by 
(x,y) ER if x divides y, 
we obtain 
R= {(2,4), (2, 6), (3, 3), (3, 6), (4, 9}. 


If we rewrite R as a table, we obtain 
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Example 3.3.4 


: )3 4 
Figure 3.3.1 The digraph of the 
relation of Example 3.3.4. 


Example 3.3.5 


Example 3.3.7 


BRWWNDHND /] 
RAWAA!N 


< 
Let R be the relation on X = {1, 2, 3, 4} defined by (x, y) € Rif x < y,x, y e X. Then 


R= {d, 1), C, 2), Cd, 3), Cd, 4), (2, 2), (2, 3), (2, 4), (3, 3), (3, 4), (4, 4}. < 


An informative way to picture a relation on a set is to draw its digraph. (Di- 
graphs are discussed in more detail in Chapter 8. For now, we mention digraphs only in 
connection with relations.) To draw the digraph of a relation on a set X, we first draw 
dots or vertices to represent the elements of X. In Figure 3.3.1, we have drawn four 
vertices to represent the elements of the set X of Example 3.3.4. Next, if the element 
(x, y) is in the relation, we draw an arrow (called a directed edge) from x to y. In Fig- 
ure 3.3.1, we have drawn directed edges to represent the members of the relation R of 
Example 3.3.4. Notice that an element of the form (x, x) in a relation corresponds to a 
directed edge from x to x. Such an edge is called a loop. There is a loop at every vertex in 
Figure 3.3.1. 


The relation R on X = {a, b, c, d} given by the digraph of Figure 3.3.2 is R = {(a, a), 
(b,c), (c, b), (d, d)}. 


Figure 3.3.2 The digraph of 
the relation of Example 3.3.5. < 


We next define several properties that relations may have. 


Definition 3.3.6 >» A relation R ona set X is reflexive if (x, x) € R for every 
xeXx., < 


The relation R on X = {1, 2,3, 4} defined by (x, y) € Rif x < y, x,y € X, is reflexive 
because for each element x € X, (x, x) € R; specifically, (1, 1), (2, 2), (3, 3), and (4, 4) 
are each in R. The digraph of a reflexive relation has a loop at every vertex. Notice that 
the digraph of this relation (see Figure 3.3.1) has a loop at every vertex. < 


By the generalized De Morgan’s laws for logic (Theorem 1.5.14), a relation R on 
X is not reflexive if there exists x € X such that (x, x) ZR. 


Example 3.3.8 


Example 3.3.10 


Example 3.3.11 


Example 3.3.13 


Example 3.3.14 
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The relation R = {(a, a), (b,c), (c, b), (d, d)} on X = {a, b, c, d} is not reflexive. For 
example, b € X, but (b,b) ¢ R. That this relation is not reflexive can also be seen by 
looking at its digraph (see Figure 3.3.2); vertex b does not have a loop. < 


Definition 3.3.9 > A relation R ona set X is symmetric if for all x, y € X, if 
(x, y) € R, then (y, x) € R. < 


The relation R = {(a, a), (b,c), (c, b), (d, d)} on X = {a, b, c, d} is symmetric because 
for all x, y, if (x, y) € R, then (y, x) € R. For example, (b, c) is in R and (c, b) is also in 
R. The digraph of a symmetric relation has the property that whenever there is a directed 
edge from v to w, there is also a directed edge from w to v. Notice that the digraph of 
this relation (see Figure 3.3.2) has the property that for every directed edge from v to w, 
there is also a directed edge from w to v. < 


In symbols, a relation R is symmetric if 
vxVy[(x, y) € RJ > [Q, x) € RI]. 
Thus R is not symmetric if 
a[vavyL(, y) € R] > [(y, x) € RI]. (3.3.1) 


Using the generalized De Morgan’s laws for logic (Theorem 1.5.14) and the fact that 
=(p > gq) =p 7g (see Example 1.3.13), we find that (3.3.1) is equivalent to 


Ardy[[(@, y) € R] A 10, x) € RI] 
or, equivalently, 
Axdy[[(@, y) € RIATO, 4) € RII. 


In words, a relation R is not symmetric if there exist x and y such that (x, y) is in R and 
(y, x) is not in R. 


The relation R on X = {1,2,3,4} defined by (x,y) € Rifx < y, x,y € X, is 
not symmetric. For example, (2,3) € R, but (3,2) ¢ R. The digraph of this relation 
(see Figure 3.3.1) has a directed edge from 2 to 3, but there is no directed edge from 
3 to 2. < 


Definition 3.3.12 >» A relation R ona set X is antisymmetric if for all x, y € X, 
if (x, y) € Rand (y, x) € R, then x = y. < 


The relation R on X = {1, 2,3, 4} defined by (x, y) € Rif x < y, x,y € X, is antisym- 
metric because for all x, y, if (x, y) € R (.e., x < y) and (y, x) € R (Le., y < x), then 
cy; < 


It is sometimes more convenient to replace 
if (x, y) € Rand (y, x) € R, thenx = y 


in the definition of “antisymmetric” (Definition 3.3.12) with its logically equivalent con- 
trapositive (see Theorem 1.3.18) 
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Example 3.3.15 


eo ae aay 
Figure 3.3.3 The digraph of the 
relation of Example 3.3.15. 


Example 3.3.16 


Example 3.3.18 


ifx ~ y, then (x, y) g Ror (y,x) €R 


to obtain a logically equivalent characterization of “antisymmetric”: A relation R on a 
set X is antisymmetric if for all x, y € X, if x # y, then (x, y) ¢ Ror (y,x) RK. 

Using this equivalent definition of “antisymmetric,” we again see that the relation 
RonX = {1, 2,3, 4} defined by (x, y) € Rifx < y, x, y € X, is antisymmetric because 
for all x, y, ifx # y, @, y) € R (ie.,x > y) or (y, x) € R (Le., y > x). 

The equivalent characterization of “antisymmetric” translates for digraphs as fol- 
lows. The digraph of an antisymmetric relation has the property that between any two 
distinct vertices there is at most one directed edge. Notice that the digraph of the relation 
R in the previous paragraph (see Figure 3.3.1) has at most one directed edge between 
each pair of vertices. < 


If a relation has no members of the form (x, y), x # y, we see that the equivalent char- 
acterization of “antisymmetric” 


for all x, y € X, ifx # y, then (x, y) g Ror (y,x) €R 


(see Example 3.3.14) is trivially true (since the hypothesis x 4 y is always false). Thus if 
a relation R has no members of the form (x, y), x # y, R is antisymmetric. For example, 
R = {(a, a), (b, b), (c, c)} on X = {a, b, c} is antisymmetric. The digraph of R shown 
in Figure 3.3.3 has at most one directed edge between each pair of distinct vertices. 
Notice that R is also reflexive and symmetric. This example shows that “antisymmetric” 
is not the same as “not symmetric” because this relation is in fact both symmetric and 
antisymmetric. < 


In symbols, a relation R is antisymmetric if 
VxVyl(x, vy) ERA (yx) € R] > [x=]. 
Thus R is not antisymmetric if 
=[VxVy[(x, y) E RA (y, x) € RJ] > [x = y]]. (3.3.2) 


Using the generalized De Morgan’s laws for logic (Theorem 1.5.14) and the fact that 
=(p > gq) =p A-7¢q (see Example 1.3.13), we find that (3.3.2) is equivalent to 


Axdy[(x, y) Ee RA (yy, x) € RJA Lx = yl] 
which, in turn, is equivalent to 
Axay[(x, yy) ERA (x) ERA Y)I. 


In words, a relation R is not antisymmetric if there exist x and y, x # y, such that (x, y) 
and (y, x) are both in R. 


The relation R = {(a, a), (b,c), (c, b), (d, d)} on X = {a, b, c, d} is not antisymmetric 
because both (b, c) and (c, b) are in R. Notice that in the digraph of this relation (see 
Figure 3.3.2) there are two directed edges between b and c. < 


Definition 3.3.17 > A relation R ona set X is transitive if for all x, y, z € X, if 
(x, y) and (y, z) € R, then (x, z) € R. < 


The relation R on X = {1, 2, 3, 4} defined by (x, y) € Rifx < y, x, y € X, is transitive 
because for all x, y, z, if (x, y) and (y, z) € R, then (x, z) € R. To formally verify that this 
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relation satisfies Definition 3.3.17, we can list all pairs of the form (x, y) and (y, z) inR 
and then verify that in every case, (x, z) € R: 


Pairs of Form Pairs of Form 

(yy) 0,2) (xz) | Gy) OD (x, z) 
d,1) dt) (1,1) (2,2) (@,2) (2, 2) 
(,1)) 1, 2) (1, 2) (2,2) (2,3) (2, 3) 
d,1)) d, 3) (1,3) (2,2) (2,4) (2, 4) 
d,1) (d,4) d, 4) (2,3) (3,3) (2, 3) 
(1,2) (2, 2) (1, 2) (2,3) (3,4) (2, 4) 
(1,2) (2,3) (, 3) (2,4) (4,4) (2, 4) 
(1,2) (2,4) d, 4) G33) G3) (3, 3) 
(1,3) (3,3) (1, 3) (3,3) (3,4) (3, 4) 
d,3) (3,4) d, 4) (3,4) (4,4) (3, 4) 
d,4) (4,4) (1, 4) (4,4) (4,4) (4, 4) 


Actually, some of the entries in the preceding table were unnecessary. If x = y or 
y =z, we need not explicitly verify that the condition 


if (x, y) and (y, z) € R, then (x,z) E R 


is satisfied since it will automatically be true. Suppose, for example, that x = y and 
(x, y) and (y, z) are in R. Since x = y, (x, z) = (y, Z) is in R and the condition is satisfied. 
Eliminating the cases x = y and y = z leaves only the following to be explicitly checked 
to verify that the relation is transitive: 


Pairs of Form 


(x,y) 0,2) (x, 2) 
(1,2) (2,3) (1, 3) 
d,2) (2,4) (1, 4) 
d,3) 6,4) ad, 4) 
(2,3) (©,4) (2, 4) 


The digraph of a transitive relation has the property that whenever there are di- 
rected edges from x to y and from y to z, there is also a directed edge from x to z. Notice 
that the digraph of this relation (see Figure 3.3.1) has this property. < 


In symbols, a relation R is transitive if 
WxWyWzl(x, y) € RA (yz) € R] > [2 € RI. 
Thus R is not transitive if 
=[VxVyVzL@, y) € RA (y, z) € RJ > [(x, z) € RI]. (3.3.3) 


Using the generalized De Morgan’s laws for logic (Theorem 1.5.14) and the fact that 
=(p > g) =p A-7¢q (see Example 1.3.13), we find that (3.3.3) is equivalent to 


Avaya, y) € RA (y, 2) € RJA 1G, 2) € R] 
or, equivalently, 


Axdyaz[(x,y) ERA (y,2z) ERA (, z) ERI. 
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Example 3.3.19 


Example 3.3.21 


Example 3.3.22 


In words, a relation R is not transitive if there exist x, y, and z such that (x, y) and (y, z) 
are in R, but (x, z) is not in R. 


The relation R = {(a, a), (b,c), (c, b), (d, d)} on X = {a, b, c, d} is not transitive. For 
example, (b, c) and (c, b) are in R, but (b, b) is not in R. Notice that in the digraph of 
this relation (see Figure 3.3.2) there are directed edges from b to c and from c to b, but 
there is no directed edge from b to b. < 


Relations can be used to order elements of a set. For example, the relation R defined 
on the set of integers by 


(x,y) ER ifx<y 


orders the integers. Notice that the relation R is reflexive, antisymmetric, and transitive. 
Such relations are called partial orders. 


Definition 3.3.20 > A relation R on aset X is a partial order if R is reflexive, 
antisymmetric, and transitive. < 


Since the relation R defined on the positive integers by 
(x,y) ER if x divides y 


is reflexive, antisymmetric, and transitive, R is a partial order. < 


If R is a partial order on a set X, the notation x < y is sometimes used to indicate 
that (x, y) € R. This notation suggests that we are interpreting the relation as an ordering 
of the elements in X. 

Suppose that R is a partial order on a set X. If x, y € X and either x = y ory = x, 
we say that x and y are comparable. If x, y ¢ X andx * yandy Z x, we say that x 
and y are incomparable. If every pair of elements in X is comparable, we call R a total 
order. The less than or equal to relation on the positive integers is a total order since, 
if x and y are integers, either x < y or y < x. The reason for the term “partial order” is 
that in general some elements in X may be incomparable. The “divides” relation on the 
positive integers (see Example 3.3.21) has both comparable and incomparable elements. 
For example, 2 and 3 are incomparable (since 2 does not divide 3 and 3 does not divide 2), 
but 3 and 6 are comparable (since 3 divides 6). 

One application of partial orders is to task scheduling. 


Task Scheduling Consider the set T of tasks that must be completed in order to take an 
indoor flash picture with a particular camera. 


. Remove lens cap. 


. Focus camera. 


1 
2 
3. Turn off safety lock. 
4. Turn on flash unit. 

5 


. Push photo button. 


Some of these tasks must be done before others. For example, task 1 must be done before 
task 2. On the other hand, other tasks can be done in either order. For example, tasks 2 
and 3 can be done in either order. 


Example 3.3.24 


Example 3.3.26 
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The relation R defined on T by 
iRj if i = j or task i must be done before task j 
orders the tasks. We obtain 
R= {(, 1), (2, 2), (3, 3), 4, 4), (5,5), C, 2), (1. 5), (2, 5), (3, 5), (4, 5)}. 


Since R is reflexive, antisymmetric, and transitive, it is a partial order. A solution to the 
problem of scheduling the tasks so that we can take a picture is a total ordering of the 
tasks consistent with the partial order. More precisely, we require a total ordering of 
the tasks f), f2, 4, t4, ts such that if ¢; R4, then i = j or ¢; precedes ¢; in the list. Among 
the solutions are 1, 2,3, 4,5 and 3, 4, 1, 2,5. < 


Given a relation R from X to Y, we may define a relation from Y to X by reversing 
the order of each ordered pair in R. The inverse relation generalizes the inverse function. 
The formal definition follows. 


Definition 3.3.23 » Let Rbearelation from X to Y. The inverse of R, denoted 
R7!, is the relation from Y to X defined by 
RT! ={(y,x) | @y) € R}. < 


If we define a relation R from X = {2, 3, 4} to Y = {3, 4, 5, 6, 7} by 
(x,y) ER if x divides y, 
we obtain 
R= {(2, 4), (2, 6), (3, 3), (3, 6), (4, 4}. 
The inverse of this relation is 
R7' = {(4, 2), ©, 2), (3, 3), (6, 3), (4, 4)}. 

In words, we might describe this relation as “is divisible by.” < 

If we have a relation R; from X to Y and arelation R2 from Y to Z, we can form the 


composition of the relations by applying first relation R, and then relation Ry. Compo- 
sition of relations generalizes composition of functions. The formal definition follows. 


Definition 3.3.25 > Let R, be arelation from X to Y and R> be arelation from 
Y to Z. The composition of R, and Rp, denoted R20R;, is the relation from X to Z defined 
by 


Ro o Ri = {(x, z) | @, y) € Ri and (y, z) € Ro for some y € Y}. < 


The composition of the relations 
R, = {U, 2), C, 6), 2,4), GB, 4), GB, 6), G, 8)} 
and 
Ro = {(2, w), 4, 8), 4,9, (6,9), (8, w)} 
is 
RyoR, = {(1,u), Ud, 9, 2,8), 2,9, 3,8), 3,9, 3, wo}. 
For example, (1, u) € Ry o R; because (1, 2) € R; and (2, uw) € Ro. < 
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Example 3.3.27 Suppose that R and S are transitive relations on a set X. Determine whether each of RUS, 


ROS, or Ro S must be transitive. 


SOLUTION We try to prove each of the three statements. If we fail, we will try to 
determine where our proof fails and use this information to construct a counterexample. 

To prove that R U S is transitive, we must show that if (x, y), (v, z) € RUS, then 
(x, z) € RUS. Suppose that (x, y), (vy, z) € RUS. If (x, y) and (y, z) happen to both be 
in R, we could use the fact that R is transitive to conclude that (x, z) € R and, therefore, 
(x, z) € RUS. A similar argument shows that if (x, y) and (y, z) happen to both be in S, 
then (x, z) € RUS. But what if (x, y) € R and (y, z) € S? Now the fact that R and S are 
transitive seems to be of no help. We try to construct a counterexample in which R and 
S are transitive, but there exist (x, y) € R and (y, z) € S such that (x, z) ¢ RUS. 

We put (1, 2) in R and (2, 3) in S and ensure that (1, 3) is not in R U S. In fact, 
if R = {(1, 2)}, R is transitive. Similarly, if S = {(2,3)}, S is transitive. We have our 
counterexample. We could write up our solution as follows. 

We show that RUS need not be transitive. Let R = {(1, 2)} and S = {(2, 3)}. Then 
Rand S are transitive, but RUS is not transitive; (1, 2), (2, 3) € RUS, but (1,3) ¢ RUS. 

Next we turn our attention to RNS. To prove that RMS is transitive, we must show 
that if (x, y), (y,z) € RNS, then (x, z) € RMS. Suppose that (x, y), (y,z) Ee RNS. 
Then (x, y), (y, z) € R. Since R is transitive, (x, z) € R. Similarly, (x, y), (, z) € S, and 
since S is transitive, (x, z) € S. Therefore (x, z) € RMS. We have proved that RMS is 
transitive. 

Finally, consider R o S. To prove that R o S is transitive, we must show that if 
(x,y), Oz) € RoS, then (x, z) € Ro S. Suppose that (x, y), (y, z) € Ro S. Then there 
exists a such that (x, a) € S and (a, y) € R, and there exists b such that (y, b) € S and 
(b, z) € R. We now know that (a, y), (b, z) € R, but the fact that R is transitive does 
not allow us to infer anything from (a, y), (b, z) € R. A similar statement applies to S. 
We try to construct a counterexample in which R and S are transitive but R o S is not 
transitive. 

We will arrange for (1, 2), (2,3) € RoS, but (1,3) ¢ RoS. In order for (1, 2) € 
RoS, we must have (1,a) € S and (a,2) € R, for some a. We put (1,5) in S and 
(5, 2) in R. (We chose a to be a number different from 1, 2, or 3 to avoid a clash with 
those numbers. Any number different from 1, 2, 3 would do.) So far, so good! In order 
for (2,3) € Ro S, we must have (2, b) € S and (b, 3) € R, for some b. We put (2, 6) 
in S and (6, 3) in R. (Again, we chose b = 6 to avoid a clash with the other numbers 
already chosen.) Now R = {(5, 2), (6, 3)} and S = {(1, 5), (2, 6)}. Notice that R and S$ 
are transitive. We have our counterexample. We could write up our solution as follows. 

We show that R o S need not be transitive. Let R = {(5,2), (6,3)} and S = 
{(1, 5), (2, 6)}. Then R and S are transitive. Now Ro S = {(1, 2), (2, 3)} is not tran- 
sitive; (1, 2), (2,3) € RoS, but (1,3) gRoS. < 


3.3. Problem-Solving Tips 


= To prove that a relation is reflexive, show that (x,x) € R for every x € X. In 
words, a relation is reflexive if every element in X is related to itself. Given an 
arrow diagram, the relation is reflexive if there is a loop at every vertex. 


= To prove that a relation R on a set X is not reflexive, find x € X such that (x, x) ¢ R. 
Given an arrow diagram, the relation is not reflexive if some vertex has no loop. 


= To prove that a relation R on a set X is symmetric, show that for all x, y eX, if 
(x, y) ER, then (y, x) € RK. In words, a relation is symmetric if whenever x is re- 
lated to y, then y is related to x. Given an arrow diagram, the relation is symmetric 
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if whenever there is a directed edge from x to y, there is also a directed edge from 
y tox. 


= To prove that a relation R on a set X is not symmetric, find x, y € X such that 
(x, y) €R and (y, x) ¢ R. Given an arrow diagram, the relation is not symmetric 
if there are two distinct vertices x and y with a directed edge from x to y but no 
directed edge from y to x. 


= To prove that a relation R on a set X is antisymmetric, show that for all x, ye X, 
if (x, y) €R and (y, x) ER, then x = y. In words, a relation is antisymmetric if 
whenever x is related to y and y is related to x, then x = y. An equivalent charac- 
terization of “antisymmetric” can also be used: Show that for all x, ye X, ifx # y, 
then (x, y) Ror (y, x) ¢ R. Given an arrow diagram, the relation is antisymmetric 
if between any two distinct vertices there is at most one directed edge. Note that 
“not symmetric” is not necessarily the same as “antisymmetric.” 


= To prove that a relation R on a set X is not antisymmetric, find x, y¢X,x ¥ y, 
such that (x, y) € R and (y, x) € R. Given an arrow diagram, the relation is not anti- 
symmetric if there are two distinct vertices x and y and two directed edges, one 
from x to y and the other from y to x. 


= To prove that a relation R on a set X is transitive, show that for all x, y, z € X, if 
(x, y) and (y, z) are in R, then (x, z) is in R. [It suffices to check ordered pairs (x, y) 
and (y, z) with x # y and y ¥ z.] In words, a relation is transitive if whenever x is 
related to y and y is related to z, then x is related to z. Given an arrow diagram, the 
relation is transitive if whenever there are directed edges from x to y and from y to 
z, there is also a directed edge from x to z. 


= To prove that a relation R on a set X is not transitive, find x, y, z€ X such that (x, y) 
and (y, z) are in R, but (x, z) is not in R. Given an arrow diagram, the relation is 
not transitive if there are three distinct vertices x, y, z and directed edges from x to 
y and from y to z, but no directed edge from x to z. 


= A partial order is a relation that is reflexive, antisymmetric, and transitive. 


= The inverse R™! of the relation R consists of the elements (y, x), where (x, y) € R. 
In words, x is related to y in R if and only if y is related to x in R“!. 


= If R, is arelation from X to Y and R; is a relation from Y to Z, the composition of 
R, and Ro, denoted R> o Rj, is the relation from X to Z defined by 


Roo Ri = {(x, z) | (x, y) € Ri and (y, z) € Ro for some y € Y}. 


To compute the composition, find all pairs of the form (x, y) € Ri and (y, z) € Ra; 
then put (x, z) in R2 o R;. 


3.3 Review Exercises 


1. What is a binary relation from X to Y? 6. Define transitive relation. Give an example of a transitive 


2. What is the di h of a bi Jation? relation. Give an example of a relation that is not transitive. 
. What is the digraph of a binary relation? 


: i : 7. Define partial order and give an example of a partial order. 
3. Define reflexive relation. Give an example of a reflexive 7 & P P 


relation. Give an example of a relation that is not reflexive. 8. Define inverse relation and give an example of an inverse 


; ‘ ‘ F : ; relation. 
4. Define symmetric relation. Give an example of a symmetric 


relation. Give an example of a relation that is not symmetric. 9. Define composition of relations and give an example of the 


i ; : : : : . composition of relations. 
5. Define antisymmetric relation. Give an example of an antisym- 


metric relation. Give an example of a relation that is not anti- 
symmetric. 
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In Exercises 1-4, write the relation as a set of ordered pairs. 


1 


8840 Hammer 
9921 Pliers 
452 Paint 

2207 Carpet 
2. 

Sally Math 

Ruth Physics 

Sam Econ 
3. 

a 3 

b 1 

b 4 

c 1 
4, 

a a 

b b 


In Exercises 5-8, write the relation as a table. 


5. R= {(, 6), (b, 2), (a, 1), (ce, D} 
6. The relation R on {1, 2, 3, 4} defined by (x, y) ER ifx? > y 


7. R= {(Roger, Music), (Pat, History), (Ben, Math), 
(Pat, PolySci)} 


8. The relation R from the set X of planets to the set Y of integers 
defined by (x, y) € R if x is in position y from the sun (nearest 
the sun being in position 1, second nearest the sun being in 
position 2, and so on) 


In Exercises 9-12, draw the digraph of the relation. 


9. The relation of Exercise 4 on {a, b, c} 
10. The relation R = {(1, 2), (2, 3), (3, 4), (4, LD} on {1, 2, 3, 4} 


11. The relation R={(1, 2), (2, 1), (3,3), (1, 1), (2,2)} on 
X= {1,2,.3} 


12. The relation of Exercise 6 
In Exercises 13-16, write the relation as a set of ordered pairs. 


13. 


14. 
1 2 
3 
4 5 - 
15. 
2. 
16. 


17. Find the inverse (as a set of ordered pairs) of each relation in 
Exercises 1-16. 


Exercises 18 and 19 refer to the relation R on the set {1,2, 3, 4, 5} 


defined by the rule (x, y) € R if 3 divides x — y. 
18. List the elements of R. 19. List the elements of R~!. 


20. Repeat Exercises 18 and 19 for the relation R on the set 
{1, 2, 3, 4, 5} defined by the rule (x,y) e Rifx+y <6. 


21. Repeat Exercises 18 and 19 for the relation R on the set 
{1, 2, 3, 4, 5} defined by the rule (x,y) ¢ Rifx=y-— 1. 


22. Is the relation of Exercise 20 reflexive, symmetric, antisym- 
metric, transitive, and/or a partial order? 

23. Is the relation of Exercise 21 reflexive, symmetric, antisym- 
metric, transitive, and/or a partial order? 


In Exercises 24-34, determine whether each relation defined on 
the set of positive integers is reflexive, symmetric, antisymmetric, 
transitive, and/or a partial order. 
24. (x,y) € Rifxy=1. 

25. (x,y) € Rifxy =2. 

26. (x,y) € Rifxy> 1. 

27. (x,y) €Rifx=y’. 

29. (x,y) ERifx>y. 

31. (x, y) € Rif 3 divides x — y. 
32. (x, y) € R if 3 divides x + 2y. 

33. (x,y) € Rifx—y=2. 34. (x,y) € Rif |x —y| = 2. 


35. Let X be a nonempty set. Define a relation on P(X), the power 
set of X, as (A, B) € Rif A CB. Is this relation reflexive, sym- 
metric, antisymmetric, transitive, and/or a partial order? 


28. (x,y) © Rifx>y. 
30. (x,y) ERifx=y. 


36. Prove that a relation R on a set X is antisymmetric if and only 
if for all x, y € X, if (x,y) e Randx Fy, then (y,x) ZR. 


37. Let X be the set of all four-bit strings (e.g., 0011, 0101, 1000). 
Define a relation R on X as 5; R's2 if some substring of s; of 
length 2 is equal to some substring of s2 of length 2. Exam- 
ples: 0111 R 1010 (because both 0111 and 1010 contain 01). 
1110 R 0001 (because 1110 and 0001 do not share a common 
substring of length 2). Is this relation reflexive, symmetric, an- 
tisymmetric, transitive, and/or a partial order? 

38. Suppose that R; is a partial order on X;, i = 1, 2. Show that R 
is a partial order on X; x X2 if we define 

(x1, x2) R (x4, x5) if x) Ry x, and x2 Ro x4. 

39. Let R; and R2 be the relations on {1, 2, 3, 4} given by 


R, = {0, ), , 2), G, 4), (4, 2)} 
Ro = {0 D, 2, 1), GB, 1), 4,49), @, 2)}. 
List the elements of R; o Rz and R2 o Rj. 
Give examples of relations on {1,2,3,4} having the properties 
specified in Exercises 40-44. 
40. Reflexive, symmetric, and not transitive 
41. Reflexive, not symmetric, and not transitive 
42. Reflexive, antisymmetric, and not transitive 
43. Not reflexive, symmetric, not antisymmetric, and transitive 
44, Not reflexive, not symmetric, and transitive 
Let Rand S be relations on X. Determine whether each statement 


in Exercises 45-57 is true or false. If the statement is true, prove 
it; otherwise, give a counterexample. 


45. If R is transitive, then R7! is transitive. 
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46. If R and S are reflexive, then R U S is reflexive. 

47. If Rand S are reflexive, then RM S is reflexive. 

48. If R and S are reflexive, then R o S is reflexive. 

49. If Ris reflexive, then R~! is reflexive. 

50. If R and S are symmetric, then R U S is symmetric. 

51. If R and S are symmetric, then RN S is symmetric. 

52. If R and S are symmetric, then R o S is symmetric. 

53. If R is symmetric, then R~! is symmetric. 

54. If R and S are antisymmetric, then R U S is antisymmetric. 
55. If R and S are antisymmetric, then RM S is antisymmetric. 
56. If R and S are antisymmetric, then R o S is antisymmetric. 
57. If R is antisymmetric, then R~! is antisymmetric. 

58. How many relations are there on an n-element set? 

In Exercises 59-61, determine whether each relation R defined on 


the collection of all nonempty subsets of real numbers is reflexive, 
symmetric, antisymmetric, transitive, and/or a partial order. 


59. (A, B) € Rif for every ¢ > 0, there exists a € A andb € B 
with ja — b| < e. 

60. (A, B) € Rif for every a € A ande > 0, there exists b € B 
with |a — b| < «. 

61. (A, B) € Rif for every a € A, b € B, and « > 0, there exists 
a €Aandb’' € B with |a — b'| < € and |a’ —b| < «. 

62. What is wrong with the following argument, which 
supposedly shows that any relation R on X that is symmet- 
ric and transitive is reflexive? 

Let x € X. Using symmetry, we have (x, y) and (y, x) 
both in R. Since (x,y), (y,x) € R, by transitivity we have 
(x, x) € R. Therefore, R is reflexive. 


3.4 Equivalence Relations 


Suppose that we have a set X of 10 balls, each of which is either red, blue, or green (see 
Figure 3.4.1). If we divide the balls into sets R, B, and G according to color, the family 


Go Online 

For more on equivalence 
relations, see 
goo.gl/V3y4pS 


{R, B, G} is a partition of X. (Recall that in Section 1.1, we defined a partition of a set X 
to be a collection S of nonempty subsets of X such that every element in X belongs to 
exactly one member of S.) 

A partition can be used to define a relation. If S is a partition of X, we may define 
xRy to mean that for some set S € S, both x and y belong to S. For the example of 


Figure 3.4.1, the relation obtained could be described as “is the same color as.” The next 
theorem shows that such a relation is always reflexive, symmetric, and transitive. 


fr) Er) ( 


Figure 3.4.1 A set of colored 
balls. 
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Theorem 3.4.1 


Example 3.4.2 


Example 3.4.4 


Let S be a partition of a set X. Define x Ry to mean that for some set S in S, both x 
and y belong to S. Then R is reflexive, symmetric, and transitive. 


Proof Let x ¢ X. By the definition of partition, x belongs to some member S of S. 
Thus x Rx and R is reflexive. 

Suppose that x R y. Then both x and y belong to some set S € S. Since both y and 
x belong to S, yRx and R is symmetric. 

Finally, suppose that x R y and y Rz. Then both x and y belong to some set S € S 
and both y and z belong to some set T € S. Since y belongs to exactly one member of S, 
we must have S = T. Therefore, both x and z belong to S and x R z. We have shown that 
R is transitive. < 


Consider the partition S = {{1, 3, 5}, {2, 6}, {4}} of X = {1, 2, 3, 4, 5, 6}. The relation R 
on X given by Theorem 3.4.1 contains the ordered pairs (1, 1), (1, 3), and (1, 5) because 
{1, 3, 5} is in S. The complete relation is 


R=({d, 1), C, 3), 0,5), G, D, GB, 3), G,5), 6, D, G, 3), G, 5), 2, 2), 2, 6), 
(6, 2), (6, 6), (4, 4)}. < 


Let S and R be as in Theorem 3.4.1. If Se S, we can regard the members of S as 
equivalent in the sense of the relation R, which motivates calling relations that are reflex- 
ive, symmetric, and transitive equivalence relations. In the example of Figure 3.4.1, the 
relation is “is the same color as”; hence equivalent means “is the same color as.” Each 
set in the partition consists of all the balls of a particular color. 


Definition 3.4.3 >» A relation that is reflexive, symmetric, and transitive on a 
set X is called an equivalence relation on X. < 


The relation R of Example 3.4.2 is an equivalence relation on {1, 2, 3, 4, 5, 6} because of 
Theorem 3.4.1. We can also verify directly that R is reflexive, symmetric, and 


transitive. 
2 
| | () 
4 
5 6 


Figure 3.4.2 The digraph of the relation of Example 3.4.2. 


The digraph of the relation R of Example 3.4.2 is shown in Figure 3.4.2. Again, 
we see that R is reflexive (there is a loop at every vertex), symmetric (for every directed 
edge from v to w, there is also a directed edge from w to v), and transitive (if there is a 
directed edge from x to y and a directed edge from y to z, there is a directed edge from 
x to z). < 


Example 3.4.5 


Example 3.4.6 


Example 3.4.7 


Theorem 3.4.8 
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Consider the relation 
R= {(, 1), d, 3), Cd, 5), 2, 2), 2,4), G3, D, GB, 3), 3,5), 4,2), 4,9), 
(5, 1), (5, 3), (5, 5)} 


on {1, 2, 3, 4, 5}. The relation is reflexive because (1, 1), (2, 2), (3, 3), (4, 4), (5,5) ER. 
The relation is symmetric because whenever (x, y) is in R, (y, x) is also in R. Finally, the 
relation is transitive because whenever (x, y) and (y, z) are in R, (x, z) is also in R. Since 
Ris reflexive, symmetric, and transitive, R is an equivalence relation on {1, 2,3, 4, 5}. 


< 


The relation R on X = {1, 2,3, 4} defined by (x, y) € Rif x < y, x,y € X, is not an 
equivalence relation because R is not symmetric. [For example, (2, 3) € R, but (3, 2) ¢ 
R.] The relation R is reflexive and transitive. < 


The relation R = {(a, a), (b,c), (c, b), (d, d)} on X = {a, b, c, d} is not an equivalence 
relation because R is neither reflexive nor transitive. [It is not reflexive because, for ex- 
ample, (b, b) € R. It is not transitive because, for example, (b, c) and (c, b) are in R, but 
(b, b) is not in R.] < 


Given an equivalence relation on a set X, we can partition X by grouping related 
members of X. Elements related to one another may be thought of as equivalent. The 
next theorem gives the details. 


Let R be an equivalence relation on a set X. For each a € X, let [a] = {x € X | xRa}. 
(In words, [a] is the set of all elements in X that are related to a.) Then 


S= {[a]|a€X} 


is a partition of X. 


Proof We must show that every element in X belongs to exactly one member of S. 

Let a € X. Since aRa, a € [a]. Thus every element in X belongs to at least one 
member of S. It remains to show that every element in X belongs to exactly one member 
of S; that is, 


if x € X andx € [a] M [b], then [a] = [6]. (3.4.1) 


We first show that for all c,d € X, if cRd, then [c] = [d]. Suppose that c Rd. 
Let x € [c]. Then xRc. Since cRd and R is transitive, x Rd. Therefore, x € [d] and 
{c] © [d]. The argument that [d] C [c] is the same as that just given, but with the roles 
of c and d interchanged. Thus [c] = [d]. 

We now prove (3.4.1). Assume that x € X and x € [a] MN [b]. Then x Ra and xRb. 
Our preceding result shows that [x] = [a] and [x] = [b]. Thus [a] = [b]. < 


Definition 3.4.9 > Let R be an equivalence relation on a set X. The sets [a] 
defined in Theorem 3.4.8 are called the equivalence classes of X given by the 
relation R. < 


Example 3.4.10 In Example 3.4.4, we showed that the relation 


R=({d, ), , 3), 0,5), GB, D, GB, 3), G,5), 6, D, G, 3), G, 5), 2, 2), (2, 6), 
(6, 2), (6, 6), (4, 4)} 
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Example 3.4.11 


Example 3.4.12 


Example 3.4.13 


Example 3.4.14 


Example 3.4.15 


on X = {1, 2,3, 4,5, 6} is an equivalence relation. The equivalence class [1] contain- 
ing | consists of all x such that (x, 1) € R. Therefore, [1] = {1, 3,5}. The remaining 
equivalence classes are found similarly: 


[3] = [5] = {1, 3, 5}, [2] = [6] = {2, 6}, [4] = {4}. < 


The equivalence classes appear quite clearly in the digraph of an equivalence relation. 
The three equivalence classes of the relation R of Example 3.4.10 appear in the digraph 
of R (shown in Figure 3.4.2) as the three subgraphs whose vertices are {1, 3, 5}, {2, 6}, 
and {4}. A subgraph G that represents an equivalence class is a largest subgraph of the 
original digraph having the property that for any vertices v and w in G, there is a directed 
edge from v to w. For example, if v, w € {1, 3, 5}, there is a directed edge from v to w. 
Moreover, no additional vertices can be added to 1, 3, 5, and so the resulting vertex set 
has a directed edge between each pair of vertices. < 


There are two equivalence classes for the equivalence relation 


R= {d, 1), Cd, 3), d,5), 2, 2), 2, 4), G, 1), GB, 3), 3,5), 4,2), (44), 
(5, 1), 5,3), 5, 5)} 


on {1, 2, 3, 4, 5} of Example 3.4.5, namely, [1] = [3] = [5] = {1, 3, 5} and [2] = [4] = 
{2, 4}. < 


We can readily verify that the relation R = {(a, a), (b, b), (c,c)} on X = {a, b, c} is 
reflexive, symmetric, and transitive. Thus R is an equivalence relation. The equivalence 
classes are [a] = {a}, [b] = {b}, and [c] = {c}. < 


Let X = {1,2,..., 10}. Define x Ry to mean that 3 divides x — y. We can readily ver- 
ify that the relation R is reflexive, symmetric, and transitive. Thus R is an equivalence 
relation on X. Determine the members of the equivalence classes. 


SOLUTION The equivalence class [1] consists of all x with x R 1. Thus 
[1] = {x € X | 3 divides x — 1} = {1, 4, 7, 10}. 
Similarly, [2] = {2, 5, 8} and [3] = {3, 6, 9}. These three sets partition X. Note that 
(J=4)=(7]=00), [2)=5)=([8], [3] = [6] = [9]. 


For this relation, equivalence is “has the same remainder when divided by 3.” < 


Show that if a relation R on a set X is symmetric and transitive but not reflexive, the 
collection of sets [a], a € X, defined in Theorem 3.4.8 does not partition X (see also 
Exercises 47-51). 


SOLUTION Let R be a relation on a set X that is symmetric and transitive but not 
reflexive. We define “pseudo equivalence classes” as in Theorem 3.4.8: 


[a] = {x € X | xRa}. 


Since R is not reflexive, there exists b € X such that (b,b) ¢ R. We show that b is 
not in any pseudo equivalence class. Suppose, by way of contradiction, that b € [a] for 
some a € X. Then (b, a) € R. Since R is symmetric, (a, b) € R. Since R is transitive, 
(b, b) € R. But we assumed that (b, b) ¢ R. This contradiction shows that b is not in 
any pseudo equivalence class. Thus the collection of pseudo equivalence classes does 
not partition X. < 


Theorem 3.4.16 
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We close this section by proving a special result that we will need later (see Sec- 
tions 6.2 and 6.3). The proof is illustrated in Figure 3.4.3. 


xX 


X X) Xe 
(r elements) | (r elements) (r elements) 


|X| = rk 
Figure 3.4.3 The proof of Theorem 3.4.16. 


Let R be an equivalence relation on a finite set X. If each equivalence class has r 
elements, there are |X|/r equivalence classes. 


Proof Let X,, X»,..., X, denote the distinct equivalence classes. Since these sets 
partition X, 


IX] = [Xi + [Mol +--+ 1Xel =r tert brake 


and the conclusion follows. < 


3.4 Problem-Solving Tips 


An equivalence relation is a relation that is reflexive, symmetric and transitive. To prove 
that a relation is an equivalence relation, you need to verify that these three properties 
hold (see Problem-Solving Tips for Section 3.3). 

An equivalence relation on a set X partitions X into subsets. (‘‘Partitions” means 
that every x in X belongs to exactly one of the subsets of the partition.) The subsets 
making up the partition can be determined in the following way. Choose x; € X. Find 
the set, denoted [x, ], of all elements related to x,;. Choose another element x2 € X that is 
not related to x;. Find the set [x2] of all elements related to x2. Continue in this way until 
all the elements of X have been assigned to a set. The sets [x;] are called the equivalence 
classes. The partition is [x;], [x2], .... The elements of [x;] are equivalent in the sense 
that they are all related. For example, the relation R, defined by x Ry if x and y are the 
same color, partitions the set into subsets where each subset contains elements that are 
all the same color. Within a subset, the elements are equivalent in the sense that they are 
all the same color. 

In the digraph of an equivalence relation, an equivalence class is a largest subgraph 
of the original digraph having the property that for any vertices v and w in G, there is a 
directed edge from v to w. 


A partition of a set gives rise to an equivalence relation. If X;,..., X, is a parti- 
tion of a set X and we define x Ry if for some i, x and y both belong to X;, then R is 
an equivalence relation on X. The equivalence classes turn out to be Xi, ..., Xn. Thus, 


“equivalence relation” and “partition of a set” are different views of the same situation. 
An equivalence relation on X gives rise to a partition of X (namely, the equivalence 
classes), and a partition of X gives rise to an equivalence relation (namely, x is related to 
y if x and y are in the same set in the partition). This latter fact can be used to solve certain 
problems. If you are asked to find an equivalence relation, you can either find the equiv- 
alence relation directly or construct a partition and then use the associated equivalence 
relation. Similarly, if you are asked to find a partition, you can either find the partition 
directly or construct an equivalence relation and then take the equivalence classes as 
your partition. 
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3.4 Review Exercises 


1. Define equivalence relation. Give an example of an equivalence 
relation. Give an example of a relation that is not an equivalence 
relation. 


Nn 


. Define equivalence class. How do we denote an equivalence 


class? Give an example of an equivalence class for your equiv- 
alence relation of Exercise 1. 


3. Explain the relationship between a partition of a set and an 


equivalence relation. 


In Exercises 1-10, determine whether the given relation is an 

equivalence relation on {1,2,3,4, 5}. If the relation is an equiv- 

alence relation, list the equivalence classes. (In Exercises 5-10, 

x,y € {1, 2, 3, 4, 5}.) 

» {, 1), 2, 2), 3, 3), 4,4), 6,5), CL, 3), B, D} 

» {C, 1), (2, 2), 3, 3), (4,4), GS, 5), 3), GB, D, 3,4), (4, 3)} 

» {, 1), (2, 2), 3, 3), 4,4} 

. (0, ), 2,2), B,3), 4,4), (5, 5), 1,5), G, D, B, 5), (5,3), 
(1, 3), 3, D} 

5. {@,y)|1<x<Sandl<y<5} 

6. {(x, y) | 4 divides x — y} 

7. {(x, y) | 3 divides x + y} 8. {(x, y) | x divides 2 — y} 

9. {(x, y) | x and y are both even} 


& WN 


10. {(x, y) | x and y are both even or x and y are both odd} 


In Exercises 11-16, determine whether the given relation is an 
equivalence relation on the set of all people. 

11. {(x, y) |x and y are the same height} 

12. {(x, y) |x and y have, at some time, lived in the same country} 
13. {(x, y) |x and y have the same first name} 

14. {(x, y) |x is taller than y} 

15. {(x, y) |x and y have the same parents} 

16. {(x, y) |x and y have the same color hair} 

In Exercises 17-22, list the members of the equivalence relation 
on {1, 2, 3, 4} defined (as in Theorem 3.4.1) by the given partition. 
Also, find the equivalence classes [1], [2], [3], and [4]. 

17. {{1, 2}, (3, 4}} 18. {{1}, {2}, (3, 4}} 

19. {{1}, {2}, {3}, {4}} 20. {{1, 2, 3}, {4}} 

21. {{1, 2,3, 4}} 22. {{1}, {2, 4}, {3}} 

In Exercises 23-25, let X ={1,2, 3,4, 5}, Y= {3,4}, and C= 


{1, 3}. Define the relation R on P(X), the set of all subsets of X, 
as 


ARB ifAUY=BUY. 


23. Show that R is an equivalence relation. 


24. List the elements of [C], the equivalence class containing C. 


25. 
26. 


How many distinct equivalence classes are there? 
Let 


X = {San Francisco, Pittsburgh, Chicago, San Diego, 
Philadelphia, Los Angeles}. 


Define a relation R on X as x Ry if x and y are in the same state. 
(a) Show that R is an equivalence relation. 


(b) List the equivalence classes of X. 


. Ifan equivalence relation has only one equivalence class, what 


must the relation look like? 


. If R is an equivalence relation on a finite set X and |X| = |R|, 


what must the relation look like? 


. By listing ordered pairs, give an example of an equivalence 


relation on {1, 2,3, 4,5, 6} having exactly four equivalence 
classes. 


. How many equivalence relations are there on the set {1, 2, 3}? 


. Let R be areflexive relation on X satisfying: for all x, y, z € X, 


ifxRy and yRz, then zRx. Prove that R is an equivalence re- 
lation. 


. Define a relation R on R&, the set of functions from R to R, 


by f Rg if f(0) = g(0). Prove that R is an equivalence relation 
on RE. Let f(x) = x for all x € R. Describe [f]. 


. Define a relation R on R&, the set of functions from R to R by 


f Rg if there exist a,b € R such that f(x) = gx+a)+b 
for all x € R. Prove that R is an equivalence relation on 
R®. What property do all functions in an equivalence class 
share? 


. Let X = {1,2,..., 10}. Define a relation R on X x X by 


(a,b) R(c, d) ifa+d=b+ce. 
(a) Show that R is an equivalence relation on X x X. 


(b) List one member of each equivalence class of X x X. 


. Let X = {1,2,..., 10}. Define a relation R on X x X by 


(a, b) R (c, d) if ad = be. 
(a) Show that R is an equivalence relation on X x X. 
(b) List one member of each equivalence class of X x X. 


(c) Describe the relation R in familiar terms. 


. Let R be a reflexive and transitive relation on X. Show that 


ROR7*! is an equivalence relation on X. 


37. 


38. 


39. 


40. 


41. 


42. 


43. 


Let R; and R2 be equivalence relations on X. 
(a) Show that Rj M R2 is an equivalence relation on X. 


(b) Describe the equivalence classes of Rj M R2 in terms of 
the equivalence classes of R; and the equivalence classes 
of Ro. 

Suppose that S is a collection of subsets of a set X and X = 

US. (It is not assumed that the family S is pairwise dis- 

joint.) Define x Ry to mean that for some set S € S, both 

x and y are in S. Is R necessarily reflexive, symmetric, or 

transitive? 


Let S be a unit square including the interior, as shown in the 
following figure. 


(0, 1) (1, 1) 


s > X 
(0, 0) (1, 0) 


Define a relation R on S by (x,y) RQ’, y’) if (@ = x and 
y = y), or (y = y' andx = Oand1’ 1), or (vy = y' and 
x =landx = 0). 


(a) Show that R is an equivalence relation on S. 


(b) If points in the same equivalence class are glued together, 
how would you describe the figure formed? 


Let S be a unit square including the interior (as in Exer- 
cise 39). Define a relation R’ on S by (x, y) Rp(x’,y’) if 


(x = x’ andy = y), or (vy = y’ andx = Oandx’ = 1), or 
(y = y andx = land = 0), or (x = x andy = 0 and 
y’ = 1), or (@ = and y= 1 and y’ = 0). Let 


R=R'U{((0,0), (1, 1), (0, 1, C1, 0)), 
(C1, 9), (0, 1)), (CL, 1), O, 0))}. 


(a) Show that R is an equivalence relation on S. 


(b) If points in the same equivalence class are glued together, 
how would you describe the figure formed? 


Let f be a function from X to Y. Define a relation R on X by 


xRy — iff(x) =f0). 


Show that R is an equivalence relation on X. 


Let f be a characteristic function in X. (“Characteristic func- 
tion” is defined before Exercise 101, Section 3.1.) Define a 
relation R on X by xRy if f(x) = f(y). According to the pre- 
ceding exercise, R is an equivalence relation. What are the 
equivalence classes? 


Let f be a function from X onto Y. Let 


S={f-'(f}) lye ¥}. 


44. 


45. 


46. 


47. 


48. 


49. 


50. 


51. 


52. 
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[The definition of f —!(B), where B is a set, precedes Exer- 
cise 88, Section 3.1.] Show that S is a partition of X. Describe 
an equivalence relation that gives rise to this partition. 


Let R be an equivalence relation on a set A. Define a func- 
tion f from A to the set of equivalence classes of A by the rule 
f(x) = [x]. When do we have f(x) = f(y)? 


Let R be an equivalence relation on a set A. Suppose that g is 
a function from A into a set X having the property that if x Ry, 
then g(x) = g(y). Show that h([x]) = g(x) defines a function 
from the set of equivalence classes of A into X. [What needs 
to be shown is that h uniquely assigns a value to [x]; that is, if 
[x] = [y], then g(x) = g(y).] 

Suppose that a relation R on a set X is symmetric and transitive 
but not reflexive. Suppose, in particular, that (b, b) ¢ R. Prove 
that the pseudo equivalence class [b] (see Example 3.4.15) is 
empty. 

Prove that if a relation R on a set X is not symmetric but transi- 
tive, the collection of pseudo equivalence classes (see Exam- 
ple 3.4.15) does not partition X. 


Prove that if a relation R on a set X is reflexive but not symmet- 
ric, the collection of pseudo equivalence classes (see Example 
3.4.15) does not partition X. 


Prove that if a relation R on a set X is reflexive but not transi- 
tive, the collection of pseudo equivalence classes (see Exam- 
ple 3.4.15) does not partition X. 


Give an example of a set X and a relation R on X that is not 
reflexive, not symmetric, and not transitive, but for which the 
collection of pseudo equivalence classes (see Example 3.4.15) 
partitions X. 


Give an example of a set X and a relation R on X that is not 
reflexive, symmetric, and not transitive, but for which the col- 
lection of pseudo equivalence classes (see Example 3.4.15) 
partitions X. 


Let X denote the set of all sequences of real numbers. Define a 
relation R on X as s Rt if there exists an increasing function f 
from the domain of s onto the domain of ¢ such that s» = fn) 
for all n in the domain of s. 


(a) Show that R is an equivalence relation. 


(b) Explain in words what it means for two sequences in X to 
be equivalent under the relation R. 


(c) Since a sequence is a function, a sequence is a set of or- 
dered pairs. Two sequences are equal if the two sets of 
ordered pairs are equal. Contrast the difference between 
two equivalent sequences in X and two equal sequences 
in X. 


Let R be a relation on a set X. Define 


P(R) = RU {(x, x) | xe X} 
o(R) = RUR™! 
R" =RoRoRo::-oR_ (nR’s) 


T(R) = U{R" | w=, 2,4. 


The relation t(R) is called the transitive closure of R. 
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53. For the relations R; and R2 of Exercise 39, Section 3.3, find 60. e(R; UR2) = p(R1) U p(R2) 


p(Ri), o (Rj), T(R;), and t(o(p(R;))) for i = 1, 2. 61. o(R} A Ro) = 0 (R}) No (Ro) 
54. Show that p(R) is reflexive. 62. t(Rj UR) = T(R1) UT(R2) 
55. Show that o(R) is symmetric. 63. T(Ry NR2) = T(R1) N(R?) 
56. Show that t(R) is transitive. 64. o(t(R1)) = t(o(R)) 


x57. Show that t(o(p(R))) is an equivalence relation containing R. 65. o(p(R1)) = p(a(R})) 


x58. Show that t(a(p (R))) is the smallest equivalence relation on 66. p(t(R1)) = t(p(R1)) 
X containing R; that is, show that if R’ is an equivalence rela- 
tion on X and R’ > R, then R’ D t(o(p(R))). If X and Y are sets, we define X to be equivalent to Y if there is a 


F bette ; -to- to function from X to Y. 
«59. Show that R is transitive if and only if t(R) = R. One ter ane ante punenon [rem.% <0 


F 2 : ; 67. Show that set equivalence is an equivalence relation. 
In Exercises 60-66, if the statement is true for all relations R, q q 


and Ry on an arbitrary set X, prove it; otherwise, give a 68. Show that the sets {1, 2, ...} and (2, 4, .. .} are equivalent. 
counterexample. *69. Show that for any set X, X is not equivalent to P(X), the power 
set of X. 


Problem-Solving Corner Equivalence Relations 


Problem Notice that if we take any string in [01111010], 
say 01111100, and compute its equivalence class 
{01111100], we will obtain exactly the same set of 
strings—namely, the set of eight-bit strings that begin 
0111. 

To obtain a different example, we would have to 
start with a string whose first four bits are different 
from 0111, say 1011. As an example, the strings re- 


Answer the following questions for the relation R de- 
fined on the set of eight-bit strings by s; R.s2, provided 
that the first four bits of s; and s> coincide. 


(a) Show that R is an equivalence relation. 


(b) List one member of each equivalence class. 


(c) How many equivalence classes are there? lated to 10110100 are 

F 10110000, 10110001, 10110010, 10110011, 
Attacking the Problem 10110100, 10110101, 10110110, 10110111, 
Let’s begin by looking at some specific eight-bit strings 10111000, 10111001, 10111010, 10111011, 
that are related according to the relation R. Let’s take 10111100, 10111101, 10111110, 10111111. 
an arbitrary string 01111010 and find strings related to What we have just computed are the members of 
it. A string s is related to 01111010 if the first four bits [10110100]. We see that [01111010] and [10110100] 
of 01 111010 and s coincide. This HOTSR HS that 5 must have no members in common. It is always the case that 
begin 0111 and the last four bits can be anything. An two equivalence classes are identical or have no mem- 


example is s = 01111000. bers in common (see Theorem 3.4.8). 


Let’s list all of the strings related to 01111010. In Before reading on, compute the members of some 
doing so, we must be careful to follow 0111 with every other equivalence class. 


possible four-bit string: 


01110000, 01110001, 01110010, 01110011, Finding a Solution 
01110100, 01110101, 01110110, 01110111, 
01111000, 01111001, 01111010, 01111011, 
01111100, 01111101, 01111110, 01111111. 


To show that R is an equivalence relation, we must 
show that R is reflexive, symmetric, and transitive (see 
Definition 3.4.3). For each property, we will go directly 


Assuming for the moment that R is an equivalence to the definition and check that the conditions specified 
relation, the equivalence class containing 01111010, in the definition hold. 
denoted [01111010], consists of all strings related to For R to be reflexive, we must have s R's for every 
01111010. Therefore, what we have just computed are eight-bit string s. For s R's to be true, the first four bits 


the members of [01111010]. of s and s must coincide. This is certainly the case! 


For R to be symmetric, for all eight-bit strings s; 
and 32, if 5; Rs2, then sz R.s;. Using the definition of R, 
we may translate this condition to: If the first four bits 
of s; and sz coincide, then the first four bits of sz and 
s, coincide. This is also certainly the case! 

For R to be transitive, for all eight-bit strings 51, 
So, and 53, if s; Rsz and sz R's3, then s; Rs3. Again us- 
ing the definition of R, we may translate this condi- 
tion to: If the first four bits of s; and s2 coincide and 
the first four bits of s) and s3 coincide, then the first 
four bits of s; and s3 coincide. This too is certainly the 
case! We have proved that R is an equivalence rela- 
tion. 

In our earlier discussion, we found that each dis- 
tinct four-bit string determines an equivalence class. 
For example, the string 0111 determines the equiva- 
lence class consisting of all eight-bit strings that begin 
0111. Therefore, the number of equivalence classes is 
equal to the number of four-bit strings. We can simply 
list them all 


0000, 0001, 0010, 0011, 
0100, 0101, 0110, O111, 
1000, 1001, 1010, 1011, 
1100, 1101, 1110, 1111 


and then count them. There are 16 equivalence 
classes. 

Consider the problem of listing one member of 
each equivalence class. The 16 four-bit strings listed 
previously determine the 16 equivalence classes. The 
first string 0000 determines the equivalence class con- 
sisting of all eight-bit strings that begin 0000; the 
second string 0001 determines the equivalence class 
consisting of all eight-bit strings that begin 0001; and 
so on. Thus to list one member of each equivalence 
class, we simply need to append some four-bit string 
to each of the strings in the previous list: 


00000000, 00010000, 00100000, 00110000, 
01000000, 01010000, 01100000, 01110000, 
10000000, 10010000, 10100000, 10110000, 
11000000, 11010000, 11100000, 11110000. 


Formal Solution 


(a) We have already presented a formal proof that 
Ris an equivalence relation. 


Problem-Solving Corner: Equivalence Relations 


(b) 

00000000, 00010000, 00100000, 00110000, 
01000000, 01010000, 01100000, 01110000, 
10000000, 10010000, 10100000, 10110000, 
11000000, 11010000, 11100000, 11110000 


lists one member of each equivalence class. 


(c) There are 16 equivalence classes. 


Summary of Problem-Solving Techniques 


= List elements that are related. 


= Compute some equivalence classes; that is, list 
all elements related to a particular element. 


m It may help to solve the parts of a problem in 
a different order than that given in the prob- 
lem statement. In our example, it was helpful 
in looking at some concrete cases to assume 
that the relation was an equivalence relation be- 
fore actually proving that it was an equivalence 
relation. 


m= To show that a particular relation R is an 
equivalence relation, go directly to the defi- 
nitions. Show that R is reflexive, symmetric, 
and transitive by directly verifying that R satis- 
fies the definitions of reflexive, symmetric, and 
transitive. 


= If the problem is to count the number of items 
satisfying some property (e.g., in our problem we 
were asked to count the number of equivalence 
classes) and the number is sufficiently small, just 
list all the items and count them directly. 


Comments 


In programming languages, usually only some speci- 
fied number of characters of the names of variables and 
special terms (technically these are called identifiers) 
are significant. For example, in the C programming 
language, only the first 31 characters of identifiers are 
significant. This means that if two identifiers begin with 
the same 31 characters, the system is allowed to con- 
sider them identical. 

If we define a relation R on the set of C identifiers 
by s; Rs2, provided that the first 31 characters of 5, 
and sz coincide, then R is an equivalence relation. An 
equivalence class consists of identifiers that the system 
is allowed to consider identical. 
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3.5 


Matrices of Relations 


Go Online 

For more matrices of 
relations, see 
goo.gl/V3y4pS 


Example 3.5.1 


Example 3.5.2 


Example 3.5.3 


Example 3.5.4 


A matrix is a convenient way to represent a relation R from X to Y. Such a representation 
can be used by a computer to analyze a relation. We label the rows with the elements of 
X (in some arbitrary order), and we label the columns with the elements of Y (again, in 
some arbitrary order). We then, set the entry in row x and column y to | if xR y and to 
0 otherwise. This matrix is called the matrix of the relation R (relative to the orderings 
of X and Y). 


The matrix of the relation 
R={(,)5), d, 4), 2,0), 3,0), 3, d), (4, O} 


from X = {1, 2,3, 4} to Y = {a, b, c, d} relative to the orderings 1, 2,3, 4 anda, b, c, dis 


abed 
1/0 10 1 
210 0 1 0 
3 {0 1 1 0 
4\1 0 0 0 4 


The matrix of the relation R of Example 3.5.1 relative to the orderings 2, 3, 4, 1 and 
d, b, a, cis 


eB Wb 

Poocosnsd. 
=e OF Oo 
oroo;oes 
oocorrF 5 


Obviously, the matrix of a relation from X to Y is dependent on the orderings of X 
and Y. < 


The matrix of the relation R from {2, 3, 4} to {5, 6, 7, 8}, relative to the orderings 2, 3, 4 
and 5, 6, 7, 8, defined by 
xRy if x divides y 


1s 


5 6 7 8 
2/0 1 0 1 
3 {0 1 0 0 
4\0 0 0 1 < 


When we write the matrix of a relation R on a set X (i.e., from X to X), we use the 
same ordering for the rows as we do for the columns. 


The matrix of the relation 
R = {(a, a), (b, b), (c, c), (d, d), (b, ©), (c, b)} 


on {a, b, c, d}, relative to the ordering a, b, c, d, is 
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uo Fa 

ooorsa 
Correo 
OoOrrons 
FPoook. 


Notice that the matrix of a relation on a set X is always a square matrix. 

We can quickly determine whether a relation R on a set X is reflexive by examining 
the matrix A of R (relative to some ordering). The relation R is reflexive if and only if 
A has 1’s on the main diagonal. (The main diagonal of a square matrix consists of the 
entries on a line from the upper left to the lower right.) The relation R is reflexive if and 
only if (x, x) € R for all x € X. But this last condition holds precisely when the main 
diagonal consists of 1’s. Notice that the relation R of Example 3.5.4 is reflexive and that 
the main diagonal of the matrix of R consists of 1’s. 

We can also quickly determine whether a relation R on a set X is symmetric by 
examining the matrix A of R (relative to some ordering). The relation R is symmetric if 
and only if for all i and j, the ijth entry of A is equal to the jith entry of A. (Less formally, 
R is symmetric if and only if A is symmetric about the main diagonal.) The reason is 
that R is symmetric if and only if whenever (x, y) is in R, (y, x) is also in R. But this last 
condition holds precisely when A is symmetric about the main diagonal. Notice that the 
relation R of Example 3.5.4 is symmetric and that the matrix of R is symmetric about 
the main diagonal. 

We can also quickly determine whether a relation R is antisymmetric by examining 
the matrix of R (relative to some ordering) (see Exercise 11). 

We conclude by showing how matrix multiplication relates to composition of re- 
lations and how we can use the matrix of a relation to test for transitivity. 


Let R, be the relation from X = {1, 2, 3} to Y = {a, b} defined by 
R, = {U, 4), 2, d), 3, a), G, d)}, 

and let R> be the relation from Y to Z = {x, y, z} defined by 
Ry = {(a, x), (a, y), By), B, DI. 


The matrix of R, relative to the orderings 1, 2, 3 and a, b is 


and the matrix of R> relative to the orderings a, b and x, y, zis 


x y Z 
a(1 1 0 
n= 5 (4 1 a 


The product of these matrices is 


1 
AiAx=|0 1 1 
1 


i) 
a 


Let us interpret this product. 
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Theorem 3.5.6 


The ikth entry in AA, is computed as 


k 


i(s 1) (") = sum, 


If this value is nonzero, then either su or tv is nonzero. Suppose that su ~¢ 0. (The 
argument is similar if tv # 0.) Then s # O and u ¥ O. This means that (i,a) € 
R, and (a,k) € Ro. This implies that (i,k) € R2 o Rj. We have shown that if the 
ikth entry in A;A2 is nonzero, then (i,k) € R2 o R,. The converse is also true, as we 
now show. 

Assume that (i, k) € Ro o R;. Then, either 


1. (i,a) € R, and (a,k) E Ro 
or 
2. (i, b) € R, and (b, k) € Ro. 


If 1 holds, then s = 1 and u = 1, so su = 1 and su + fv is nonzero. Similarly, if 2 holds, 
tv = | and again we have su + tv nonzero. We have shown that if (7, k) € Ro o R;, then 
the kth entry in AjA2 is nonzero. 

We have shown that (i,k) € R» o R, if and only if the ikth entry in A,Ap2 is 
nonzero; thus A,A> is “almost” the matrix of the relation Ry o R,. To obtain the ma- 
trix of the relation R2 o Ri, we need only change all nonzero entries in A;A2 to 1. Thus 
the matrix of the relation Ry o R;, relative to the previously chosen orderings 1, 2, 3 and 
x,y, Z, is 


RS eS eS 
et et SS 


x 
1 
0 
1 


WNre 


< 


The argument given in Example 3.5.5 holds for any relations. We summarize this 
result as Theorem 3.5.6. 


Let R; be a relation from X to Y and let R2 be a relation from Y to Z. Choose order- 
ings of X, Y, and Z. Let A; be the matrix of R; and let A> be the matrix of Rj with 
respect to the orderings selected. The matrix of the relation R2 oR; with respect to the 
orderings selected is obtained by replacing each nonzero term in the matrix product 
AjA2 by ihe 


Proof = The proof is sketched before the statement of the theorem. < 


Theorem 3.5.6 provides a quick test for determining whether a relation is transi- 
tive. If A is the matrix of R (relative to some ordering), we compute A”. We then compare 
A and A”. The relation R is transitive if and only if whenever entry i, j in A? is nonzero, 
entry i,j in A is also nonzero. The reason is that entry i, j in A? is nonzero if and only 
if there are elements (i,k) and (k, 7) in R (see the proof of Theorem 3.5.6). Now R is 
transitive if and only if whenever (i, k) and (k, j) are in R, then (i,j) is in R. But (i, /) 
is in R if and only if entry 7,7 in A is nonzero. Therefore, R is transitive if and only if 
whenever entry i, j in A? is nonzero, entry i, j in A is also nonzero. 


Example 3.5.7 The matrix of the relation 
R= {(a, a), (b, b), (c, c), (d, d), (b,c), (c, by} 


Example 3.5.8 


on {a, b, c, d}, relative to the ordering a, b, c, d, is 


1 0 
0 1 
fe 0 1 
0 0 
Its square is 
1 0 
0 2 
2 
aan OF 2 
0 0 


0 


NNO 


0 
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We see that whenever entry i, j in A? is nonzero, entry i,j in A is also nonzero. There- 


fore, R is transitive. 


The matrix of the relation 


R= {(,a), (b, b), (c,¢), (d, d), (a, 0), (c, b)} 


on {a, b, c, d}, relative to the ordering a, b, c, d, is 


1 0 

0 1 

a 0 1 

0 0 

Its square is 

1 1 

2_ {0 1 

== 0 2 

0 0 


oror 


orow 


Proce 


- OC SO 


< 


The entry in row 1, column 2 of A? is nonzero, but the corresponding entry in A is zero. 


Therefore, R is not transitive. 


3.5 Problem-Solving Tips 


< 


m= The matrix of a relation R is another way to represent or specify a relation from 
X to Y. The entry in row x and column y is 1 ifxRy, or Oifx Ry. 


= A relation is reflexive if and only if the main diagonal of its matrix representation 


consists of all 1’s. 


= A relation is symmetric if and only if its matrix is symmetric (i.e., entry 7, j always 


equals entry /, i). 


= Let R; be arelation from X to Y and let Ro be a relation from Y to Z. Let A, be the 
matrix of R; and let A> be the matrix of R.. The matrix of the relation R2 o R, is 


obtained by replacing each nonzero term in the matrix product A;Ao2 by 1. 


= To test whether a relation is transitive, let A be its matrix. Compute A”. The rela- 
tion is transitive if and only if whenever entry i,j in A? is nonzero, entry i, j in A 


is also nonzero. 


164 


Chapter 3 ® Functions, Sequences, and Relations 


3.5 Review Exercises 


1. What is the matrix of a relation? 


2. Given the matrix of a relation, how can we determine whether 
the relation is reflexive? 


ue 


. Given the matrix of a relation, how can we determine whether 
the relation is symmetric? 


4. Given the matrix of a relation, how can we determine whether 
the relation is transitive? 


5. Given the matrix A, of the relation R; and the matrix A» of 
the relation R2, explain how to obtain the matrix of the relation 
Ra oR). 


In Exercises 1-3, find the matrix of the relation R from X to Y rel- 
ative to the orderings given. 


1. R={(1, 5), (2, a), (2, 4), (3, B), 3, &)}s ordering of X:1, 
2, 3; ordering of Y:a, 6, X,6 

2. R as in Exercise 1; ordering of X:3, 2,1; ordering of Y: X, 
B,a, 5 


3. R={(, 4), (0), (a), 0,5), (, DI 
y, z; ordering of Y: a,b, c,d 


ordering of X:x, 


In Exercises 4—6, find the matrix of the relation R on X relative to 
the ordering given. 


4. R={(1, 2), (2,3), (3, 4), (4, 5)}; ordering of X:1, 2,3, 4,5 
5. Ras in Exercise 4; ordering of X:5, 3, 1, 2,4 

6. R= {(x, y) | x < y}; ordering of X: 1, 2,3, 4 

7. Find matrices that represent the relations of Exercises 13-16, 


Section 3.3. 


In Exercises 8-10, write the relation R, given by the matrix, as a 
set of ordered pairs. 


8. 


aoe 
RPooo,y 


NS eR = 
Comore: 
coooys 


». 
1 
0 
1 
0 


rPOOON 


11. How can we quickly determine whether a relation R is anti- 
symmetric by examining the matrix of R (relative to some or- 
dering)? 

12. Tell whether the relation of Exercise 10 is reflexive, symmet- 
ric, transitive, antisymmetric, a partial order, and/or an equiv- 
alence relation. 


13. Given the matrix of a relation R from X to Y, how can we find 
the matrix of the inverse relation R~!? 

14. Find the matrix of the inverse of each of the relations of Exer- 
cises 8 and 9. 

15. Use the matrix of the relation to test for transitivity (see 
Examples 3.5.7 and 3.5.8) for the relations of Exercises 4, 6, 
and 10. 


In Exercises 16-18, find 

(a) The matrix A, of the relation R, (relative to the given or- 
derings). 

(b) The matrix Az of the relation R2 (relative to the given or- 
derings). 

(c) The matrix product A\A2. 

(d) Use the result of part (c) to find the matrix of the relation 
Roo Rj. 

(e) Use the result of part (d) to find the relation R2 0 Ry (as a 
set of ordered pairs). 


16. Ry = {(, x), (1, y), (2, x), (3, x)}; Ro = {(@, 5), , b), (Ya), 
(y, c)}; orderings: 1, 2, 3; x, y; a, b,c 

17. R; = {(x, y) | x divides y}; R is from X to Y; Ro = {(,z) | 
y > z}; Ro is from Y to Z; ordering of X and Y: 2, 3, 4, 5; or- 
dering of Z: 1, 2, 3,4 

18. Rj ={(x,y) | x +y < 6}; R; is from X to Y; Ro = {(y, z) | 
y=z-+ 1}; Ro is from Y to Z; ordering of X, Y, and 
Z:1,2,3,4,5 

19. Given the matrix of an equivalence relation R on X, how can 

we easily find the equivalence class containing the element 

xe xX? 

Let R; be a relation from X to Y and let Ro be a relation from 

Y to Z. Choose orderings of X, Y, and Z. All matrices of rela- 

tions are with respect to these orderings. Let A; be the matrix 

of R; and let Az be the matrix of R2. Show that the ikth entry 

in the matrix product A;Ap2 is equal to the number of elements 

in the set 


«20. 


{m | (i,m) € R, and (m, k) € Ro}. 


21. Suppose that R; and R2 are relations ona set X, A, is the matrix 
of R; relative to some ordering of X, and A2 is the matrix of 


22. 


23. 


R> relative to the same ordering of X. Let A be a matrix whose 
ijth entry is | if the ijth entry of either A; or Az is 1. Prove that 
A is the matrix of Rj U Ro. 


Suppose that R; and R2 are relations on a set X, Ay is the ma- 
trix of R; relative to some ordering of X, and A2 is the matrix 
of R2 relative to the same ordering of X. Let A be a matrix 
whose jth entry is | if the ijth entries of both A, and A2 are 1. 
Prove that A is the matrix of Rj M Ro. 


Suppose that the matrix of the relation R; on {1, 2, 3} is 


relative to the ordering 1, 2, 3, and that the matrix of the rela- 
tion R2 on {1, 2, 3} is 


3.6 


24. 
25. 
26. 


27. 
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relative to the ordering 1,2,3. Use Exercise 21 to find 
the matrix of the relation Rj U R2 relative to the ordering 
1y2,,3; 


Use Exercise 22 to find the matrix of the relation R; 
R>2 relative to the ordering 1,2,3 for the relations of 
Exercise 23. 


How can we quickly determine whether a relation R is a 
function by examining the matrix of R (relative to some 
ordering)? 


Let A be the matrix of a function f from X to Y (relative to 
some orderings of X and Y). What conditions must A satisfy 
for f to be onto Y? 


Let A be the matrix of a function f from X to Y (relative to 
some orderings of X and Y). What conditions must A satisfy 
for f to be one-to-one? 


tT 


The “bi” in a binary relation R refers to the fact that R has two columns when we write 
Rasa table. It is often useful to allow a table to have an arbitrary number of columns. If 
a table has n columns, the corresponding relation is called an n-ary relation. 


Example 3.6.1 


Table 3.6.1 represents a 4-ary relation. This table expresses the relationship among iden- 
tification numbers, names, positions, and ages. 


TABLE 3.6.1 m PLAYER 
ID Number 


22012 
93831 
58199 
84341 
01180 
26710 
61049 
39826 


Name Position Age 
Johnsonbaugh c 22 
Glover of 24 
Battey p 18 
Cage c 30 
Homer lb 37 
Score p 22 
Johnsonbaugh of 30 
Singleton 2b 31 


We can also express an n-ary relation as a collection of n-tuples. 


Example 3.6.2 Table 3.6.1 can be expressed as the set 


{(22012, Johnsonbaugh, c, 22), 
(58199, Battey, p, 18), 


(93831, Glover, of, 24), 
(84341, Cage, c, 30), 


+ This section can be omitted without loss of continuity. 
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Example 3.6.3 


Example 3.6.4 


Example 3.6.5 


(01180, Homer, 1b, 37), (26710, Score, p, 22), 
(61049, Johnsonbaugh, of, 30), (39826, Singleton, 2b, 31)} 


of 4-tuples. < 


A database is a collection of records that are manipulated by a computer. For 
example, an airline database might contain records of passengers’ reservations, flight 
schedules, equipment, and so on. Computer systems are capable of storing large amounts 
of information in databases. The data are available to various applications. Database 
management systems are programs that help users access the information in databases. 
The relational database model, invented by E. F. Codd, is based on the concept of an 
n-ary relation. We will briefly introduce some of the fundamental ideas in the theory of 
relational databases. For more details on relational databases, the reader is referred to 
[Codd; Date; and Kroenke]. We begin with some of the terminology. 

The columns of an n-ary relation are called attributes. The domain of an attribute 
is a set to which all the elements in that attribute belong. For example, in Table 3.6.1, 
the attribute Age might be taken to be the set of all positive integers less than 100. The 
attribute Name might be taken to be all strings over the alphabet having length 30 or less. 

A single attribute or a combination of attributes for a relation is a key if the values 
of the attributes uniquely define an n-tuple. For example, in Table 3.6.1, we can take the 
attribute ID Number as a key. (It is assumed that each person has a unique identification 
number.) The attribute Name is not a key because different persons can have the same 
name. For the same reason, we cannot take the attribute Position or Age as a key. Name 
and Position, in combination, could be used as a key for Table 3.6.1, since in our example 
a player is uniquely defined by a name and a position. 

A database management system responds to queries. A query is a request for 
information from the database. For example, “Find all persons who play outfield” is a 
meaningful query for the relation given by Table 3.6.1. We will discuss several operations 
on relations that are used to answer queries in the relational database model. 


Select The selection operator chooses certain n-tuples from a relation. The choices 
are made by giving conditions on the attributes. For example, for the relation PLAYER 
given in Table 3.6.1, PLAYER [Position = c] will select the tuples 


(22012, Johnsonbaugh, c, 22), (84341, Cage, c, 30). 4 


Project Whereas the selection operator chooses rows of a relation, the projection 
operator chooses columns. In addition, duplicates are eliminated. For example, for the 
relation PLAYER given by Table 3.6.1, PLAYER [Name, Position] will select the tuples 


(Johnsonbaugh, c), (Glover, of), (Battey, p), (Cage, c), 
(Homer, 1b), (Score, p), (Johnsonbaugh, of), (Singleton, 2b). >| 


Join The selection and projection operators manipulate a single relation; join manipu- 
lates two relations. The join operation on relations R; and Ry begins by examining all 
pairs of tuples, one from R; and one from R3. If the join condition is satisfied, the tuples 
are combined to form a new tuple. The join condition specifies a relationship between 
an attribute in R; and an attribute in Ry. For example, let us perform a join operation on 
Tables 3.6.1 and 3.6.2. As the condition we take ID Number = PID. We take a row from 
Table 3.6.1 and a row from Table 3.6.2 and if ID Number = PID, we combine the rows. 


TABLE 3.6.2 m ASSIGNMENT 


PID 


39826 
26710 
58199 
01180 


Team 


Blue Sox 
Mutts 
Jackalopes 
Mutts 


Example 3.6.6 


Example 3.6.7 
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For example, the ID Number 01180 in the fifth row (01180, Homer, 1b, 37) of Table 
3.6.1 matches the PID in the fourth row (01180, Mutts) of Table 3.6.2. These tuples are 
combined by first writing the tuple from Table 3.6.1, following it by the tuple from Table 
3.6.2, and eliminating the equal entries in the specified attributes to give 
(01180, Homer, 1b, 37, Mutts). 
This operation is expressed as 


PLAYER [ID Number = PID] ASSIGNMENT. 


The relation obtained by executing this join is shown in Table 3.6.3. 


TABLE 3.6.3 m PLAYER [ID Number = PID] ASSIGNMENT 


ID Number Name Position Age Team 
58199 Battey P 18 Jackalopes 
01180 Homer lb 37 Mutts 
26710 Score ) 22 Mutts 
39826 Singleton 2b 31 Blue Sox 


< 


Most queries to a relational database require several operations to provide the 
answer. 


Describe operations that provide the answer to the query “Find the names of all persons 
who play for some team.” 


SOLUTION If we first join the relations given by Tables 3.6.1 and 3.6.2 subject to the 
condition ID Number = PID, we will obtain Table 3.6.3, which lists all persons who play 
for some team as well as other information. To obtain the names, we need only project 
on the attribute Name. We obtain the relation 


Name 


Battey 
Homer 
Score 
Singleton 


Formally, these operations would be specified as 


TEMP := PLAYER [ID Number = PID] ASSIGNMENT 


TEMP [Name] < 


Describe operations that provide the answer to the query “Find the names of all persons 
who play for the Mutts.” 


SOLUTION If we first use the selection operator to pick the rows of Table 3.6.2 that 
reference Mutts’ players, we obtain the relation 
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TEMP1 
PID Team 
26710 Mutts 
01180 Mutts 


If we now join Table 3.6.1 and the relation TEMP! subject to ID Number = PID, we 
obtain the relation 


TEMP2 


ID Number Name Position Age Team 


01180 Homer lb 37. Mutts 
26710 Score p 22 Mutts 


If we project the relation TEMP2 on the attribute Name, we obtain the relation 


Name 


Homer 
Score 


We would formally specify these operations as follows: 


TEMP! := ASSIGNMENT [Team = Mutts] 
TEMP2 :=PLAYER [ID Number = PID] TEMP1 
TEMP? [Name] < 


Notice that the operations 


TEMP! := PLAYER [ID Number = PID] ASSIGNMENT 
TEMP2 := TEMP! [Team = Mutts] 
TEMP2 [Name] 


would also answer the query of Example 3.6.7. 


3.6 Problem-Solving Tips 


A relational database represents data as tables (n-ary relations). Information from the 
database is obtained by manipulating the tables. In this section, we discussed the 
operations select (choose rows specified by a given condition), project (choose columns 
specified by a given condition), and join (combine rows from two tables as specified by 
a given condition). 


3.6 Review Exercises 


1. What is an n-ary relation? . What is a query? 


3. What is a relational database? 


5 

2. What is a database management system? 6. Explain how the selection operator works and give an example. 
7. Explain how the project operator works and give an example. 
8 


4. What is a key? . Explain how the join operator works and give an example. 
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1. Express the relation given by Table 3.6.4 as a set of n-tuples. 


TABLE 3.6.4 m EMPLOYEE 


ID Name Manager 
1089 Suzuki Zamora 
5620 Kaminski Jones 

9354 Jones Yu 

9551 Ryan Washington 
3600 Beaulieu Yu 

0285 Schmidt Jones 

6684 Manacotti Jones 


2. Express the relation given by Table 3.6.5 as a set of n-tuples. 


TABLE 3.6.5 m DEPARTMENT 


Dept Manager 
23 Jones 
04 Yu 
96 Zamora 
66 Washington 


3. Express the relation given by Table 3.6.6 as a set of n-tuples. 


TABLE 3.6.6 m SUPPLIER 


Dept Part No Amount 
04 335B2 220 
23 2A 14 
04 8C200 302 
66 42C 3 
04 900 7720 
96 20A8 200 
96 1199C 296 
23 7712 39 


4. Express the relation given by Table 3.6.7 as a set of n-tuples. 


TABLE 3.6.7 m BUYER 


Name Part No 
United Supplies 2A 
ABC Unlimited 8C200 
United Supplies 1199C 
JCN Electronics 2A 
United Supplies 335B2 
ABC Unlimited 772 
Danny’s 900 
United Supplies 772 
Underhanded Sales 20A8 
Danny’s 20A8 
DePaul University 42C 
ABC Unlimited 20A8 


In Exercises 5-20, write a sequence of operations to answer the 
query. Also, provide an answer to the query. Use Tables 3.6.4— 
3.6.7. 


22. 


23. 


24. 


. Find the names of all employees. (Do not include any man- 


agers.) 


. Find the names of all managers. 

. Find all part numbers. 

. Find the names of all buyers. 

. Find the names of all employees who are managed by Jones. 
. Find all part numbers supplied by department 96. 

. Find all buyers of part 20A8. 

. Find all employees in department 04. 


. Find the part numbers of parts of which there are at least 100 


items on hand. 


. Find all department numbers of departments that supply parts 


to Danny’s. 


. Find the part numbers and amounts of parts bought by United 


Supplies. 


. Find all managers of departments that produce parts for ABC 


Unlimited. 


. Find the names of all employees who work in departments that 


supply parts for JCN Electronics. 


. Find all buyers who buy parts in the department managed by 


Jones. 


. Find all buyers who buy parts that are produced by the depart- 


ment for which Suzuki works. 


. Find all part numbers and amounts for Zamora’s department. 


. Make up at least three n-ary relations with artificial data that 


might be used in a medical database. Illustrate how your 
database would be used by posing and answering two queries. 
Also, write a sequence of operations that could be used to an- 
swer the queries. 


Describe a union operation on a relational database. Illustrate 
how your operator works by answering the following query, 
using the relations of Tables 3.6.4—3.6.7: Find the names of 
all employees who work in either department 23 or depart- 
ment 96. Also, write a sequence of operations that could be 
used to answer the query. 


Describe an intersection operation on a relational database. 
Illustrate how your operator works by answering the follow- 
ing query, using the relations of Tables 3.6.4—3.6.7: Find the 
names of all buyers who buy both parts 2A and 1199C. Also, 
write a sequence of operations that could be used to answer 
the query. 

Describe a difference operation on a relational database. II- 
lustrate how your operator works by answering the follow- 
ing query, using the relations of Tables 3.6.4—3.6.7: Find the 
names of all employees who do not work in department 04. 
Also, write a sequence of operations that could be used to an- 
swer the query. 
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Chapter 3 Notes 


Most general references on discrete mathematics address the topics of this chapter. [Halmos; 
Lipschutz; and Stoll] are recommended to the reader wanting to study functions in more 
detail. [Codd; Date; Kroenke; and Ullman] are recommended references on databases in 
general and the relational model in particular. 


Chapter 3 Review 


Section 3.1 


1. 


CSRAFNAMEWHN 


Function from X to Y, f : X — Y: a subset f of X x Y 
such that for each x € X, there is exactly one y € Y with 
@yef 


x mod y: remainder when x is divided by y 


. Hash function 

. Collision for a hash function H: H(x) = H(y) 

. Collision resolution policy 

. Floor of x, |x]: greatest integer less than or equal to x 

. Ceiling of x, [x]: least integer greater than or equal to x 

. One-to-one function f: if f(x) = f(x’), then x = x’ 

. Onto function f from X to Y: range of f = Y 

. Bijection: one-to-one and onto function 

. Inverse f~! of a one-to-one, onto function f : {(y,x) | 


(y) €f} 


. Composition of functions: f o g = {(x,z) | (x,y) € g and 


(y,2) €f} 


. Binary operator on X: function from X x X to X 
. Unary operator on X: function from X to X 


Section 3.2 


. Sum or sigma notation: y Qj = An + mpi ++: 


. Sequence: function whose domain is a subset of integers 
. Index: in the sequence {s,}, m is the index 

. Increasing sequence: if i < j, then s; < s; 

. Decreasing sequence: if i < j, then s; > s; 

. Nonincreasing sequence: if i < j, then s; = 5; 

. Nondecreasing sequence: if i < j, then s; < s; 

. Subsequence s,, of the sequence {s,,} 


n 


t an 


i=m 
n 


. Product notation: | | Qj = Am * Gn4i+** An 


. Geometric sum: y ar’ 


i=0 


. String: finite sequence 

. Null string, A: string with no elements 

. X*: set of all strings over X, including the null string 

. Xt: set of all nonnull strings over X 

. Length of string a, |w|: number of elements in w 

. Concatenation of strings a and 6, af: a followed by 6 

. Substring of aw: a string 6 for which there are strings y and 


6 witha = yBd 


Section 3.3 


. Binary relation from X to Y: set of ordered pairs (x, y), 


xEexX,yeY 


. Digraph of a binary relation 
. Reflexive relation R on X: (x, x) € R for all x € X 
. Symmetric relation R on X: for all x, y € X, if (x,y) € R, 


then (y,x) ER 


. Antisymmetric relation R on X: for all.x, y € X,if (x,y) ER 


and (y, x) € R, thenx = y 


. Transitive relation R on X: for all x, y, z € X, if (x, y) and 


(y, z) are in R, then (x, z) ER 


. Partial order: relation that is reflexive, antisymmetric, and 


transitive 


. Inverse relation R~!: {(y, x) | (x, y) € R} 
. Composition of relations Rz o Ry: {(x, z) | (x,y) € Ry and 


(y, z) € Ro} 


Section 3.4 


. Equivalence relation: relation that is reflexive, symmetric, 


and transitive 


. Equivalence class containing a given by equivalence rela- 


tion R: [a] = {x | xRa} 


. Equivalence classes partition the set (Theorem 3.4.8) 


Section 3.5 


44. 
45. 


46. 


47. 


Matrix of a relation 

R is a reflexive relation if and only if the main diagonal of 
the matrix of R consists of 1’s. 

R is a symmetric relation if and only if the matrix of R is 
symmetric about the main diagonal. 

If A; is the matrix of the relation R; and A> is the matrix of 
the relation Ro, the matrix of the relation R2 o R; is obtained 
by replacing each nonzero term in the matrix product A; A2 
by lL. 


. If A is the matrix of a relation R, R is transitive if and only 


if whenever entry i, j in A? is nonzero, entry i, j in A is also 
nonzero. 


Section 3.6 


. Key 


. Project 


. n-ary relation: Set of n-tuples 
. Database management system 
. Relational database 


53. Query 54. Select 


56. Join 


Chapter 3 Self-Test 171 


Chapter 3 Self-Test 


1. Let X be the set of strings over {a, b} of length 4 and let Y be 
the set of strings over {a, b} of length 3. Define a function f 
from X to Y by the rule 


f(a) = string consisting of the first three characters of a. 


Is f one-to-one? Is f onto? 
2. Let by = SL + 1? - 7. 
(a) Find bs and bj. 
(b) Find a formula for b,,. 
(c) Is b increasing? 
(d) Is b decreasing? 
In Exercises 3 and 4, determine whether the relation defined on 
the set of positive integers is reflexive, symmetric, antisymmet- 
ric, transitive, and/or a partial order. 
3. (x,y) € Rif 2 divides x + y 
4. (x,y) € Rif 3 divides x + y 


5. Is the relation 


{(, 1), Cl, 2), 2, 2), 4,4), 2, 1), G3, 3)} 


an equivalence relation on {1, 2, 3, 4}? Explain. 
6. Given that the relation 


{(1, 1), @, 2), 3, 3), (4,4, , 2), 2, D, GB, 4), 4, 3)} 


is an equivalence relation on {1, 2, 3, 4}, find [3], the equiv- 
alence class containing 3. How many (distinct) equivalence 
classes are there? 


7. Find real numbers x and y satisfying |x| |y] = Lxy] — 1. 
8. For the sequence a defined by a, = 2n + 2, find 


(a) d6 


‘f 

(b) Soa 
= 

(©) [ai 
i=1 


(d) a formula for the subsequence of a obtained by select- 
ing every other term of a starting with the first. 


9. Give examples of functions f and g such that f o g is onto, 
but g is not onto. 
10. Let w = cedde and B = c3d?. Find 
(a) of (b) Bo (c) |@| (d) |owBar| 


11. For the hash function h(x) = x mod 13, show how the data 
784, 281, 1141, 18, 1, 329, 620, 43, 31, 684 
would be inserted in the order given in initially empty cells 


indexed 0 to 12. 


12. Find the equivalence relation (as a set of ordered pairs) on 
{a, b,c, d, e} whose equivalence classes are 


{a}, {b,d,e},  {c}. 


13. Suppose that R is a relation on X that is symmetric and tran- 
sitive but not reflexive. Suppose also that |X| > 2. Define 
the relation R on X by R = X x X — R. Which of the fol- 
lowing must be true? For each false statement, provide a 
counterexample. 


(a) R is reflexive. 

(b) R is symmetric. 

(c) R is not antisymmetric. 
(d) R is transitive. 


14. Rewrite the sum 5~_,(n — dr' replacing the index i by k, 
where i = k + 2. 


15. Give an example of a relation on {1, 2, 3, 4} that is reflexive, 
not antisymmetric, and not transitive. 


16. Let R be the relation defined on the set of eight-bit strings 
by s; Rs provided that s; and s) have the same number of 
Zeros. 


(a) Show that R is an equivalence relation. 
(b) How many equivalence classes are there? 


(c) List one member of each equivalence class. 
Exercises 17-20 refer to the relations 


R; = {C, x), 2, x), 2, y), 3, y)}, 
Ry = {(x, a), (x, b), (y, a), (Y, ©)}. 


17. Find the matrix A of the relation Rj relative to the orderings 
152,35 ys 

18. Find the matrix A of the relation R) relative to the orderings 
HV; Gy DECr 


19. Find the matrix product A;Ao. 
20. Use the result of Exercise 19 to find the matrix of the rela- 
tion Ro o Ry. 


In Exercises 21-24, write a sequence of operations to answer 
the query. Also, provide an answer to the query. Use Tables 3.6.1 
and 3.6.2. 

21. Find all teams. 

22. Find all players’ names and ages. 

23. Find the names of all teams that have a pitcher. 


24. Find the names of all teams that have players aged 30 years 
or older. 
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1. Implement a hashing system for storing integers in an array. 


2. Write a program that determines whether a credit card num- 
ber has a valid check digit. 

3. Write a program that determines whether an ISBN has a 
valid check digit. 


4. Write a program that lists all transpositions of two distinct 
adjacent digits (e.g., 25 <> 52) that are not detected by 
the Luhn algorithm (i.e., transpositions that produce equal 
check digits). 


5. Write a program that lists all transpositions of two distinct 
adjacent digits (e.g., 25 <> 52) that are not detected by 
the ISBN check digit (i.e., transpositions that produce equal 
check digits in their ISBNs). 


6. Write a program that generates pseudorandom integers. 


In Exercises 7-12, assume that a sequence from {1,...,n} 
to the real numbers is represented as an array A, indexed 
from I ton. 

7. Write a program that tests whether A is one-to-one. 

8. Write a program that tests whether A is onto a given set. 

9. Write a program that tests whether A is increasing. 


10. Write a program that tests whether A is decreasing. 


11. 
12. 
13. 


14. 


15. 
16. 


17. 


18. 
19. 


20. 


21. 


22. 


Write a program that tests whether A is nonincreasing. 
Write a program that tests whether A is nondecreasing. 


Write a program to determine whether one sequence is a 
subsequence of another sequence. 


Write a program to determine whether one string is a sub- 
string of another string. 


Write a program to determine whether a relation is reflexive. 


Write a program to determine whether a relation is 
antisymmetric. 

Write a program to determine whether a relation is 
transitive. 

Write a program that finds the inverse of a relation. 

Write a program that finds the composition RoS of relations 
Rand S. 

Write a program that checks whether a relation R is an 
equivalence relation. If R is an equivalence relation, the pro- 
gram outputs the equivalence classes of R. 

Write a program to determine whether a relation is a func- 
tion from a set X to a set Y. 

[Project] Prepare a report on a commercial relational 
database such as Oracle or Access. 
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Chapter 4 


ALGORITHMS 


An algorithm is a step-by-step method of solving some problem. Such an approach 
to problem-solving is not new; indeed, the word “algorithm” derives from the name of 
the ninth-century Persian mathematician al-Khowarizmt. Today, “algorithm” typically 
refers to a solution that can be executed by a computer. In this book, we will be concerned 
primarily with algorithms that can be executed by a “traditional” computer, that is, a 
computer, such as a personal computer, with a single processor that executes instructions 
step-by-step. 

After introducing algorithms and providing several examples, we turn to the anal- 
ysis of algorithms, which refers to the time and space required to execute algorithms. 
We conclude by discussing recursive algorithms—algorithms that refer to themselves. 


Introduction 


Algorithms typically have the following characteristics: 


= Input The algorithm receives input. 
= Output The algorithm produces output. 
= Precision ‘The steps are precisely stated. 


= Determinism The intermediate results of each step of execution are unique and 
are determined only by the inputs and the results of the preceding steps. 


= Finiteness The algorithm terminates; that is, it stops after finitely many instruc- 
tions have been executed. 


= Correctness The output produced by the algorithm is correct; that is, the algo- 
rithm correctly solves the problem. 


= Generality The algorithm applies to a set of inputs. 


As an example, consider the following algorithm that finds the maximum of three num- 
bers a, b, and c: 

1. large =a. 

2. If b > large, then large = b. 

3. Ifc > large, then large = c. 


(As explained in Appendix C, = is the assignment operator.) 
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The idea of the algorithm is to inspect the numbers one by one and copy the largest 
value seen into a variable large. At the conclusion of the algorithm, /arge will then be 
equal to the largest of the three numbers. 

We show how the preceding algorithm executes for some specific values of a, b, 
and c. Such a simulation is called a trace. First suppose that a = 1, b = 5, andc = 3. 
At line 1, we set large to a (1). At line 2, b > large (5 > 1) is true, so we set large to 
b (5). At line 3, c > large (3 > 5) is false, so we do nothing. At this point large is 5, the 
largest of a, b, and c. 

Suppose that a = 6, b = 1, and c = 9. At line 1, we set large to a (6). At line 2, 
b > large (1 > 6) is false, so we do nothing. At line 3, c > large (9 > 6) is true, so we 
set large to 9. At this point large is 9, the largest of a, b, and c. 

We verify that our example algorithm has the properties set forth at the beginning 
of this section. 

The algorithm receives the three values a, b, and c as input and produces the value 
large as output. 

The steps of the algorithm are stated sufficiently precisely so that the algorithm 
could be written in a programming language and executed by a computer. 

Given values for the input, each intermediate step of an algorithm produces a 
unique result. For example, given the values a = 1, b = 5, and c = 3, at line 2, large 
will be set to 5 regardless of who executes the algorithm. 

The algorithm terminates after finitely many steps (three steps) correctly answer- 
ing the given question (find the largest of the three values input). 

The algorithm is general; it can find the largest value of any three numbers. 

Our description of what an algorithm is will suffice for our needs in this book. 
However, it should be noted that it is possible to give a precise, mathematical definition 
of “algorithm” (see the Notes for Chapter 12). 

Although ordinary language is sometimes adequate to specify an algorithm, most 
mathematicians and computer scientists prefer pseudocode because of its precision, 
structure, and universality. Pseudocode is so named because it resembles the actual code 
of computer languages such as C++ and Java. There are many versions of pseudocode. 
Unlike actual computer languages, which must be concerned about semicolons, upper- 
case and lowercase letters, special words, and so on, any version of pseudocode is ac- 
ceptable as long as its instructions are unambiguous. Our pseudocode is described in 
detail in Appendix C. 

As our first example of an algorithm written in pseudocode, we rewrite the first 
algorithm in this section, which finds the maximum of three numbers. 


Finding the Maximum of Three Numbers 


This algorithm finds the largest of the numbers a, b, and c. 
Input: a,b,c 


Output: Jarge (the largest of a, b, and c) 


max3(a, b,c) { 
large =a 
if (b > large) // if b is larger than large, update large 
large =b 
if (c > large) // if c is larger than large, update large 
large =c 
return large 


Go SI ey En gs POS 


} 


Algorithm 4.1.2 
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Our algorithms consist of a title, a brief description of the algorithm, the input 
to and output from the algorithm, and the functions containing the instructions of the 
algorithm. Algorithm 4.1.1 consists of a single function. To make it convenient to refer to 
individual lines within a function, we sometimes number some of the lines. The function 
in Algorithm 4.1.1 has eight numbered lines. 

When the function in Algorithm 4.1.1 executes, at line 2 we set large to a. At 
line 3, b and large are compared. If b is greater than large, we execute line 4 


large = b 


but if b is not greater than /arge, we skip to line 5. At line 5, c and large are compared. 
If c is greater than /arge, we execute line 6 


large =c 


but if c is not greater than /arge, we skip to line 7. Thus when we arrive at line 7, large 
will correctly hold the largest of a, b, and c. 

At line 7 we return the value of large, which is equal to the largest of the numbers 
a, b, and c, to the invoker of the function and terminate the function. Algorithm 4.1.1 
has correctly found the largest of three numbers. 

The method of Algorithm 4.1.1 can be used to find the largest value in a 
sequence. 


Finding the Maximum Value in a Sequence 
This algorithm finds the largest of the numbers 51, ..., Sn. 


Input: s,n 
Output: /arge (the largest value in the sequence s) 


max(s, n) { 
large = s1 
fom — 2ton1 
if (s; > large) 
large = 5; 
return large 


} 


We verify that Algorithm 4.1.2 is correct by proving that 
large is the largest value in the subsequence s),..., 5; (4.1.1) 


is a loop invariant using induction on 7. 

For the Basis Step (i = 1), we note that just before the for loop begins executing, 
large is set to s,; so large is surely the largest value in the subsequence s}. 

Assume that /arge is the largest value in the subsequence s1,..., sj. If i < nis 
true (so that the for loop body executes again), i becomes i+ 1. Suppose first that s;, > 
large. It then follows that s;,; is the largest value in the subsequence s, ..., 5;, 8:41. In 
this case, the algorithm assigns large the value s;,;. Now large is equal to the largest 
value in the subsequence s),..., 5;, 5:41. Suppose next that s;;, < large. It then fol- 
lows that /arge is the largest value in the subsequence sj, ..., 5;, Si+1. In this case, the 
algorithm does not change the value of Jarge; thus, large is the largest value in the 
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subsequence 5), ..., 5;,.5;41. We have proved the Inductive Step. Therefore, (4.1.1) is 
a loop invariant. 

The for loop terminates when i = n. Because (4.1.1) is a loop invariant, at this 
point /arge is the largest value in the sequence 5, ..., 5,. Therefore, Algorithm 4.1.2 is 
correct. 


4.1 Problem-Solving Tips 


To construct an algorithm, it is often helpful to assume that you are in the middle of 
the algorithm and part of the problem has been solved. For example, in finding the 
largest element in a sequence s;,..., 5, (Algorithm 4.1.2), it was helpful to assume 
that we had already found the largest element Jarge in the subsequence sj, ..., sj. 
Then, all we had to do was look at the next element s;,, and, if s;,; was larger than 
large, we simply updated Jarge. If s;+; was not larger than large, we did not mod- 
ify /arge. Iterating this procedure yields the algorithm. These observations also led to 
the loop invariant (4.1.1) which allowed us to prove that Algorithm 4.1.2 is 
correct. 


4.1 Review Exercises 


= 


. What is an algorithm? 


4. What are the advantages of pseudocode over ordinary text in 
writing an algorithm? 


n 


ue 


. Describe the following properties an algorithm typically has: 


input, output, precision, determinism, finiteness, correctness, 
and generality. 


. What is a trace of an algorithm? 


5. How do algorithms relate to pseudocode functions? 


1. Consult the instructions for connecting a DVD or Blu- 


ray player to a TV. Which properties of an algorithm— 
input, output, precision, determinism, finiteness, correctness, 
generality—are present? Which properties are lacking? 


- Consult the instructions for adding a contact to a cell phone. 


Which properties of an algorithm—input, output, precision, 
determinism, finiteness, correctness, generality—are present? 
Which properties are lacking? 


. Goldbach’s conjecture states that every even number greater 


than 2 is the sum of two prime numbers. Here is a pro- 
posed algorithm that checks whether Goldbach’s conjecture is 
true: 


1. Letn = 4. 

2. If nis not the sum of two primes, output “no” and stop. 

3. Else increase n by 2 and continue with step 2. 

4. Output “yes” and stop. 
Which properties of an algorithm—input, output, precision, 
determinism, finiteness, correctness, generality—does this 
proposed algorithm have? Do any of them depend on the truth 


of Goldbach’s conjecture (which mathematicians have not yet 
settled)? 


4. 


Write an algorithm that finds the smallest element among a, b, 
and c. 


. Write an algorithm that finds the second-smallest element 


among a, b, and c. Assume that the values of a, b, and c are 
distinct. 


. Write an algorithm that returns the smallest value in the se- 


quence 51,..., Sp. 


. Write an algorithm that returns the largest and second-largest 


values in the sequence s;,..., 5,. Assume that n > 1 and the 
values in the sequence are distinct. 


. Write an algorithm that returns the smallest and second- 


smallest values in the sequence s1,..., S,. Assume thatn > 1 
and the values in the sequence are distinct. 


. Write an algorithm that outputs the smallest and largest values 


in the sequence 51, ..., Sp. 


. Write an algorithm that returns the index of the first occur- 


rence of the largest element in the sequence s1, ..., 8,. Exam- 
ple: If the sequence is 6.2, 8.9, 4.2, 8.9, the algorithm returns 
the value 2. 


. Write an algorithm that returns the index of the last oc- 


currence of the largest element in the sequence s1,..., Sp. 


12. 


13. 


14. 


15. 


16. 


Example: If the sequence is 6.2, 8.9, 4.2, 8.9, the algorithm 
returns the value 4. 


Write an algorithm that returns the sum of the sequence of 
numbers s;,..., 5 Sa 


Write an algorithm that returns the index of the first item that 
is less than its predecessor in the sequence 51, ..., S,. If s is in 
nondecreasing order, the algorithm returns the value 0. Exam- 
ple: If the sequence is 


AMY BRUNO ELIE DAN ZEKE, 


the algorithm returns the value 4. 


Write an algorithm that returns the index of the first item that 
is greater than its predecessor in the sequence s1,..., Sy. If s 
is in nonincreasing order, the algorithm returns the value 0. 
Example: If the sequence is 


AMY BRUNO ELIE DAN ZEKE, 


the algorithm returns the value 2. 


Write an algorithm that reverses the sequence s1,..., Sp. 
Example: If the sequence is 


AMY BRUNO ELIE, 
the reversed sequence is 


ELIE BRUNO AMY. 


Write the standard method of adding two positive decimal 
integers, taught in elementary schools, as an algorithm. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


25. 
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Write an algorithm that receives as input the n x n matrix A 
and outputs the transpose A’. 


Write an algorithm that receives as input the matrix of a rela- 
tion R and tests whether R is reflexive. 


Write an algorithm that receives as input the matrix of a rela- 
tion R and tests whether R is symmetric. 


Write an algorithm that receives as input the matrix of a rela- 
tion R and tests whether R is transitive. 


Write an algorithm that receives as input the matrix of a rela- 
tion R and tests whether R is antisymmetric. 


Write an algorithm that receives as input the matrix of a rela- 
tion R and tests whether R is a function. 


Write an algorithm that receives as input the matrix of a rela- 
tion R and produces as output the matrix of the inverse relation 
Ro, 

Write an algorithm that receives as input the matrices of re- 
lations R; and R2 and produces as output the matrix of the 
composition R2 o R}. 


Write an algorithm whose input is a sequence s1,..., 5, and 
a value x. (Assume that all the values are real numbers.) The 
algorithm returns true if s; + s; = x, for some i 4 j, and false 
otherwise. Example: If the input sequence is 2, 12, 6, 14 and 
x = 26, the algorithm returns true because 12 + 14 = 26. 
If the input sequence is 2, 12,6,14 and x = 4, the algo- 
rithm returns false because no distinct pair in the sequence 
sums to 4. 


4.2 Examples of Algorithms 


Algorithms have been devised to solve many problems. In this section, we give examples 
of several useful algorithms. Throughout the remainder of the book, we will investigate 
many additional algorithms. 


Searching 


A large amount of computer time is devoted to searching. When a teller looks for a record 
in a bank, a computer program searches for the record. Looking for a solution to a puzzle 
or for an optimal move in a game can be stated as a searching problem. Using a search 
engine on the web is another example of a searching problem. Looking for specified text 
in a document when running a word processor is yet another example of a searching 
problem. We discuss an algorithm to solve the text-searching problem. 

Suppose that we are given text f (e.g., a word processor document) and we want 
to find the first occurrence of pattern p in ¢ (e.g., we want to find the first occurrence of 
the string p = “Nova Scotia” in f) or determine that p does not occur in ft. We index the 
characters in f starting at 1. One approach to searching for p is to check whether p occurs 
at index | in t. If so, we stop, having found the first occurrence of p in ¢. If not, we check 
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whether p occurs at index 2 in r. If so, we stop, having found the first occurrence of p in 
t. If not, we next check whether p occurs at index 3 in f, and so on. 
We state the text-searching algorithm as Algorithm 4.2.1. 


Text Search 


This algorithm searches for an occurrence of the pattern p in text f. It returns the 
smallest index i such that p occurs in f starting at index i. If p does not occur in ¢, it 
returns 0. 


Input: p (indexed from 1 to m), m, t (indexed from 1 ton), n 
Output: i 


text_search(p, m, t, n) { 
fori=lton—m+1{ 
jul 


// i is the index in ¢ of the first character of the substring 
// to compare with p, and j is the index in p 


// the while loop compares fj - «+ ti+m—1 and pj +++ Pm 
while (¢;,;-1 == p;) { 
j=jtl 
if Gj > m) 
return i 


return 0 


The variable 7 marks the index in ¢ of the first character of the substring to compare 
with p. The algorithm first tries i = 1, then i = 2, and so on. Index n — m + 1 is the last 
possible value for i since, at this point, the string f)—m+1t—m42 ++ tn has length exactly m. 

After the value of i is set, the while loop compares f; + - - fi4n—1 and p; -- + Dm. If the 
characters match, 


litj-1 == Pj 
j is incremented 
j=j+l 


and the next characters are compared. If j is m+ 1, all m characters have matched and 
we have found p at index j in ¢. In this case, the algorithm returns 7: 


if G > m) 


return 7 


If the for loop runs to completion, a match was never found; so the algorithm returns 0. 


Figure 4.2.1 shows a trace of Algorithm 4.2.1 where we are searching for the pattern 
“001” in the text “010001”. 
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j=l j=2 j=l 
1 1 0x) 1 (x) 
001 001 001 
010001 010001 010001 
t iy t 
i=l r=1 i=2 
(1) (2) (3) 
j=l j=2 i= 
1 1 + (x) 
001 001 001 
010001 010001 010001 
t t t 
i= 3 L=3 i 
(4) (5) (6) 
foi j=2 j=3 
001 001 001 
010001 010001 010001 
t t i 
i=4 i=4 i=4 
(7) (8) (9) 
Figure 4.2.1 Searching for “001” in “010001” using Algorithm 4.2.1. 
The cross (x) in steps (2), (3), and (6) marks a mismatch. 4 
Sorting 


To sort a sequence is to put it in some specified order. If we have a sequence of names, 
we might want the sequence sorted in nondecreasing order according to dictionary order. 
For example, if the sequence is 


Jones, Johnson, Appel, Zamora, Chu, 
after sorting the sequence in nondecreasing order, we would obtain 
Appel, Chu, Johnson, Jones, Zamora. 


A major advantage of using a sorted sequence rather than an unsorted sequence is that 
it is much easier to find a particular item. Imagine trying to find the phone number 
of a particular individual in the New York City telephone book if the names were not 
sorted! 

Many sorting algorithms have been devised (see, e.g., [Knuth, 1998b]). 
Which algorithm is preferred in a particular situation depends on factors such as the 
size of the data and how the data are represented. We discuss insertion sort, which 
is one of the fastest algorithms for sorting small sequences (less than 50 or so 
items). 

We assume that the input to insertion sort is s;,..., 5, and that the goal is to sort 
the data in nondecreasing order. At the ith iteration of insertion sort, the first part of 
the sequence s,,..., 5; will have been rearranged so that it is sorted. (We will explain 


180 


Chapter 4 @ Algorithms 


shortly how s;,..., 8; gets sorted.) Insertion sort then inserts s;; in s,,..., 5; So that 
S1,..., Si, Si41 18 sorted. 
For example, suppose that i = 4 and 51, ..., 54 is 
8 | 13 | 20 | 27 
If ss is 16, after it is inserted, s,,..., 55 becomes 


8 | 13 | 16 | 20 | 27 


Notice that 20 and 27, being greater than 16, move one index to the right to make room 
for 16. Thus the “insert” part of the algorithm is: Beginning at the right of the sorted 
subsequence, move an element one index to the right if it is greater than the element to 
insert. Repeat until reaching the first index or encountering an element that is less than 
or equal to the element to insert. 

For example, to insert 16 in 


we first compare 16 and 27. Since 27 is greater than 16, 27 moves one index to the 
right: 


We next compare 16 with 20. Since 20 is greater than 16, 20 moves one index to the 
right: 


We next compare 16 with 13. Since 13 is less than or equal to 16, we insert (i.e., copy) 
16 to the third index: 


This subsequence is now sorted. 

Having explained the key idea of insertion sort, we now complete the explanation 
of the algorithm. Insertion sort begins by inserting 52 into the subsequence s;. Note that s 
by itself is sorted! Now s1, s2 is sorted. Next, insertion sort inserts s3 into the now-sorted 
subsequence $1, 52. Now 81, 82, 83 is sorted. This procedure continues until insertion sort 
inserts s, into the sorted subsequence 51, ..., 8,1. Now the entire sequence s1,..., Sy 
is sorted. We obtain the following algorithm. 
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Insertion Sort 
This algorithm sorts the sequence 51, ..., 5, in nondecreasing order. 


Input: s,n 
Output: 5 (sorted) 


insertion_sort(s, n) { 
fonr—2itom{ 
val = s; // save s; so it can be inserted into the correct place 
j=i-l 
// if val < sj, move s; right to make room for s; 
while (j = 1 A val < sj) { 
Sarl = oy 
j=j-1 
} 


S;41 = val // insert val 


We leave proving that Algorithm 4.2.3 is correct as an exercise (see Exercise 14). 


Time and Space for Algorithms 


It is important to know or be able to estimate the time (e.g., the number of steps) and 
space (e.g., the number of variables, length of the sequences involved) required by algo- 
rithms. Knowing the time and space required by algorithms allows us to compare algo- 
rithms that solve the same problem. For example, if one algorithm takes n steps to solve 
a problem and another algorithm takes n? steps to solve the same problem, we would 
surely prefer the first algorithm, assuming that the space requirements are acceptable. In 
Section 4.3, we will give the technical definitions that allow us to make rigorous state- 
ments about the time and space required by algorithms. 

The for loop in Algorithm 4.2.3 always executes n — | times, but the number of 
times that the while loop executes for a particular value of i depends on the input. Thus, 
even for a fixed size n, the time required by Algorithm 4.2.3 depends on the input. For 
example, if the input sequence is already sorted in nondecreasing order, 


val < 8; (4.2.1) 


will always be false and the body of the while loop will never be executed. We call this 
time the best-case time. 

On the other hand, if the sequence is sorted in decreasing order, (4.2.1) will always 
be true and the while loop will execute the maximum number of times. (The while loop 
will execute i — | times during the ith iteration of the for loop.) We call this time the 
worst-case time. 


Randomized Algorithms 


It is occasionally necessary to relax the requirements of an algorithm stated in Sec- 
tion 4.1. Many algorithms currently in use are not general, deterministic, or even finite. 
An operating system (e.g., Windows), for example, is better thought of as a program that 
never terminates rather than as a finite program with input and output. Algorithms written 
for more than one processor, whether for a multiprocessor machine or for a distributed 
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environment (such as the internet), are rarely deterministic, for example, because of dif- 
ferent execution speeds of the processors. Also, many practical problems are too difficult 
to be solved efficiently, and compromises either in generality or correctness are neces- 
sary. As an illustration, we present an example that shows the usefulness of allowing an 
algorithm to make random decisions, thereby violating the requirement of determinism. 

A randomized algorithm does not require that the intermediate results of each 
step of execution be uniquely defined and depend only on the inputs and results of the 
preceding steps. By definition, when a randomized algorithm executes, at some points 
it makes random choices. In practice, a pseudorandom number generator is used (see 
Example 3.1.16). 

We shall assume the existence of a function rand(i, j), which returns a random 
integer between the integers 7 and j, inclusive. As an example, we describe a random- 
ized algorithm that shuffles a sequence of numbers. More precisely, it inputs a sequence 


a1,..., A, and moves the numbers to random positions. Major bridge tournaments use 
computer programs to shuffle the cards. 
The algorithm first swaps (i.e., interchanges the values of) a, and Gyana(1n). At this 


point, the value of a; might be equal to any one of the original values in the sequence. 
Next, the algorithm swaps a2 and Gyana(2,n). Now the value of az might be equal to any 
of the remaining values in the sequence. The algorithm continues in this manner until it 
swap d,—1 and Ayand(n—1,n)- Now the entire sequence is shuffled. 


Shuffle 
This algorithm shuffles the values in the sequence 
Oils eoan Chas 
Input: a,n 


Output: a (shuffled) 


shuffle(a, n) { 
fori=lton—1 
swap (ai, Qrand(i,n)) 


Suppose that the sequence a 


17} 9 | 5 | 23 21 | 


is input to shuffle. We first swap a; and a;, where i = 1 and j = rand(1, 5). Ifj = 3, after 
the swap we have 


5 | 9 | 17) 23 21 | 
tof 
J 


i 


Next, i = 2. If j = rand(2,5) = 5, after the swap we have 


5 | 21 | 17} 23) 9 
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Next, i = 3. If j = rand(3, 5) = 3, the sequence does not change. 
Finally, i = 4. If j = rand(4, 5) = 5, after the swap we have 


5 | 21 | 17] 9 | 23 
tf 
J 


I 


Notice that the output (i.e., the rearranged sequence) depends on the random choices 
made by the random number generator. < 


Randomized algorithms can be used to search for nonrandom goals. For example, 
a person searching for the exit in a maze could randomly make a choice at each inter- 
section. Of course, such an algorithm might not terminate (because of bad choices at the 
intersections). In Chapter 8, Graph Theory, we will present a randomized algorithm that 
searches for a particular structure in a graph (see Algorithm 8.3.10). 


4.2 Problem-Solving Tips 


Again, we emphasize that to construct an algorithm, it is often helpful to assume that 
you are in the middle of the algorithm and that part of the problem has been solved. In 
insertion sort (Algorithm 4.2.3), it was helpful to assume that the subsequence sj, ..., sj 
was sorted. Then, all we had to do was insert the next element s;,, in the proper place. 
Iterating this procedure yields the algorithm. These observations lead to a loop invariant 


that can be used to prove that Algorithm 4.2.3 is correct (see Exercise 14). 


4.2 Review Exercises 


1. Give examples of searching problems. 
2. What is text searching? 


3. Describe, in words, an algorithm that solves the text-searching 
problem. 


4. What does it mean to sort a sequence? 


5. Give an example that illustrates why we might want to sort a 
sequence. 


6. Describe insertion sort in words. 


7. What do we mean by the time and space required by an 
algorithm? 


. Why is it useful to know or be able to estimate the time and 


space required by an algorithm? 


. Why is it sometimes necessary to relax the requirements of an 


algorithm as stated in Section 4.1? 


. What is a randomized algorithm? 


. Which requirements of an algorithm as stated in Section 4.1 


does a randomized algorithm violate? 


. Describe the shuffle algorithm in words. 


. Give an application of the shuffle algorithm. 


1. Trace Algorithm 4.2.1 for the input tf = “balalaika” and 


. Trace Algorithm 4.2.3 for the input 34, 55, 144, 259. 


p = “bala”. 7. Trace Algorithm 4.2.3 for the input 34, 34, 34, 34. 

2. Trace Algorithm 4.2.1 for the input tf = “balalaika” and 8. Trace Algorithm 4.2.4 for the input 34, 57, 72, 101, 135. 
p= “lai”. Assume that the values of rand are 

3. Trace Algorithm 4.2.1 for the input tf = “000000000” and 
p= “001”. rand(1,5)=5,  rand(2,5) = 4, 

4. Trace Algorithm 4.2.3 for the input 34, 20, 144, 55. TERNS AVS Sy, RAE, 9s 

5. Trace Algorithm 4.2.3 for the input 34, 20, 19, 5. 9. Trace Algorithm 4.2.4 for the input 34, 57, 72, 101, 135. 
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10. 


11. 


12. 


13. 
14. 
15. 


16. 
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Assume that the values of rand are 


rand(1, 5) = 2, 
rand(3, 5) = 3, 


rand(2,5) = 5, 
rand(4, 5) = 4. 


Trace Algorithm 4.2.4 for the input 34,57, 72, 101, 135. 
Assume that the values of rand are 


rand(1, 5) = 5, 
rand(3, 5) = 4, 


rand(2, 5) = 5, 
rand(4, 5) = 4. 


Is it possible that Algorithm 4.2.4 sorts the input 
67, 32, 6, 89, 52 in increasing order? If so, show possible val- 
ues for rand that perform the sort. It not, prove that Algorithm 
4.2.4 cannot sort this input in increasing order. 


Is it possible that Algorithm 4.2.4 sorts the input 
67, 32, 6, 89,52 in decreasing order? If so, show possible 
values for rand that perform the sort. It not, prove that Algo- 
rithm 4.2.4 cannot sort this input in decreasing order. 


Prove that Algorithm 4.2.1 is correct. 
Prove that Algorithm 4.2.3 is correct. 
Write an algorithm that returns the index of the first occur- 
rence of the value key in the sequence 51, ..., S,. If key is not 
in the sequence, the algorithm returns the value 0. Example: 


If the sequence is 12, 11, 12, 23 and key is 12, the algorithm 
returns the value 1. 


Write an algorithm that returns the index of the last occurrence 
of the value key in the sequence s;,...,5,. If key is not in 


17. 


18. 


19. 
20. 
21. 


22. 


23. 


24. 


the sequence, the algorithm returns the value 0. Example: If 
the sequence is 12, 11, 12, 23 and key is 12, the algorithm re- 
turns the value 3. 

Write an algorithm whose input is a sequence 5, ..., 5 Sy sorted 
in nondecreasing order and a value x. (Assume that all the 
values are real numbers.) The algorithm inserts x into the se- 
quence so that the resulting sequence is sorted in nondecreas- 
ing order. Example: If the input sequence is 2, 6, 12, 14 and 
x = 5, the resulting sequence is 2, 5, 6, 12, 14. 


Modify Algorithm 4.2.1 so that it finds all occurrences of 
pint. 


Describe best-case input for Algorithm 4.2.1. 


Describe worst-case input for Algorithm 4.2.1. 


Modify Algorithm 4.2.3 so that it sorts the sequence 51, ..., Sy 
in nonincreasing order. 
The selection sort algorithm sorts the sequence s1,..., 5 Sy in 


nondecreasing order by first finding the smallest item, say s;, 
and placing it first by swapping s; and _5;. It then finds the 
smallest item in s2,..., 5), again say s;, and places it second 
by swapping s2 and s;. It continues until the sequence is sorted. 
Write selection sort in pseudocode. 


Trace selection sort (see Exercise 22) for the input of 
Exercises 4—7. 


Show that the time for selection sort (see Exercise 22) is the 
same for all inputs of size n. 


4.3 Analysis of Algorithms 


A computer program, even though derived from a correct algorithm, might be useless for 
certain types of input because the time needed to run the program or the space needed 
to hold the data, program variables, and so on, is too great. Analysis of an algorithm 
refers to the process of deriving estimates for the time and space needed to execute the 
algorithm. In this section we deal with the problem of estimating the time required to 


execute algorithms. 


Suppose that we are given a set X of n elements, some labeled “red” and some 
labeled “black,” and we want to find the number of subsets of X that contain at least one 
red item. Suppose we construct an algorithm that examines all subsets of X and counts 
those that contain at least one red item and then implement this algorithm as a computer 
program. Since a set that has n elements has 2” subsets (see Theorem 2.4.6), the program 
would require at least 2” units of time to execute. It does not matter what the units of 
time are—2” grows so fast as n increases (see Table 4.3.1) that, except for small values 
of n, it would be impractical to run the program. 

Determining the performance parameters of a computer program is a difficult task 
and depends on a number of factors such as the computer that is being used, the way 
the data are represented, and how the program is translated into machine instructions. 
Although precise estimates of the execution time of a program must take such factors 
into account, useful information can be obtained by analyzing the time of the underlying 


algorithm. 


The time needed to execute an algorithm is a function of the input. Usually, it is 
difficult to obtain an explicit formula for this function, and we settle for less. Instead of 
dealing directly with the input, we use parameters that characterize the size of the input. 
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TABLE 4.3.1 ™ Time to Execute an Algorithm if One Step Takes 1 Microsecond to Execute. Ign Denotes log, n 
(the logarithm of n to base 2) 


Number of Steps 
to Termination 
for Input of Size n 3 
1 10-6 sec 
Iglgn 10~® sec 
Ign 2 x 10-6 sec 
n 3 x 10~° sec 
nign 5 x 1076 sec 
in 9 x 10-6 sec 
n? 3 x 107 sec 
Qe 8 x 10~° sec 
50 
1 10-6 sec 
Iglgn 2 x 107° sec 
Ign 6 x 10-6 sec 
n 5 x 10> sec 
nign 3 x 107+ sec 
in 3 x 1073 sec 
n? 0.13 sec 
2 36 yr 


Time to Execute ifn = 


6 9 12 
10~° sec 10~® sec 10~° sec 
10~° sec 2 x 107° sec 2 x 107° sec 
3 x 107° sec 3 x 107° sec 4 x 107° sec 
6 x 10~° sec 9 x 10~® sec 10~> sec 
2 x 107> sec 3 x 1075 sec 4 x 1075 sec 
4 x 1075 sec 8 x 107 sec 10-4 sec 
2 x 1074 sec 7 x 1074 sec 2 x 1073 sec 
6 x 1075 sec 5 x 1074 sec 4 x 1073 sec 
100 1000 105 10° 
10~° sec 10~° sec 10~® sec 10-® sec 
3 x 10-6 sec 3 x 10-6 sec 4x 107° sec 4x 107° sec 
7x 107° sec 1075 sec 2 x 1075 sec 2 x 1075 sec 
10-4 sec 1073 sec 0.1 sec 1 sec 
7 x 10-4 sec 10~? sec 2 sec 20 sec 
0.01 sec 1 sec 3 hr 12 days 
1 sec 16.7 min 32 yr 31,710 yr 
4 x 10!6 yr 3 x 10787 yr 3 x 1030089 yr 3 x 19701016 yp 


Example 4.3.1 


For example, if the input is a set containing n elements, we would say that the size of the 
input is n. We can ask for the minimum time needed to execute the algorithm among all 
inputs of size n. This time is called the best-case time for inputs of size n. We can also 
ask for the maximum time needed to execute the algorithm among all inputs of size n. 
This time is called the worst-case time for inputs of size n. Another important case is 
average-case time—the average time needed to execute the algorithm over some finite 
set of inputs all of size n. 

Since we are primarily concerned with estimating the time of an algorithm rather 
than computing its exact time, as long as we count some fundamental, dominating steps 
of the algorithm, we will obtain a useful measure of the time. For example, if the principal 
activity of an algorithm is making comparisons, as might happen in a sorting routine, we 
might count the number of comparisons. As another example, if an algorithm consists 
of a single loop whose body executes in at most C steps, for some constant C, we might 
count the number of iterations of the loop. 


A reasonable definition of the size of input for Algorithm 4.1.2 that finds the largest 
value in a finite sequence is the number of elements in the input sequence. A reasonable 
definition of the execution time is the number of iterations of the while loop. With these 
definitions, the worst-case, best-case, and average-case times for Algorithm 4.1.2 for 
input of size n are each n — | since the loop is always executed n — | times. < 


Usually we are less interested in the exact best-case or worst-case time required 
for an algorithm to execute than we are in how the best-case or worst-case time grows 
as the size of the input increases. For example, suppose that the worst-case time of an 
algorithm is 


t(n) = 60n* +5n+1 


for input of size n. For large n, the term 60n* is approximately equal to #(n) (see Ta- 
ble 4.3.2). In this sense, f(n) grows like 60n?. 
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TABLE 4.3.2 m Comparing Growth of t(n) with 60n? 


n t(n) = 60n? + 5n+1 60n? 
10 6,051 6,000 
100 600,501 600,000 
1,000 60,005,001 60,000,000 
10,000 6,000,050,001 6,000,000,000 


If t(n) measures the worst-case time for input of size n in seconds, then 


Faye po ae 

7 60 60 
measures the worst-case time for input of size n in minutes. Now this change of units 
does not affect how the worst-case time grows as the size of the input increases but only 
the units in which we measure the worst-case time for input of size n. Thus when we 
describe how the best-case or worst-case time grows as the size of the input increases, 
we not only seek the dominant term [e.g., 60n? in the formula for ¢(n)], but we also may 
ignore constant coefficients. Under these assumptions, t(m) grows like n” as n increases. 
We say that ¢(n) is of order n° and write t(n) = O(n’), which is read “t(n) is theta of 
n°.” The basic idea is to replace an expression, such as t(n) = 60n? + 5n + 1, with a 
simpler expression, such as n’, that grows at the same rate as ¢(n). The formal definitions 
follow. 


Definition 4.3.2 > Let and g be functions with domain {1, 2, 3, ...}. 
We write 


fn) = O(g(n)) 


and say that f(n) is of order at most g(n) or f (n) is big oh of g(n) if there exists a positive 
constant C; such that 


lf(™)| < Cilg@)| 


for all but finitely many positive integers n. 
We write 


fn) = Q(g(n)) 


and say that f(n) is of order at least g(n) or f(n) is omega of g(n) if there exists a positive 
constant Cz such that 


f(m)| = Co |g()| 


for all but finitely many positive integers n. 
We write 


fin) = O(gn)) 


and say that f() is of order g(n) or f(n) is theta of g(n) if f(n) = O(g(n)) and f(n) = 
2(g(n)). < 


Definition 4.3.2 can be loosely paraphrased as follows: f(n) = O(g(n)) if, except 
for a constant factor and a finite number of exceptions, f is bounded above by g. We also 
say that g is an asymptotic upper bound for f. Similarly, f(m) = Q(g(n)) if, except for 
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a constant factor and a finite number of exceptions, f is bounded below by g. We also say 
that g is an asymptotic lower bound for f. Also, f(n) = ©(g(n)) if, except for constant 
factors and a finite number of exceptions, f is bounded above and below by g. We also 
say that g is an asymptotic tight bound for /. 

According to Definition 4.3.2, if f(n) = O(g(n)), all that we can conclude is that, 
except for a constant factor and a finite number of exceptions, f is bounded above by g, 
so g grows at least as fast as f. For example, if f(m) = n and g(n) = 2”, then f(n) = 
O(g(n)), but g grows considerably faster than f. The statement f(m) = O(g(n)) says 
nothing about a /ower bound for f. On the other hand, if f(n) = ©(g(n)), we can draw 
the conclusion that, except for constant factors and a finite number of exceptions, f is 
bounded above and below by g, sof and g grow at the same rate. Notice that n = O(2"), 
but n A O(2"). 


Example 4.3.3 Since 
60n? + 5n +1 < 60n> +5? +n? =66n? — foralln>1, 
we may take C; = 66 in Definition 4.3.2 to obtain 
60n* + 5n +1 = O(n’). 
Since 
60n? +5n+1>60n — foralln>1, 
we may take C) = 60 in Definition 4.3.2 to obtain 
60n* + 5n + 1 = Q(n’). 
Since 60n? + 5n + 1 = O(n’) and 60n? + 5n+ 1 = Q(n?), 


60n? + 5n+1 = O(n’). 4 
The method of Example 4.3.3 can be used to show that a polynomial in n of degree 
k with nonnegative coefficients is @(n"). [In fact, any polynomial in n of degree k is 


@(n*), even if some of its coefficients are negative. To prove this more general result, 
the method of Example 4.3.3 has to be modified.] 


Theorem 4.3.4 Let 
p(n) = agn* + ayn! +--+ ain + a9 
be a polynomial in n of degree k, where each q; is nonnegative. Then 


p(n) = O(n’). 


Proof We first show that p(n) = O(n‘). Let 


Cy = ag + ag-1 + +++ +a, +a. 


Then, for all n, 
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p(n) = ayn We an! +++++ayn+ao 


a,yn* aa ayn Ss ayn* + agn* 
k k 
= (a, +ay_) +--+ +a, +a0)n* = Cin". 


IA 


Therefore, p(n) = O(n‘). 
Next, we show that p(n) = Q (n*). For all n, 


k k-1 k k 
p(n) = agn™ + agin + +++ + ayn+ ao = an’ = Cn", 


where C) = a,. Therefore, p(n) = Qn*). 
Since p(n) = O(n") and p(n) = Q(n'), p(n) = O(n"). < 


Example 4.3.5 _ In this book, we let lgn denote log, n (the logarithm of n to the base 2). Since lgn < n 
ee — for all n > 1 (see Figure 4.3.14), 


2n+ 3lgn < 2n+3n=5n for alln > 1. 
Thus, 
2n+3lgn= O(n). 


Also, 2n + 3lgn > 2n, for alln > 1. Thus, 


2n+3lgn = Q(n). 


Therefore, 


2n+3lgn = O(n). < 


Figure 4.3.1 Growth of some common functions. 


'In Figure 4.3.1, the spacing on the y-axis is proportional to the logarithm of the number rather than to the 
number itself so that we can plot large y-values against smaller x-values. This y-axis scale is called a logarithmic 
scale. On the standard xy-graph, y = n would be a straight line; here it is curved. 


Example 4.3.6 


Example 4.3.7 
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Ifa > 1 and b > 1 (to ensure that log, a > 0), by the change-of-base formula for 
logarithms [Theorem B.37(e)], 


log, n = log, alog,n for alln > 1. 
Therefore, 
log, n < Clog,n for alln > 1, 


where C = log, a. Thus, log, n = O(log, n). 
Also, 


log, n = Clog,n for alln > 1; 
so log, n = Q<(log, n). Since log, n = O(log, n) and log, n = Q(log, n), we conclude 
that log, n = O(log, n). 
Because log, n = ©(log, 1), when using asymptotic notation we need not worry 
about which number is used as the base for the logarithm function (as long as the base is 


greater than 1). For this reason, we sometimes simply write log without specifying the 
base. 


If we replace each integer 1, 2,..., by nin the sum 1 +2+----+n, the sum does not 
decrease and we have 


14+2+---t+n<ntn+-:-tn=n-n=nr for alln > 1. (4.3.1) 
It follows that 
14+24+---4+n=O(r’). 


To obtain a lower bound, we might imitate the preceding argument and replace 
each integer 1,2,...,n by 1 inthe sum 1 +2+4---+ 27 to obtain 


142+4+---4n>1414---4+1l=n foralln > 1. 
In this case we conclude that 
14+24+---+2=Q(n), 


and while the preceding expression is true, we cannot deduce a ©-estimate for 
1+2-+----+n, since the upper bound n? and lower bound n are not equal. We must be 
craftier in deriving a lower bound. 

One way to get a sharper lower bound is to argue as in the previous paragraph, but 
first throw away approximately the first half of the terms, to obtain 


1424---+0> [@+)/21+---+@—1+n 
> [(n+D/2])+---+ [nt D/214+ [at 2/21 
2 
= [n/2][(n + 1)/2] = (n/2)(n/2) = _ (4.3.2) 


for all n > 1. We can now conclude that 


1+24---+n=Q(n’). 
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Therefore, 
142+4+---+n=0(n’). 4 
Example 4.3.8 If k isa positive integer and, as in Example 4.3.7, we replace each integer 1, 2, ..., 1 by 
—=>——  _n, we have 


Pe hehe? Se a oa el Sane fap 


for all n > 1; hence 
1* +4 2% thi + nk -_ O(n'*!), 
We can also obtain a lower bound as in Example 4.3.7: 


h42ky...4¢n8 > [mt 1)/21' +--- + (n— Dk +n 
> [iat 1/2} +--+ + [a+ 2/21 + [at 1/21 
= [n/2][(n + 1)/21* = (n/2)(n/2)* = ni! 241 


for all n > 1. We conclude that 
peas ..4 nha aint), 


and hence 


h42k4...4n* = @O(n't!), < 
Notice the difference between the polynomial 
an’ + ayn! +--+. + ain +ao 
in Theorem 4.3.4 and the expression 
e4ob4 ant 


in Example 4.3.8. A polynomial has a fixed number of terms, whereas the number of 

terms in the expression in Example 4.3.8 is dependent on the value of n. Furthermore, the 

polynomial in Theorem 4.3.4 is @(n*), but the expression in Example 4.3.8 is @(n‘*!). 
Our next example gives a theta notation for lg n!. 


Example 4.3.9 Use an argument similar to that in Example 4.3.7, to show that lgn! = O(nlgn). 
SOLUTION By properties of logarithms, we have 
Ign! =Ign+lg(n—1)+---+1]1g2+]1g1 


for all n > 1. Since lg is an increasing function, 


Ign+lg(n—1)+---+]lg24+l]lg1 <lgn+lgn+---+lgn+lgn=nlgn 


for alln > 1. We conclude that lgn! = O(nlgn). 


For all n > 4, we have 


Ign+lIg—1)4+---+]g2+1g1 > Ign+lgm—1)4+---+1g[(n+1)/2] 
> Igf(n+ 1/21 +---+1g[@+ 1/21 
= [n/2]lg[(v+ 1/2] 


Example 4.3.10 
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= (n/2)|g(n/2) 

= (n/2) [Ign — 1g 2] 

= (n/2)[Ugn)/2 + (dgn)/2 — 1] 
= (n/2)(1g n)/2 

=nign/4 


[since (lgn)/2 > 1 for all n > 4]. Therefore, lgn! = Q(nlgn). It follows that lgn! = 
O(nign). < 
Show that if f(n) = ©O(g(n)) and g(n) = O(A(n)), then f(n) = O(h(n)). 

SOLUTION Because f(n) = ©(g(n)), there are constants C; and C2 such that 


Cilg@| < If@| < Cole@| 


for all but finitely many positive integers n. Because g(n) = ©(h(n)), there are constants 
C3 and C4 such that 


C3|h(n)| < |g@a)| S< Cyl) 
for all but finitely many positive integers n. Therefore, 
CiC3|h(n)| < Cilg(@)| Sf) S Calg@| Ss CoCalh@)| 


for all but finitely many positive integers n. It follows that f(n) = O(h(n)). < 


We next define what it means for the best-case, worst-case, or average-case time 
of an algorithm to be of order at most g(7). 


Definition 4.3.11 > Ifan algorithm requires f(n) units of time to terminate in 
the best case for an input of size n and 


t(n) = O(g(n)), 


we say that the best-case time required by the algorithm is of order at most g(n) or that 
the best-case time required by the algorithm is O(g(n)). 

If an algorithm requires ¢(n) units of time to terminate in the worst case for an 
input of size n and 


t(n) = O(g(n)), 


we say that the worst-case time required by the algorithm is of order at most g(n) or that 
the worst-case time required by the algorithm is O(g(n)). 

If an algorithm requires ¢(7) units of time to terminate in the average case for an 
input of size n and 


t(n) = O(g(n)), 


we say that the average-case time required by the algorithm is of order at most g(n) or 
that the average-case time required by the algorithm is O(g(n)). < 


By replacing O by Q and “at most” by “at least” in Definition 4.3.11, we obtain 
the definition of what it means for the best-case, worst-case, or average-case time of 
an algorithm to be of order at least g(n). If the best-case time required by an algorithm 
is O(g(n)) and Q(g(n)), we say that the best-case time required by the algorithm is 
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Example 4.3.12 


Example 4.3.13 


Example 4.3.14 


©(g(n)). An analogous definition applies to the worst-case and average-case times of an 
algorithm. 


Suppose that an algorithm is known to take 60n?+5n+ 1 units of time to terminate in the 
worst case for inputs of size n. We showed in Example 4.3.3 that 60n7+5n+1 = @(n’). 
Thus the worst-case time required by this algorithm is ©(n7). < 


Find a theta notation in terms of n for the number of times the statement x = x + | is 
executed. 


1. fori=lton 
2 forj = 1 toi 
3. x=x+1 


SOLUTION First, ¢is set to 1 and, asj runs from 1 to 1, line 3 is executed one time. Next, 
iis set to 2 and, asj runs from | to 2, line 3 is executed two times, and so on. Thus the total 
number of times line 3 is executed is (see Example 4.3.7) 1+2+---+n= @(n?). Thus 
a theta notation for the number of times the statement x = x + 1 is executed is @(n’). 4 


Find a theta notation in terms of n for the number of times the statement x = x + | is 
executed: 


i=n 

while (i > 1) { 
x=x+1 
i= |i/2| 


OV Geko 


SOLUTION First, we examine some specific cases. Because of the floor function, the 
computations are simplified if n is a power of 2. Consider, for example, the case n = 8. 
At line 1, 7 is set to 8. At line 2, the condition i > | is true. At line 3, we execute the 
statement x = x + | the first time. At line 4, 7 is reset to 4 and we return to line 2. 

At line 2, the condition i > 1 is again true. At line 3, we execute the statement 
x =x-+1 the second time. At line 4, 7 is reset to 2 and we return to line 2. 

At line 2, the condition i > 1 is again true. At line 3, we execute the statement 
x =x-+ 1 the third time. At line 4, 7 is reset to | and we return to line 2. 

At line 2, the condition i > 1 is again true. At line 3, we execute the statement 
x =x-+ 1 the fourth time. At line 4, 7 is reset to 0 and we return to line 2. 

This time at line 2, the condition i > | is false. The statement x = x + | was 
executed four times. 

Now suppose that n is 16. At line 1, i is set to 16. At line 2, the condition i > 1 is 
true. At line 3, we execute the statement x = x + | the first time. At line 4, 7 is reset to 
8 and we return to line 2. Now execution proceeds as before; the statement x = x + | is 
executed four more times, for a total of five times. 

Similarly, if 7 is 32, the statement x = x + | is executed a total of six times. 

A pattern is emerging. Each time the initial value of n is doubled, the statement 
x =x+1 is executed one more time. More precisely, if n = 2", the statement x = x+ 1 
is executed k + 1 times. Since k is the exponent for 2, k = Ign. Thus if n = 2*, the 
statement x = x + 1 is executed 1 + Ign times. 


Example 4.3.15 
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If n is an arbitrary positive integer (not necessarily a power of 2), it lies between 
two powers of 2; that is, for some k > 1, 


Hen <2 
We use induction on k to show that in this case the statement x = x + | is executed k 
times. 

If k = 1, we have 

1=2''!<n<2' =2. 
Therefore, n is 1. In this case, the statement x = x + 1 is executed once. Thus the Basis 
Step is proved. 
Now suppose that if n satisfies 

wen < 2k, (4.3.3) 
the statement x = x + | is executed k times. We must show that if n satisfies 

<n <2, (4.3.4) 
the statement x = x + | is executed k + | times. 

Suppose that n satisfies (4.3.4). At line 1, is set ton. At line 2, the condition i > 1 
is true. At line 3, we execute the statement x = x + | the first time. At line 4, i is reset to 
[n/2| and we return to line 2. Notice that 

Wl <n/2 < 2. 
Because 2‘~! is an integer, we must also have 
2! << [n/2| < 2". 
By the inductive assumption (4.3.3), the statement x = x + 1 is executed k more times, 
for a total of k+ 1 times. The Inductive Step is complete. Therefore, if n satisfies (4.3.3), 
the statement x = x + | is executed k times. 
Suppose that n satisfies (4.3.3). Taking logarithms to the base 2, we have 
k-—1<lgn<k. 
Therefore, k, the number of times the statement x = x + | is executed, satisfies 
Ign<k<1+lgn. 
Because k is an integer, we must have k < 1+ |[lgn]. Furthermore, |lgn]| < k. It follows 


from the last two inequalities that k = 1 + [Ign]. Since 1 + [Ign] = ©O(1gn), a theta 
notation for the number of times the statement x = x + | is executed is O(1g 7). < 


Many algorithms are based on the idea of repeated halving. Example 4.3.14 shows 
that for size n, repeated halving takes time © (1g 1). Of course, the algorithm may do work 
in addition to the halving that will increase the overall time. 


Find a theta notation in terms of n for the number of times the statement x = x + | is 
executed. 
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Example 4.3.16 


1, -j=n 

2. while 7 > 1) { 
3. for i = 1 toj 
4. x=x+1 
5. f= U/2 

6. } 


SOLUTION Let t(n) denote the number of times we execute the statement x = x + 1. 
The first time we arrive at the body of the while loop, the statement x = x+ 1 is executed 
n times. Therefore t(n) > n for alln > 1 and t(n) = Q(n). 

Next we derive a big oh notation for f(m). After j is set to n, we arrive at the while 
loop for the first time. The statement x = x+ 1 is executed n times. At line 5, j is replaced 
by |n/2]; hence j < n/2. Ifj > 1, we will execute x = x + 1 at most n/2 additional 
times in the next iteration of the while loop, and so on. If we let k denote the number of 
times we execute the body of the while loop, the number of times we execute x = x + 1 
is at most 


Now 


n (1 - x) 1 
t(n) < ——~+ =2n(1-— ) <2n for alln > 1, 
L=5 2k 
so t(n) = O(n). Thus a theta notation for the number of times we execute x = x + 1 
is O(n). < 


Determine, in theta notation, the best-case, worst-case, and average-case times required 
to execute Algorithm 4.3.17, which follows. Assume that the input size is n and that the 
run time of the algorithm is the number of comparisons made at line 3. Also, assume 
that the n + 1 possibilities of key being at any particular position in the sequence or not 
being in the sequence are equally likely. 


SOLUTION The best-case time can be analyzed as follows. If s; = key, line 3 is exe- 
cuted once. Thus the best-case time of Algorithm 4.3.17 is O(1). 

The worst-case time of Algorithm 4.3.17 is analyzed as follows. If key is not in the 
sequence, line 3 will be executed n times, so the worst-case time of Algorithm 4.3.17 
is O(n). 

Finally, consider the average-case time of Algorithm 4.3.17. If key is found at the 
ith position, line 3 is executed i times; if key is not in the sequence, line 3 is executed n 
times. Thus the average number of times line 3 is executed is 


(+2+---+n)+n 


n+1 
Now 
eae 2 
d+2+ +n)+n ¥ n+n by (4.3.1) 
n+1 n+1 
_ n(n + 1) _ 


n+1 


Algorithm 4.3.17 
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Therefore, the average-case time of Algorithm 4.3.17 is O(n). Also, 


Lar? sees 2744 
Cie a Sees by (4.3.2) 
n+1 n+1 
2 
as _ 
n+l 4 


Therefore the average-case time of Algorithm 4.3.17 is Q(n). Thus the average-case 
time of Algorithm 4.3.17 is @(n). For this algorithm, the worst-case and average-case 
times are both ©(n). < 


Searching an Unordered Sequence 


Given the sequence s,..., s, anda value key, this algorithm returns the index of key. 
If key is not found, the algorithm returns 0. 


Input: 5), 52,..., S,,, and key (the value to search for) 
Output: The index of key, or if key is not found, 0 


1. linear_search(s, n, key) { 

2 fori— liton 

3 if (key == 5;) 

4. return i // successful search 
5 return 0 // unsuccessful search 

@ | 


Example 4.3.18 Matrix Multiplication and Transitive Relations If A is a matrix, we let Aj; denote the 


entry in row i, column j. The product of n x n matrices A and B (i.e., A and B have n 
rows and n columns) is defined as the n x n matrix C, where 


Algorithm 4.3.19, which computes the matrix product, is a direct translation of the pre- 
ceding definition. Because of the nested loops, it runs in time @ (n°). 

Recall (see the discussion following Theorem 3.5.6) that we can test whether a 
relation R on an n-element set is transitive by squaring its adjacency matrix, say A, and 
then comparing A” with A. The relation R is transitive if and only if, whenever the entry 
in row i, column j in A? is nonzero, the corresponding entry in A is also nonzero. Since 
there are n’ entries in A and A’, the worst-case time to compare the entries is O(n’). 
Using Algorithm 4.3.19 to compute A? requires time @(n*). Therefore, the overall time 
to test whether a relation on an n-element set is transitive, using Algorithm 4.3.19 to 
compute A’, is Q(n?). 

For many years it was believed that the minimum time to multiply two n x n matri- 
ces was @(n°); thus it was quite a surprise when a more efficient algorithm was discov- 
ered. Strassen’s algorithm (see [Johnsonbaugh: Section 5.4]) to multiply two n x n ma- 
trices runs in time @(n'87). Since lg 7 is approximately 2.807, Strassen’s algorithm runs 
in time approximately © (n**°’), which is asymptotically faster than Algorithm 4.3.19. 
An algorithm by Coppersmith and Winograd (see [Coppersmith]) runs in time @(n?*”°) 
and, so, is even asymptotically faster than Strassen’s algorithm. Since the product of 
two n x n matrices contains n? terms, any algorithm that multiplies two n x n matrices 
requires time at least Q(n*). At the present time, no sharper lower bound is known. <4 
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Algorithm 4.3.19 


TABLE 4.3.3 & Common 
Growth Functions 


Theta Form 


0d) 
O(lglgn) 
O(gn) 
@(n) 
O(nlgn) 
O(n?) 
O(n?) 
O(n’), k>1 
O(c"),c>1 
en!) 


Name 


Constant 
Log log 
Log 

Linear 
nlogn 
Quadratic 
Cubic 
Polynomial 
Exponential 
Factorial 


Matrix Multiplication 


This algorithm computes the product C of the n x n matrices A and B directly from 
the definition of matrix multiplication. 


Input: A, B,n 
Output: C, the product of A and B 


matrix_product(A, B, n) { 
fori—" lito 
fori — tome 
G7 —0 
fork =1ton 
Cj = Gi + Aix * By 
} 


return C 


The constants that are suppressed in the theta notation may be important. Even 
if for any input of size n, algorithm A requires exactly Cin time units and algorithm B 
requires exactly Cn” time units, for certain sizes of inputs algorithm B may be superior. 
For example, suppose that for any input of size n, algorithm A requires 300n units of 
time and algorithm B requires 5n units of time. For an input size of n = 5, algorithm 
A requires 1500 units of time and algorithm B requires 125 units of time, and thus al- 
gorithm B is faster. Of course, for sufficiently large inputs, algorithm A is considerably 
faster than algorithm B. 

A real-world example of the importance of constants in the theta notation is pro- 
vided by matrix multiplication. Algorithm 4.3.19, which runs in time © (n°), is typically 
used to multiply matrices even though the Strassen and Coppersmith-Winograd algo- 
rithms (see Example 4.3.18), which run in times © (n*8°7) and O(n?>"), are asymptoti- 
cally faster. The constants in the Strassen and Coppersmith-Winograd algorithms are so 
large that they are faster than Algorithm 4.3.19 only for very large matrices. 

Certain growth functions occur so often that they are given special names, as 
shown in Table 4.3.3. The functions in Table 4.3.3, with the exception of @(n*), are 
arranged so that if O(f()) is above ©(g(n)), then f(m) < g(n) for all but finitely many 
positive integers n. Thus, if algorithms A and B have run times that are O(f(n)) and 
O(g(n)), respectively, and O(f(n)) is above ©(g(n)) in Table 4.3.3, then algorithm A is 
more time-efficient than algorithm B for sufficiently large inputs. 

It is important to develop some feeling for the relative sizes of the functions in 
Table 4.3.3. In Figure 4.3.1 we have graphed some of these functions. Another way to 
develop some appreciation for the relative sizes of the functions f(n) in Table 4.3.3 is to 
determine how long it would take an algorithm to terminate whose run time is exactly 
J (n). For this purpose, let us assume that we have a computer that can execute one step in 
1 microsecond (10~° sec). Table 4.3.1 shows the execution times, under this assumption, 
for various input sizes. Notice that it is practical to implement an algorithm that requires 
2” steps for an input of size n only for very small input sizes. Algorithms requiring n? or 
n steps also become impractical to implement, but for relatively larger input sizes. Also, 
notice the dramatic improvement that results when we move from n’ steps to 71g 7 steps. 

A problem that has a worst-case polynomial-time algorithm is considered to have a 
“good” algorithm; the interpretation is that such a problem has an efficient solution. Such 
problems are called feasible or tractable. Of course, if the worst-case time to solve the 
problem is proportional to a high-degree polynomial, the problem can still take a long 
time to solve. Fortunately, in many important cases, the polynomial bound has small 
degree. 
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A problem that does not have a worst-case polynomial-time algorithm is said to be 
intractable. Any algorithm, if there is one, that solves an intractable problem is guar- 
anteed to take a long time to execute in the worst case, even for modest sizes of the 
input. 

Certain problems are so hard that they have no algorithms at all. A problem for 
which there is no algorithm is said to be unsolvable. A large number of problems are 
known to be unsolvable, some of considerable practical importance. One of the earliest 
problems to be proved unsolvable is the halting problem: Given an arbitrary program 
and a set of inputs, will the program eventually halt? 

A large number of solvable problems have an as yet undetermined status; they are 
thought to be intractable, but none of them has been proved to be intractable. (Most of 
these problems belong to the class of NP-complete problems; see [Johnsonbaugh] for 
details.) An example of an NP-complete problem is: 


Given a collection C of finite sets and a positive integer k < |C|, does C contain at 
least A mutually disjoint sets? 


Other NP-complete problems include the traveling-salesperson problem and the 
Hamiltonian-cycle problem (see Section 8.3). 

NP-complete problems have efficient (i.e., polynomial-time) algorithms to check 
whether a proposed solution is, in fact, a solution. For example, given a collection C of 
finite sets and k sets in C, it is easy and fast to check whether the & sets are mutually 
disjoint. (Just check each pair of sets!) On the other hand, no NP-complete problem is 
known to have an efficient algorithm. For example, given a collection C of finite sets, 
finding k mutually disjoint sets in C is, in general, difficult and time consuming. NP- 
complete problems also have the property that if any one of them has a polynomial-time 
algorithm, then all NP-complete problems have polynomial-time algorithms. 


4.3 Problem-Solving Tips 


= To derive a big oh notation for an expression f (7) directly, you must find a constant 
C, and a simple expression g() (e.g., n, nlgn, n”) such that lf(a)| < Cilg(a)| for 
all but finitely many n. Remember you’re trying to derive an inequality, not an 
equality, so you can replace terms in f(m) with other terms if the result is Jarger 
(see, e.g., Example 4.3.3). 


= To derive an omega notation for an expression f(n) directly, you must find a con- 
stant Cz and a simple expression g(7) such that |f(m)| > C2|g()| for all but finitely 
many n. Again, you're trying to derive an inequality so you can replace terms in 
Jf (n) with other terms if the result is smaller (again, see Example 4.3.3). 


= To derive a theta notation, you must derive both big oh and omega notations. 


= Another way to derive big oh, omega, and theta estimates is to use known 


results: 
Expression Name Estimate Reference 
agn® + agin’! 4... Polynomial O(n*) Theorem 4.3.4 
+ajn+ag 
14+2+---+n Arithmetic Sum (Case k = 1 (2) (n?) Example 4.3.7 
for Next Entry) 
42k 4... 4 nk Sum of Powers @(nt!) Example 4.3.8 


Ign! log n Factorial O(nlgn) Example 4.3.9 
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= To derive an asymptotic estimate for the time of an algorithm, count the number 
of steps ¢(n) required by the algorithm, and then derive an estimate for t(m) as 
described previously. Algorithms typically contain loops, in which case, deriving 
t(n) requires counting the number of iterations of the loops. 


4.3 Review Exercises 


1 
2. 
2 
4 
5 
6 


To what does “analysis of algorithms” refer? 7. Define f(n) = Q(g(n)). What is this notation called? 


What is the worst-case time of an algorithm? 8. Give an intuitive interpretation of how f and g are related if 


f(r) = Q(g(n)). 
9. Define f(n) = ©(g(n)). What is this notation called? 


What is the best-case time of an algorithm? 


10. Give an intuitive interpretation of how f and g are related if 


fn) = O(g(n)). 


. What is the average-case time of an algorithm? 
. Define f(n) = O(g(n)). What is this notation called? 


Give an intuitive interpretation of how f and g are related if 


f(r) = O(8(n)). 


Select a theta notation from Table 4.3.3 for each expression in 20. fori =1to2n 21. fori=lton 
Exercises 1-13. forj =lton for j = | to |i/2] 

1. 6n+1 2. 2n? +1 x=x+1 x=x+1 

3, 6n? +4 12n2 aca 4. 3n2 +2n Ign 22. fori = ton 23. fori =! ton 

5. 21 Apps tet 6 f forj = 1ton forj =lton 

2 Ete ee cel ey fork =1ton fork = 1 toi 
7. 24+446+-+-4+2n 8. (6n + 1)? x=x+] x=xt 
1 i — 
9. (6n+4)(1 +g) ig = )(n + 3) 24. fori =lton 25. jan 
n+2 for j = 1 toi while (j > 1) { 
ai (n* +1gn)(n + 1) fork = 1 toj fori = 1 toj 
P n+r x=x4+l1 x=x+l1 
= |j/3] 
12. 24+448416+---+2" Us 
13. Ig[(2n)!] 
26. i=n 

In Exercises 14-16, select a theta notation for f(n) + g(n). while (i > 1) { 
14. fm) = 01), gn) = OW”) forj=1ton 
15. fi) =6n2 4272 +4, 2(n) = @Glen) ’ Son), ; 
16. f(r) =O”), g(n) = O(n") } 
In Exercises 17-26, select a theta notation from among 27. Find a theta notation for the number of times the statement 


O(1), 
O(n’), 


O(lgn), 
O(n), 


O(n), Olgn), 
@(2"), or O(n!) 


for the number of times the statement x = x + | is executed. 


17. fori = 1 to 2n 


x =x+ 1 is executed. 


i=2 
while (i < n) { 
i=? 


i x=x+4+1 
x=x+ } 
18. i=1 19. fori=l1ton 
while (i < 2n) { forj=lton . or . 
ead xy=xtl 28. Let t(n) be the total number of times that i is incremented and j 
i=it2 is decremented in the following pseudocode, where a), a2, ... 


} 


is a sequence of real numbers. 


29. 


30. 


31. 


r=1 
j=n 
while (i < j) { 
while (i < j A aj < 0) 
i=i+l 
while (i < j A aj = 0) 
j=j-1 
if i <j) 
swap (dij, aj) 


} 


Find a theta notation for t(n). 


Find a theta notation for the worst-case time required by the 
following algorithm: 


iskey(s,n, key) { 
fori=lton—1 
forj=i+lton 
if (sj + sj == key) 
return | 
else 
return 0 


} 


In addition to finding a theta notation in Exercises 1-29, prove 
that it is correct. 


Find the exact number of comparisons (lines 10, 15, 17, 
24, and 26) required by the following algorithm when n 
is even and when n is odd. Find a theta notation for this 
algorithm. 


Input: s1,52,..., 5,” 
Output: Jarge (the largest item in 5;, 52,...,5 Sn)s 
small (the smallest item in 51, .52,..., 5 Sin) 
1. large_small(s, n, large, small) { 
2 if (n == 1) { 
) large = s1 
4 small = s1 
3: return 
6 } 
7 m = 2[|n/2] 
8. i=1 
9. while (i < m — 1) { 
10. if (sj > si41) 
1. swap(si, Si+1) 
12. i=i+2 
13: } 
14. if (n > m) { 
15. if (Sm-1 > Sn) 
16. swap(Sm—1, Sn) 
a if (8) > Sm) 
18. swap(Sm, Sn) 
19, } 
20. small = s, 
21% large = sz 
22. i=3 


23. while (i < m — 1) { 
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24. if (s; < small) 
25; small = 5; 
26. if (si41 > large) 
27. large = Ssi41 
28. i=i+2 

29. } 

30. } 


32. This exercise shows another way to guess a formula for 1 + 
2+---4+n. 
Example 4.3.7 suggests that 


142+4---+n=An? +Bn+C for all n, 


for some constants A, B, and C. Assuming that this is true, 
plug in n = 1, 2,3 to obtain three equations in the three un- 
knowns A, B, and C. Now solve for A, B, and C. The resulting 
formula can now be proved using mathematical induction (see 
Section 2.4). 


33. Suppose that a > 1 and that f(n) = O(log, n). Show that 
f(r) = O(lgn). 
34. Show that n! = O(n"). 
35. Show that 2” = O(n!). 
36. By using an argument like the one shown in Examples 4.3.7— 
4.3.9 or otherwise, prove that Sie ilgi = O(n? Ign). 
+37. Show that n’*! = 0(2"). 


38. Show that Ig(n* + c) = @(1gn) for every fixed k > 0 and 
c>0. 


39. Show that if is a power of 2, say n = 2k then 


k 
x Ig(n/2') = © (Ig? n). 


i=0 


40. Suppose that f(n) = O(g(n)), and f(n) > 0 and g(n) > 0 for 
all n > 1. Show that for some constant C, f(n) < Cg(n) for 
alln > 1. 


41. State and prove a result for Q similar to that for Exercise 40. 
42. State and prove a result for © similar to that for Exercises 40 
and 41. 


Determine whether each statement in Exercises 43-68 is true or 
false. If the statement is true, prove it. If the statement is false, give 
a counterexample. Assume that the functions f,g, and h take on 
only positive values. 

43. n” = O(2”) 

44, 2+ sinn = O(2+ cosn) 

45. (2n)* = O(n’) 

46. (2n)* = Q(n?) 

47. (2n)? = O(n?) 

48. 2?" = O(2") 

49, 27" = Q(2") 
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50. 
51. 
52. 
53. 
54. 
55. 
56. 
57, 
58. 
59. 
60. 


61. 
62. 
63. 


64. 
65. 
66. 
67. 
68. 
69. 
70. 


x71. 


«72, 


x73. 
74. 


75. 
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2" = O(2") 

n! = O((n+ 1)!) 

n! = Q((n+ 1)!) 

n! = O((n+ 1)!) 

Ig(2n)? = O(Ign?) 

Ig(2n)* = Q (ign?) 

Ig(2n)? = @(1gn?) 

Ig 22” = O(1g 2”) 

1g 2?” = Q(1g 2”) 

Ig 2?” = @(1g 2”) 

If f(n) = O(h(n)) and g(n) = O(h(n)), then f(n) + g(n) = 
O(A(n)). 

If f(n) = O(g(n)), then cf(n) = O(g(n)) for any c £0. 
If f(n) = @(g(n)), then 2° = O(28), 

If f(n) = O(g(n)), then lgf(n) = O(lg g(n)). Assume that 
f(n) = land g(n) > 1 foralln = 1,2,.... 

If f(n) = O(g(n)), then g(n) = O(f(n)). 

If f(n) = O(g(n)), then g(n) = Q(f(n)). 

If f(n) = O(g(n)), then g(n) = O(f(n)). 

fm) + g(n) = O(h(n)), where h(n) = max{f(n), g(n)} 
fm) + g(n) = O(h(n)), where h(n) = min{f(n), g(n)} 
Write out exactly what f(n) 4 O(g(n)) means. 


What is wrong with the following argument that purports to 
show that we cannot simultaneously have f(n) # O(g(n)) and 
gin) # O(f(n))? 

If f(n) # O(g(n)), then for every C > 0, |f(n)| > 
Clg(n)|. In particular, [f(m)| > 2ig(n)|. If gm) # OF), 
then for every C > 0, |g(n)| > C|f()|. In particular, |g(n)| > 
2\f(n)|. But now 


\f(a)| > 2\g(n)| > 4if@)|. 
Cancelling [f(n)| gives 1 > 4, which is a contradiction. 
Therefore, we cannot simultaneously have f(n) #4 O(g(n)) 
and g(n) 4 O(f(n)). 
Find functions f and g satisfying 


f(n) € O(g(n)) and g(n) - O(f(n)). 


Give an example of increasing positive functions f and g de- 
fined on the positive integers for which 


and 


f(r) # O(g(n)) g(n) F O(f(n)). 
Prove that n“ = O(c") for allk = 1,2,...andc > 1. 


Find functions f, g, h, and ¢ satisfying 


fm) = O(n), h(n) = Ot), 
f(r) — hn) # O(gm) — t(n)). 
Suppose that the worst-case time of an algorithm is ©(n). 


What is the error in the following reasoning? Since 2n = 
©(n), the worst-case time to run the algorithm with input of 


76. 


77. 


78. 


79. 


80. 


81. 


82. 


size 2n will be approximately the same as the worst-case time 
to run the algorithm with input of size n. 


Does f(n) = O(g(n)) define an equivalence relation on the set 
of real-valued functions on {1, 2, ...}? 


Does f(n) = ©(g(n)) define an equivalence relation on the set 
of real-valued functions on {1, 2,...}? 


[Requires the integral] 

(a) Show, by consulting the figure, that 
1 
2 

(b) Show, by consulting the figure, that 


1 1 
+i+---+- <log,n. 
3 n 


1 
n—1° 


1 
loge as Usb 5 bese 


(c) Use parts (a) and (b) to show that 


1 
+--+ — = O(lgn). 
n 


[Requires the integral] Use an argument like the one shown in 
Exercise 78 to show that 


m-+1 m+1 
n < 1” f gm | erin | nl” ee (n+ 1) 
m+1 m+1 
where m is a positive integer. 
By using the formula 
1 1 n 
prt —q't ie 
Ee ab O<a<b 
b-a 
i=0 
or otherwise, prove that 
prt = qt 
5 < (n+ 1)b" O0<a<b. 


Take a = 14+ 1/(n+ 1) and b = 1 +4 1/n in the inequal- 
ity of Exercise 80 to prove that the sequence {(1 + 1/n)"} is 
increasing. 

Take a = | and b = 1 + 1/(2n) in the inequality of Exercise 
80 to prove that 


83. 


84. 


85. 
86. 


87. 


88. 


89. 


90. 


for all n > 1. Use the preceding exercise to conclude that 


1 n 
(: + >) <4 
n 
for alln > 1. 


The method used to prove the results of this exercise 
and its predecessor is apparently due to Fort in 1862 (see 
[Chrystal, vol. I], page 77]). 


By using the preceding two exercises or otherwise, prove that 


<Ig(n+1)-Ign < 


n 


for alln > 1. 
Use the preceding exercise to prove that 


n 
> . = (Ign). 
i=l 

(Compare with Exercise 78.) 

Prove that the sequence {n!/” }7o., is decreasing. 


Prove that if 0 < a < b, then 
prtl 
~ b-a 
Find appropriate values for a and b in the inequality in the pre- 


ceding exercise to prove that the sequence {(1 — 1/n)"}°° | is 
increasing and bounded above by 4/9. 


= qth 
> (n+ lja". 


By using the result of the preceding exercise, or otherwise, 
prove that the sequence {(1 + 1/n)"*+! }pc_, is decreasing. 

By using the result of the preceding exercise, or otherwise, 
prove that 


2 
1 1) —lgn < —— 
gin + 1) gn sy 


foralln > 1. 
What is wrong with the following “proof” that any algorithm 
has a run time that is O(n)? 


We must show that the time required for an input of size 
nis at most a constant times n. 


Basis Step 


Suppose that n = 1. If the algorithm takes C units of time 
for an input of size 1, the algorithm takes at most C - | units of 
time. Thus the assertion is true for n = 1. 


Inductive Step 


Assume that the time required for an input of size n is at most 
C’n and that the time for processing an additional item is C”. 
Let C be the maximum of C’ and C’. Then the total time re- 
quired for an input of size n + 1 is at most 


Cn+C"” <Cn+C=Cin+]). 


The Inductive Step has been verified. 
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By induction, for input of size n, the time required is at 
most a constant time n. Therefore, the run time is O(n). 


In Exercises 91-96, determine whether the statement is true or 
false. If the statement is true, prove it. If the statement is false, give 
a counterexample. Assume that f and g are real-valued functions 
defined on the set of positive integers and that g(n) 0 forn > 1. 
These exercises require calculus. 


ot. 


92. 


93. 


94. 


95. 


96. 


x97, 


98. 


93, 


100. 


If 
i fr) 
im —— =0, 
noo g(n) 
then f(n) = O(g(n)). 
If 
ki fa 
im —— = 
noo g(n) 
then f(n) = O(g(n)). 
If 
lim i) = C550; 
n>oo 9(n) 
then f(n) = O(g(n)). 
If 
fin) _ 
err aa 
then f(n) = O(g(n)). 
If f(n) = O(g(n)), then 
lim fm) 
n>oo g(n) 


exists and is equal to some real number. 
If f(n) = ©(g(n)), then 


im £2 

im —— 
n->0o g(n) 

exists and is equal to some real number. 


Use induction to prove that 
non 
Ign! > —Ig-. 
gn = 7185 
[Requires calculus] Let Inx denote the natural logarithm 


(log, x) of x. Use the integral to obtain the estimate 


n 
ninn—-n< y Ink = Inn!, 
k=l 


n>1. 


Use the result of Exercise 98 and the change-of-base formula 
for logarithms to obtain the formula 

nign—nlge < lgnl, n>1. 
Deduce 


ignt > 1g? 
eae 


from the inequality of Exercise 99. 
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Problem-Solving Corner 


Design and Analysis of an Algorithm 


Problem 


Develop and analyze an algorithm that returns the 
maximum sum of consecutive values in the numeri- 
cal sequence s;,..., S,. In mathematical notation, the 
problem is to find the maximum sum of the form 
Sj + Sj41 +--+ + 5;. Example: If the sequence is 


YO — 0 Bil =—3 i4 lO = 44 333 


the algorithm returns 115—the sum of 


Pl = 8) Métis BI, 


If all the numbers in a sequence are negative, the maxi- 
mum sum of consecutive values is defined to be 0. (The 
idea is that the maximum of 0 is achieved by taking an 
“empty” sum.) 


Attacking the Problem 


In developing an algorithm, a good way to start is to 
ask the question, “How would I solve this problem 
by hand?” At least initially, take a straightforward ap- 
proach. Here we might just list the sums of all con- 
secutive values and pick the largest. For the example 
sequence, the sums are as follows: 


=2Zil %, 


Finding a Solution 


We begin by writing pseudocode for the straightfor- 
ward algorithm that computes all consecutive sums and 
finds the largest: 

Input: 


Mls coe sip 


Output: max 


max_sumI(s, n) { 
// sum; is the sum s; + --- + Sj. 
fori=1ton{ 
forj =1toi-1l 
SUM; = SUM; j—- + 5; 
SUM = Sj 


} 


// step through sum,; and find the maximum 


max = 0 
form—s lato 
forj = 1toi 


if (sum; > max) 
max = sum; 
return max 


i 1 2 3 4 5 6 7 8 9 10 11 12 
1 2H] 

2 33 6 

3 17 44 50 

4 4 23} =f) 21 

5) 1 —26 = 3) 18 =3 

6 1S —12 —18 32 11 14 

7 31 4 —2 48 27 30 16 

8 23} —4 —10 40 19 ay) 8 —8 

9 65 38 sy 82 61 64 50 34 42 

10 98 Tl 65 115 94 Oil 83 67 75 33) 

11 Wi 50 44 94 73 76 62 46 54 12 —21 

12 86 5) 53 103 82 85 71 5) 63 21 =112 9 


The entry in column j, row i, is the sum s; + 
-.+-+ 5;. For example, the entry in column 4, row 7, is 
48—the sum 


s7 = 214-34 144 16=48. 


S4)41-95 1 S61 


By inspection, we find that 115 is the largest sum. 


The first nested for loops compute the sums 
SUM, = Sj +--+ + Sj. 
The computation relies on the fact that 
sum; = S;+---+8;=5;+---+ 571 +5; 


= sum; j_1 + Sj. 


Problem-Solving Corner: Design and Analysis of an Algorithm 


The second nested for loops step through sum; and find 
the largest value. 

Since each of the nested for loops takes time 
O(n), max_sumI1’s time is O(n’). 

We can improve the actual time, but not the 
asymptotic time, of the algorithm by computing the 
maximum within the same nested for loops in which 
we compute sum: 


LOUIE Silo cons Sa 


Output: max 


max_sum2(s, n) { 
// sum; is the sum s; + --- + Sj. 
max = 0 
fori= 1 ton { 
forj =1ltoi—1{ 
SUM; = SUM; j;—1 + S; 
if (sum; > max) 
max = sum 
} 
SUNY = Si 
if (sum; > max) 
max = sum 
} 


return max 


Since the nested for loops take time O(n’), 
max_sum2’s time is @(n?). To reduce the asymptotic 
time, we need to take a hard look at the pseudocode to 
see where it can be improved. 

Two key observations lead to improved time. First, 
since we are looking only for the maximum sum, there 
is no need to record all of the sums; we will store only 
the maximum sum that ends at index 7. Second, the line 


SUM, = SUM; | + 5; 


shows how a consecutive sum that ends at index 7 — | 
is related to a consecutive sum that ends at index i. 
The maximum can be computed by using a similar 
formula. If swm is the maximum consecutive sum that 
ends at index i— 1, the maximum consecutive sum that 
ends at index i is obtained by adding s; to sum pro- 
vided that sum +s; 1s positive. (If some sum of con- 
secutive terms that ends at index i exceeds sum +5;, 
we could remove s; and obtain a sum of consecutive 
terms ending at index i — | that exceeds sum, which is 
impossible.) If swm+s; < 0, the maximum consecutive 
sum that ends at index / is obtained by taking no terms 
and has value 0. Thus we may compute the maximum 
consecutive sum that ends at index i by executing 


if (sum +s; > 0) 
sum = sum + S$; 
else 
sum = 0 


Formal Solution 
WWE Gin soon 


Output: max 


max_sum3(s, n) { 
// max is the maximum sum seen so far. 
// After the ith iteration of the for 
// loop, sum is the largest consecutive 
// sum that ends at index i. 


max = 0 
sum = 0 
jor == ios { 


if (sum + 5; > 0) 
sum = sum + 5; 

else 
sum = 0 

if (sum > max) 
max = sum 


} 


return max 


Since this algorithm has a single for loop that runs 
from | ton, max_sum3’s time is ©(n). The asymptotic 
time of this algorithm cannot be further improved. To 
find the maximum sum of consecutive values, we must 
at least look at each element in the sequence, which 
takes time ©(n). 


Summary of Problem-Solving Techniques 


= In developing an algorithm, a good way to start 
is to ask the question, “How would I solve this 
problem by hand?” 


= In developing an algorithm, initially take a 
straightforward approach. 


= After developing an algorithm, take a close look 
at the pseudocode to see where it can be im- 
proved. Look at the parts that perform key com- 
putations to gain insight into how to enhance the 
algorithm’s efficiency. 


= As in mathematical induction, extend a solution 
of a smaller problem to a larger problem. (In this 
problem, we extended a sum that ends at index 
i — 1 to a sum that ends at index 7.) 
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= Don’t repeat computations. (In this problem, we matching in digital images. The original problem was 
extended a sum that ends at index i — 1 toa sum to find the maximum sum in a rectangular submatrix 
that ends at index i by adding an additional term of ann x n matrix of real numbers. 
rather than by computing the sum that ends at 
index 7 from scratch. This latter method would Exercises 


have meant recomputing the sum that ends at 


1. Modify max_sum3 so that it computes not only the 
index i — 1.) 


maximum sum of consecutive values but also the in- 
dexes of the first and last terms of a maximum-sum 


Comments subsequence. If there is no maximum-sum subse- 
According to [Bentley], the problem discussed in this quence (which would happen, for example, if all of 
section is the one-dimensional version of the origi- the values of the sequence were negative), the algo- 
nal two-dimensional problem that dealt with pattern rithm should set the first and last indexes to zero. 
4.4 Recursive Algorithms 
A recursive function (pseudocode) is a function that invokes itself. A recursive algo- 
rithm is an algorithm that contains a recursive function. Recursion is a powerful, elegant, 
Go Online and natural way to solve a large class of problems. A problem in this class can be solved 
For more on recursion, see using a divide-and-conquer technique in which the problem is decomposed into prob- 
goo.gl/ZwpPlu lems of the same type as the original problem. Each subproblem, in turn, is decomposed 


further until the process yields subproblems that can be solved in a straightforward way. 
Finally, solutions to the subproblems are combined to obtain a solution to the original 
problem. 


Example 4.4.1 Recall that ifn > 1,n! = n(n —1)---2-1, and 0! = 1. Notice that if n > 2, n factorial 
————__ can be written “in terms of itself” since, if we “peel off” n, the remaining product is 
simply (n — 1)!; that is, 


TABLE 4.4.1 m Decomposing haan Teed aactnn i 


the Factorial Problem 


Problem Simplified Problem For example, 
1—5-4.3-2-1—5-4! 
5! 5.4! 5!=5-4-3-2-1=5-4!. 
4! 4-3! The equation 
3! 3-2! 
!=n-(n—1)! 
2! 2-1! n=n-(n—I)!, (4.4.1) 
1! 1-0! which happens to be true even when n = 1, shows how to decompose the original prob- 
0! None lem (compute 7!) into increasingly simpler subproblems [compute (” — 1)!, compute 
(n — 2)!,...] until the process reaches the straightforward problem of computing 0!. 
TABLE 4.4.2 = Combining The solutions to these subproblems can then be combined, by multiplying, to solve the 


original problem. 
For example, the problem of computing 5! is reduced to computing 4!; the problem 


of computing 4! is reduced to computing 3!; and so on. Table 4.4.1 summarizes this 
Problem Solution process. 


Subproblems of the Factorial 
Problem 


0! 1 Once the problem of computing 5! has been reduced to solving subproblems, the 
1! 1-0!=1 solution to the simplest subproblem can be used to solve the next simplest subproblem, 
2! 2-1!=2 and so on, until the original problem has been solved. Table 4.4.2 shows how the sub- 
3! 3-2!=3-2=6 problems are combined to compute 5!. < 
4! 4-3!=4-6=24 

5! 5-4! = 5-24 = 120 


Next, we write a recursive algorithm that computes factorials. The algorithm is a 
direct translation of equation (4.4.1). 


Algorithm 4.4.2 


Theorem 4.4.3 
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Computing n Factorial 
This recursive algorithm computes n!. 


Input: n, an integer greater than or equal to 0 


Output: n! 
1. factorial(n) { 
2 nit (7 ——30)) 
3k return | 
4 return n x factorial(n — 1) 
a Ih 


We show how Algorithm 4.4.2 computes n! for several values of n. If n = 0, at 
line 3 the function correctly returns the value 1. 

Ifn = 1, we proceed to line 4 since n £ 0. We use this function to compute 0!. We 
have just observed that the function computes | as the value of 0!. At line 4, the function 
correctly computes the value of 1!: 


minis 1OfSici=1 


If n = 2, we proceed to line 4 since n 4 0. We use this function to compute 1!. We 
have just observed that the function computes 1 as the value of 1!. At line 4, the function 
correctly computes the value of 2!: 


mia= ls =2 = 2 


Ifn = 3 we proceed to line 4 since n £ 0. We use this function to compute 2!. We 
have just observed that the function computes 2 as the value of 2!. At line 4, the function 
correctly computes the value of 3!: 


n+(n— 1)! =3-2! =3-2=6. 


The preceding arguments may be generalized using mathematical induction to 
prove that Algorithm 4.4.2 correctly returns the value of n! for any nonnegative 
integer n. 


Algorithm 4.4.2 returns the value of n!, n > 0. 


Proof 


Basis Step (n = 0) 
We have already observed that if n = 0, Algorithm 4.4.2 correctly returns the value of 
0! (1). 


Inductive Step 


Assume that Algorithm 4.4.2 correctly returns the value of (n — 1)!, n > 0. Now sup- 
pose that n is input to Algorithm 4.4.2. Since n ¢ 0, when we execute the function in 
Algorithm 4.4.2 we proceed to line 4. By the inductive assumption, the function cor- 
rectly computes the value of (n — 1)!. At line 4, the function correctly computes the 
value (n — 1)!-n=nl!. 

Therefore, Algorithm 4.4.2 correctly returns the value of n! for every integer 
n> 0. < 
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Algorithm 4.4.4 
Go Online 


If executed by a computer, Algorithm 4.4.2 would typically not be as efficient as 
a nonrecursive version because of the overhead of the recursive calls. 

There must be some situations in which a recursive function does not invoke itself; 
otherwise, it would invoke itself forever. In Algorithm 4.4.2, if n = 0, the function does 
not invoke itself. We call the values for which a recursive function does not invoke itself 
the base cases. To summarize, every recursive function must have base cases. 

We have shown how mathematical induction may be used to prove that a recur- 
sive algorithm computes the value it claims to compute. The link between mathematical 
induction and recursive algorithms runs deep. Often a proof by mathematical induction 
can be considered to be an algorithm to compute a value or to carry out a particular 
construction. The Basis Step of a proof by mathematical induction corresponds to the 
base cases of a recursive function, and the Inductive Step of a proof by mathematical 
induction corresponds to the part of a recursive function where the function calls itself. 

In Example 2.4.7, we gave a proof using mathematical induction that, given an 
n X n deficient board (a board with one square removed), where n is a power of 2, we 
can tile the board with right trominoes (three squares that form an “L”; see Figure 2.4.4). 
We now translate the inductive proof into a recursive algorithm to construct a tiling by 
right trominoes of an n x n deficient board where n is a power of 2. 


Tiling a Deficient Board with Trominoes 


This algorithm constructs a tiling by right trominoes of an n x n deficient board where 


For a C program implementing yisa power of 2. 


this algorithm, see 
goo.gl/rZYmnk 


Example 4.4.5 


Input: n, a power of 2 (the board size); and the location L of the missing 
square 


Output: A tiling of ann x n deficient board 


Te ale(n vir 
2: tt Q=S=){ 
// the board is a right tromino T 
3h, tile with T 
4. return 
3 } 
6. divide the board into four (n/2) x (n/2) boards 
The rotate the board so that the missing square is in the upper-left quadrant 
8. place one right tromino in the center // as in Figure 2.4.5 


// consider each of the squares covered by the center tromino as 
// missing, and denote the missing squares as m1, m2, m3, m4 
9. tile(n/2, m,) 
10. tile(n/2, mz) 
iil. tile(n/2, m3) 
12s tile(n/2, m4) 


Using the method of the proof of Theorem 4.4.3, we can prove that Algorithm 4.4.4 
is correct (see Exercise 4). 
We present one final example of a recursive algorithm. 


A robot can take steps of 1 meter or 2 meters. We write an algorithm to calculate the 
number of ways the robot can walk n meters. As examples: 
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Distance Sequence of Steps Number of Ways to Walk 
1 1 1 
2 1,1 or 2 2 
3 1,1,1 or 1,2 of 2,1 3 
4 1,010 or 11,2 5 
or 1,.2,.1- or .2,.1,.1. “er 2,2 


Let walk(n) denote the number of ways the robot can walk n meters. We have 
observed that walk(1) = 1 and walk(2) = 2. Now suppose that n > 2. The robot can 
begin by taking a step of | meter or a step of 2 meters. If the robot begins by taking a 
1-meter step, a distance of n — 1 meters remains; but, by definition, the remainder of the 
walk can be completed in walk(n — 1) ways. Similarly, if the robot begins by taking a 
2-meter step, a distance of n — 2 meters remains and, in this case, the remainder of the 
walk can be completed in walk(n — 2) ways. Since the walk must begin with either a 
1-meter or a 2-meter step, all of the ways to walk n meters are accounted for. We obtain 
the formula 


walk(n) = walk(n — 1) + walk(n — 2). (4.4.2) 
For example, 
walk(4) = walk(3) + walk(2) =3+2=5. 


We can write a recursive algorithm to compute walk(n) by translating equation 
(4.4.2) directly into an algorithm. The base cases are n = 1 andn = 2. < 


Algorithm 4.4.6 Robot Walking 


This algorithm computes the function defined by 


Il, a= I 
Walk(n)i— as i) 
walk(n — 1) + walk(n—2) n> 2. 
‘ Input: n 
Go Online e ee 
For a C program implementing Output: waik(n) 
this algorithm, see walk(n) { 
goo.gl/m/754WF if (n == 1Vn==2) 
return n 
return walk(n — 1) + walk(n — 2) 


} 


Using the method of the proof of Theorem 4.4.3, we can prove that Algorithm 4.4.6 

is correct (see Exercise 7). 
Go Online The sequence walk(1), walk(2), walk(3),..., whose values begin 1, 2, 3,5, 8, 
13,..., is related to the Fibonacci sequence. The Fibonacci sequence {f,,} is defined 


For more on the Fibonacci ; 
by the equations 


sequence, see 


goo.gl/ZwpPlu 
fi = ile h= I, Sn =Sn-1 + fr—2 n> 3. 
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Example 4.4.7 


Figure 4.4.1 A pine cone. There are 
13 clockwise spirals (marked with 
white thread) and 8 counterclockwise 
spirals (marked with dark thread). 
[Photo by the author; pine cone 
courtesy of André Berthiaume and 
Sigrid (Anne) Settle. | 


The Fibonacci sequence begins 


1,.1,.2,.35.5, 8, 13506... 


Since walk(1) = fo, walk(2) = fs, and 
walk(n) = walk(n — 1) + walk(n — 2), tn =Sn-1 + Sn-2 for all n > 3, 


it follows that 


walk(n) = fr+1 for alln > 1. 


(The argument can be formalized using mathematical induction; see Exercise 8.) 

The Fibonacci sequence is named in honor of Leonardo Fibonacci (ca. 1170— 
1250), an Italian merchant and mathematician. The sequence originally arose in a puz- 
zle about rabbits (see Exercises 18 and 19). After returning from the Orient in 1202, 
Fibonacci wrote his most famous work, Liber Abaci (available in an English translation 
by [Sigler]), which, in addition to containing what we now call the Fibonacci sequence, 
advocated the use of Hindu-Arabic numerals. This book was one of the main influences 
in bringing the decimal number system to Western Europe. Fibonacci signed much of his 
work “Leonardo Bigollo.” Bigollo translates as “traveler” or “blockhead.” There is some 
evidence that Fibonacci enjoyed having his contemporaries consider him a blockhead for 
advocating the new number system. 

The Fibonacci sequence pops up in unexpected places. Figure 4.4.1 shows a pine 
cone with 13 clockwise spirals and 8 counterclockwise spirals. Many plants distribute 
their seeds as evenly as possible, thus maximizing the space available for each seed. The 
pattern in which the number of spirals is a Fibonacci number provides the most even 
distribution (see [Naylor, Mitchison]). In Section 5.3, the Fibonacci sequence appears 
in the analysis of the Euclidean algorithm. 


Use mathematical induction to show that 


Sofi =fri2—1 foralln > 1. 
k=l 
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SOLUTION For the basis step (n = 1), we must show that 


1 
Sof =f 1. 
k=1 


Since Lak =f, = land f; — 1 = 2 — 1 = 1, the equation is verified. 
For the inductive step, we assume case n 


yr = fr42 -—1 
k=1 


and prove case n + 1 


n+l 


yh =fn43 -i1. 
k=1 


Now 
n+1 n 
y=) i 
k=l k=1 
= (fr42 — I) + fg by the inductive assumption 
= f+ + fro —1 
= fn43 =i, 


The last equality is true because of the definition of the Fibonacci numbers: 


fr=hrith2  foralln > 3. 


Since the basis step and the inductive step have been verified, the given equation is true 
foralln > 1. < 


4.4 Problem-Solving Tips 


A recursive function is a function that invokes itself. The key to writing a recursive func- 
tion is to find a smaller instance of the problem within the larger problem. For example, 
we can compute n! recursively because n! = n-(n — 1)! for all n > 1. The situation is 
analogous to the inductive step in mathematical induction when we must find a smaller 
case (e.g., case n) within the larger case (e.g., case n + 1). 

As another example, tiling an n x n deficient board with trominoes when n is a 
power of 2 can be done recursively because we can find four (n/2) x (n/2) subboards 
within the original n x n board. Note the similarity of the tiling algorithm to the inductive 
step of the proof that every n x n deficient board can be tiled with trominoes when n is 
a power of 2. 

To prove a statement about the Fibonacci numbers, use the equation 


Sn =fr-1 + fr for alln > 3. 


The proof will often use mathematical induction and the previous equation (see Exam- 
ple 4.4.7). 
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4.4 Review Exercises 


1. What is a recursive algorithm? 


2. What is a recursive function? 


3. Give an example of a recursive function. 


4. Explain how the divide-and-conquer technique works. 


1. 
2. 


17s) 


raat nn & 


. (a 


Trace Algorithm 4.4.2 forn = 4. 


Trace Algorithm 4.4.4 when n = 4 and the missing square is 
the upper-left corner square. 


. Trace Algorithm 4.4.4 when n = 8 and the missing square is 


four from the left and six from the top. 


. Prove that Algorithm 4.4.4 is correct. 
. Trace Algorithm 4.4.6 for n = 4. 

. Trace Algorithm 4.4.6 for n = 5. 

. Prove that Algorithm 4.4.6 is correct. 
. Prove that 


walk(n) = fn+1 for alln > 1. 


. (a) Use the formulas 


s=1L Sy =Syp-p tn for alln > 2, 


to write a recursive algorithm that computes 
Spo =14+2+34+---+n. 
(b 


i 


Give a proof using mathematical induction that your 

algorithm for part (a) is correct. 

Use the formulas 
sy = 2; Sn = Sn—1 +2n for all n > 2, 

to write a recursive algorithm that computes 


Sp =24+44+64+---+2n. 
(b 


YS 


Give a proof using mathematical induction that your 
algorithm for part (a) is correct. 


past 


A robot can take steps of | meter, 2 meters, or 3 meters. 
Write a recursive algorithm to calculate the number of 
ways the robot can walk n meters. 

(b) Give a proof using mathematical induction that your 
algorithm for part (a) is correct. 


. Write a recursive algorithm to find the minimum of a finite 


sequence of numbers. Give a proof using mathematical induc- 
tion that your algorithm is correct. 


. Write a recursive algorithm to find the maximum of a finite 


sequence of numbers. Give a proof using mathematical induc- 
tion that your algorithm is correct. 


. Write a recursive algorithm that reverses a finite sequence. 


Give a proof using mathematical induction that your algorithm 
is correct. 


5. What is a base case in a recursive function? 


6. Why must every recursive function have a base case? 


7. How is the Fibonacci sequence defined? 


8. Give the first four values of the Fibonacci sequence. 


15. 
x16. 


«17. 


Write a nonrecursive algorithm to compute n!. 


A robot can take steps of | meter or 2 meters. Write an algo- 
rithm to list all of the ways the robot can walk n meters. 


A robot can take steps of | meter, 2 meters, or 3 meters. Write 
an algorithm to list all of the ways the robot can walk n meters. 


Exercises 18-36 concern the Fibonacci sequence {fn}. 


18. 


Suppose that at the beginning of the year, there is one pair of 
rabbits and that every month each pair produces a new pair that 
becomes productive after one month. Suppose further that no 
deaths occur. Let a, denote the number of pairs of rabbits at 
the end of the nth month. Show that aj = 1,a. = 2, and 
An — An—| = An—2. Prove that a, = fn41 for alln > 1. 


. Fibonacci’s original question was: Under the conditions of 


Exercise 18, how many pairs of rabbits are there after one 
year? Answer Fibonacci’s question. 


. Show that the number of ways to tile a 2 x n board with | x 2 


rectangular pieces is f,1, the (n + 1)st Fibonacci number. 


. Use mathematical induction to show that 
P=frifert(-Dt! — foralln > 2. 
. Show that 
fe = fn—2fng2 + (-1)" for all n > 3. 
. Show that 
feo —feu =fdoe3 — foralln > 1. 
. Use mathematical induction to show that 


n 
Wis =fifor1  foralln > 1. 


k=1 


. Use mathematical induction to show that for all n > 1, f, is 


even if and only if n is divisible by 3. 


. Use mathematical induction to show that 


fin =f —f2 and fone =fP + £24, forall n > 2. 


. Use mathematical induction to show that for all n > 6, 


n—-1 
oY" 


. Use mathematical induction to show that for all n > 1, 


feo 


«30. 


. Use mathematical induction to show that for all n > 1, 


She = font -i1. 


k=1 


n 
So fit = fon, 
k=l 


Use mathematical induction to show that every integer n > | 
can be expressed as the sum of distinct Fibonacci numbers, no 
two of which are consecutive. 


. Show that the representation in Exercise 30 is unique if we do 


not allow f; as a summand. 


. Show that for all n > 2, 


fn + \/5f2_) 4-4! 


In = 2 


Notice that this formula gives f, in terms of one predecessor 
rather than two predecessors as in the original definition. 


33. Prove that 
“(= nga 
1+) ="? foralln > 1. 
fifi Sn41 
34. Define a sequence {g,} as gj = cy and gz = co for constants 


Cl and CQ, and 


8n = 8n—-1 + 8n-2 
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for n > 3. Prove that 


8n = 8ifn—2 + 82fn-1 


for all n > 3. 


. Prove that 


SCD fe = (Dri — 1 foralln > 2. 


k=1 
. Prove that 
n 
SoD fe = (-1)"(fn-1 + fa-3) — 2 foralln > 4. 
k=1 
. [Requires calculus] Assume the formula for differentiating 
products: 
dfs) __,dg | of 
vax ae 8a 
Use mathematical induction to prove that 
‘1 
a = nx! form 12.0444 
. [Requires calculus] Explain how the formula gives a recursive 


algorithm for integrating log” |x|: 


/ log” |x| dx = x log” |x| —1n : log”! |x| dx. 


Give other examples of recursive integration formulas. 


Chapter 4 Notes 


The first half of [Knuth, 1977] introduces the concept of an algorithm and various mathemat- 
ical topics, including mathematical induction. The second half is devoted to data structures. 

Most general references on computer science contain some discussion of algorithms. 
Books specifically on algorithms are [Aho; Baase; Brassard; Cormen; Johnsonbaugh; Knuth, 
1997, 1998a, 1998b; Manber; Miller; Nievergelt; and Reingold]. [McNaughton] contains a 
very thorough discussion on an introductory level of what an algorithm is. Knuth’s expository 
article about algorithms ([{Knuth, 1977]) and his article about the role of algorithms in the 
mathematical sciences ({Knuth, 1985]) are also recommended. [Gardner, 1992] contains a 
chapter about the Fibonacci sequence. 
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19. Big oh notation: f(n) = O(g(n)) 
20. Omega notation: f(n) = Q(g(n)) 
21. Theta notation: f(n) = O(g(n)) 


Section 4.4 


22. Recursive algorithm 


23. Recursive function 

24. Divide-and-conquer technique 

25. Base cases: Situations where a recursive function does not 
invoke itself 

26. Fibonacci sequence {f,} : fi = lf = Lfp = fr-i 4 
Sr-2, n= 3 


Chapter 4 Self-Test 


1. Trace Algorithm 4.1.1 for the values a = 12,b = 3,c =0. 

2. Which of the algorithm properties—input, output, preci- 

sion, determinism, finiteness, correctness, generality—if 
any, are lacking in the following? Explain. 

Input: 

Output: 


S (a set of integers), m (an integer) 
All finite subsets of S that sum to m 
1. List all finite subsets of S and their sums. 


2. Step through the subsets listed in 1 and output each 
whose sum is m. 


3. Trace Algorithm 4.2.1 for the input ¢ = “111011” and 


p = “110”. 

4. Trace Algorithm 4.2.3 for the input 44, 64, 77, 15, 3. 

5. Trace Algorithm 4.2.4 for the input 5, 51, 2, 44, 96. 
Assume that the values of rand are 


rand(1,5)=1,  rand(2, 5) = 3, 


rand(4, 5) = 5. 


rand(3,5) = 5, 


6. Write an algorithm that receives as input the distinct num- 
bers a, b, and c and assigns the values a, b, and c to the 
variables x, y, and z so that x < y < z. 


7. Write an algorithm that receives as input the sequence 
S|,..., 8, sorted in nondecreasing order and prints all val- 
ues that appear more than once. Example: If the sequence 
is 1,1, 1,5,8,8,9, 12, the outputis 1 8. 

8. Write an algorithm that returns true if the values of a, b, and 
c are distinct, and false otherwise. 


9. Write an algorithm that tests whether two n x n matri- 
ces are equal and find a theta notation for its worst-case 
time. 


Select a theta notation from among @(1), @(n), @(n?), @(n>), 
O(n‘), ©(2"), or O(n!) for each of the expressions in Exercises 
10 and 11, 


10. 4n? + 2n —5 
11. 12423 +---473 
12. Select a theta notation from among ©(1), O(n), O(n’), 


O(n3), ©(2"), or @(n!) for the number of times the line 
x =x-+ 1 is executed. 


fori=l1ton 
forj =1ton 
x=x4+1 


13. Trace Algorithm 4.4.4 (the tromino tiling algorithm) when 
n = 8 and the missing square is four from the left and two 
from the top. 


Exercises 14-16 refer to the tribonacci sequence {t,,} defined by 
the equations 


th-1 + tr-2 4 


h=h=h=1, t th-3 foralln > 4. 


14. Find tq and ts. 
15. Write a recursive algorithm to compute t,, > 1. 


16. Give a proof using mathematical induction that your algo- 
rithm for Exercise 15 is correct. 


Chapter 4 Computer Exercises 


1. Implement Algorithm 4.1.2, finding the largest element in 
a sequence, as a program. 


. Implement Algorithm 4.2.1, text search, as a program. 
. Implement Algorithm 4.2.3, insertion sort, as a program. 


. Implement Algorithm 4.2.4, shuffle, as a program. 


an & WwW NY 


. Run shuffle (Algorithm 4.2.4) many times for the same in- 
put sequence. How might the output be analyzed to deter- 
mine if it is truly “random”? 


6. Implement selection sort (see Exercise 22, Section 4.2) as a 
program. 


7. Compare the running times of insertion sort (Algorithm 
4.2.3) and selection sort (see Exercise 22, Section 4.2) for 
several inputs of different sizes. Include data sorted in non- 
decreasing order, data sorted in nonincreasing order, data 
containing many duplicates, and data in random order. 


8. Write recursive and nonrecursive programs to compute n!. 
Compare the times required by the programs. 


9. 


10. 


11. 


Write a program whose input is a 2” x 2” board with one 
missing square and whose output is a tiling of the board by 
trominoes. 


Write a program that uses a graphics display to show a tiling 
with trominoes of a 2” x 2” board with one square missing. 


Write a program that tiles with trominoes an n x n board 
with one square missing, provided that n # 5 and 3 does 
not divide n. 


12. 


13. 


14. 
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Write recursive and nonrecursive programs to compute the 
Fibonacci sequence. Compare the times required by the pro- 
grams. 
A robot can take steps of 1 meter or 2 meters. Write a pro- 
gram to list all of the ways the robot can walk n meters. 


A robot can take steps of 1, 2, or 3 meters. Write a program 
to list all of the ways the robot can walk n meters. 
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Integers and Integer 
Algorithms 

The Euclidean 
Algorithm 

The RSA Public-Key 
Cryptosystem 


5.1 


Example 5.1.2 


Chapter 5 


INTRODUCTION TO 
NUMBER THEORY 


Number theory is the branch of mathematics concerned with the integers. Traditionally, 
number theory was a pure branch of mathematics—known for its abstract nature rather 
than its applications. The great English mathematician, G. H. Hardy (1877-1947), used 
number theory as an example of a beautiful, but impractical, branch of mathematics. 
However, in the late 1900s, number theory became extremely useful in cryptosystems— 
systems used for secure communications. 

In the preceding chapters, we used some basic number theory definitions such as 
“divides” and “prime number.” In Section 5.1, we review these basic definitions and ex- 
tend the discussion to unique factorization, greatest common divisors, and least common 
multiples. 

In Section 5.2, we discuss representations of integers and some algorithms for 
integer arithmetic. 

The Euclidean algorithm for computing the greatest common divisor is the subject 
of Section 5.3. This is surely one of the oldest algorithms. Euclid lived about 295 B.c., 
and the algorithm probably predates him. 

As an application of the number theory presented in Sections 5.1—5.3, we discuss 
the RSA system for secure communications in Section 5.4. 


Divisors 


In this section, we give the basic definitions and terminology. We begin by recalling the 
definition of “divides,” and we introduce some related terminology. 


Definition 5.1.1 > Let and d be integers, d 4 0. We say that d divides n if 
there exists an integer qg satisfying n = dq. We call q the quotient and d a divisor or 
factor of n. If d divides n, we write d | n. If d does not divide n, we write d | n. < 


Since 21 = 3-7, 3 divides 21 and we write 3 | 21. The quotient is 7. We call 3 a divisor 
or factor of 21. < 


We note that if m and d are positive integers and d | n, thend < n. (If d | n, 
there exists an integer g such that n = dq. Since n and d are positive integers, 1 < gq. 
Therefore, d < dq =n.) 


Theorem 5.1.3 


Example 5.1.5 


Example 5.1.6 
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Whether an integer d > 0 divides an integer n or not, we obtain a unique quotient 
q and remainder r as given by the Quotient-Remainder Theorem (Theorem 2.5.6): There 
exist unique integers g (quotient) and r (remainder) satisfying n = dq+r,0 <r < d. 
The remainder r equals zero if and only if d divides n. 

Some additional properties of divisors are given in the following theorem and will 
be useful in our subsequent work in this chapter. 


Let m, n, and d be integers. 


(a) Ifd| mandd|n, thend | (m+n). 
(b) Ifd | mand d | n, then d | (m—n). 
(c) Ifd | m, then d | mn. 


Proof (a) Suppose that d | mand d | n. By Definition 5.1.1, 
m= dqi (5.1.1) 


for some integer g; and 


n= dq2 (5.1.2) 
for some integer qo. If we add equations (5.1.1) and (5.1.2), we obtain 
m+n = dq + dqr = d(qi + qQ). 


Therefore, d divides m + n (with quotient g; + gz). We have proved part (a). 
The proofs of parts (b) and (c) are left as exercises (see Exercises 27 and 28). <4 


Definition 5.1.4 > An integer greater than 1 whose only positive divisors are 
itself and | is called prime. An integer greater than | that is not prime is called 
composite. < 


The integer 23 is prime because its only divisors are itself and 1. The integer 34 is com- 
posite because it is divisible by 17, which is neither 1 nor 34. < 


If an integer n > 1 is composite, then it has a positive divisor d other than | and 
itself. Since d is positive and d 4 1,d > 1. Since dis a divisor of n, d < n. Sinced #n, 
d <n. Therefore, to determine if a positive integer n is composite, it suffices to test 
whether any of the integers 2,3, ..., — 1 divides n. If some integer in this list divides 
n, then n is composite. If no integer in this list divides n, then n is prime. (Actually, we 
can shorten this list considerably; see Theorem 5.1.7.) 


By inspection, we find that none of 2,3, 4,5, ...,41, 42 divides 43; thus, 43 is prime. 
Checking the list 2, 3, 4, 5, ... , 449, 450 for potential divisors of 451, we find that 
11 divides 451 (451 = 11-41); thus, 451 is composite. < 


In Example 5.1.6, to determine whether a positive integer n > 1 was prime, we 
checked the potential divisors 2, 3,...,— 1. Actually, it suffices to check only 


2,3,..., L/al. 
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Theorem 5.1.7 


Algorithm 5.1.8 


Go Online 

For a C++ program 
implementing this 
algorithm, see 
goo.gl/y/T2se 


Example 5.1.9 


A positive integer n greater than | is composite if and only if n has a divisor d satis- 
fying 2<d< Jn. 


Proof We must prove 


If n is composite, then n has a divisor d satisfying 2 < d < /n, (5.1.3) 
and 
If 7 has a divisor d satisfying 2 < d < ./n, then n is composite. (5.1.4) 


We first prove (5.1.3). Suppose that n is composite. The discussion following Ex- 
ample 5.1.5 shows that 7 has a divisor d’ satisfying 2 < d’ < n. We now argue by cases. 
If d’ < ./n, then n has a divisor d (namely d = d’) satisfying 2 < d < /n. 

The other case is d’ > ./n. Since d' divides n, by Definition 5.1.1 there exists an 
integer g satisfying n = d'q. Thus q is also a divisor of n. We claim that g < ./n. To show 
that g < ./n, we use proof by contradiction. Thus, suppose that g > ./n. Multiplying 


d' > ./nand q > ./n gives 
n=dq>JnJ/n=n, 


which is a contradiction. Thus, g < ./n. Therefore, n has a divisor d (namely d = q) 
satisfying 2<d< Jn. 

It remains to prove (5.1.4). If 7 has a divisor d satisfying 2 < d < ./n, by Defini- 
tion 5.1.4 n is composite. The proof is complete. < 


We may use Theorem 5.1.7 to construct the following algorithm that tests whether 
a positive integer n > | is prime. 


Testing Whether an Integer Is Prime 


This algorithm determines whether the integer n > | is prime. If n is prime, the al- 
gorithm returns 0. If n is composite, the algorithm returns a divisor d satisfying 
2 <d<./n. To test whether d divides n, the algorithm checks whether the remainder 
when n is divided by d, n mod d, is zero. 


Input: 1 
Output: d 


is_prime(n) { 
for d = 2 to L/n| 
if (n mod d == 0) 
return d 
return 0 


To determine whether 43 is prime, Algorithm 5.1.8 checks whether any of 2, 3, 4, 5,6 = 
|/43] divides 43. Since none of these numbers divides 43, the condition 


nmodd == 0 (5.1.5) 


is always false. Therefore, the algorithm returns 0 to indicate that 43 is prime. 

To determine whether 451 is prime, Algorithm 5.1.8 checks whether any of 
De Scacey 21 = | /451| divides 451. Ford = 2,3,..., 10, d does not divide 451 and the 
condition (5.1.5) is false. However, when d = 11, d does divide 451 and the condition 


Example 5.1.10 


Theorem 5.1.11 
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(5.1.5) is true. Therefore, the algorithm returns 11 to indicate that 451 is composite and 
11 divides 451. < 


In the worst case (when n is prime and the for loop runs to completion), Algo- 
rithm 5.1.8 takes time @(./n). Although Algorithm 5.1.8 runs in time polynomial in n 
(since ./n < n), it does not run in time polynomial in the size of the input (namely, 
n). [We can represent n in considerably less space than ©(n); see Example 5.2.1.] We 
say that Algorithm 5.1.8 is not a polynomial-time algorithm. It is not known whether 
there is a polynomial-time algorithm that can find a factor of a given integer; but most 
computer scientists think that there is no such algorithm. On the other hand, in 2002 
Manindra Agarwal and two of his students, Nitin Saxena and Neeraj Kayal, discovered 
a polynomial-time algorithm that can determine whether or not a given integer is prime 
(see [Agarwal]). The question of whether there is a polynomial-time algorithm that can 
factor an integer is of more than academic interest since the security of certain encryption 
systems relies on the nonexistence of such an algorithm (see Section 5.4). 

Notice that if a composite integer n is input to Algorithm 5.1.8, the divisor returned 
is prime; that is, Algorithm 5.1.8 returns a prime factor of a composite integer. To prove 
this, we use proof by contradiction. If Algorithm 5.1.8 returns a composite divisor of 
n, say a, then a has a divisor a’ less than a. Since a’ also divides n and a’ < a, when 
Algorithm 5.1.8 sets d = a’, it will return a’, not a. This contradiction shows that if a 
composite integer n is input to Algorithm 5.1.8, the divisor returned is prime. 


If the input to Algorithm 5.1.8 is n = 1274, the algorithm returns the prime 2 because 
2 divides 1274, specifically 1274 = 2-637. 

If we now input n = 637 to Algorithm 5.1.8, the algorithm returns the prime 7 
because 7 divides 637, specifically 637 = 7-91. 

If we now input n = 91 to Algorithm 5.1.8, the algorithm returns the prime 7 
because 7 divides 91, specifically 91 = 7-13. 

If we now input n = 13 to Algorithm 5.1.8, the algorithm returns 0 because 13 is 
prime. 

Combining the previous equations gives 1274 as a product of primes 


1274 = 2-637 =2-7-91 =2-7-7-13. 


We have illustrated how to write any integer greater than | as a product of primes. 
It is also a fact (although we will not prove it in this book) that, except for the order of the 
prime factors, the prime factors are unique. This result is known as the Fundamental 
Theorem of Arithmetic or the unique factorization theorem. (Unique factorization 
does not hold in some systems; see Exercise 44.) < 


Fundamental Theorem of Arithmetic 


Any integer greater than | can be written as a product of primes. Moreover, if the 
primes are written in nondecreasing order, the factorization is unique. In symbols, if 


1 PDO Di 
where the p, are primes and p; < po <--- < p;, and 
n= P\Py-* Pi, 
where the p}, are primes and p| < p) <--- < Di. then i = j and 


pepe tonal kg — alts 
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Theorem 5.1.12 


Example 5.1.13 


Example 5.1.15 


We next prove that the number of primes is infinite. 


The number of primes is infinite. 


Proof _Itsuffices to show that if p is a prime, there is a prime larger than p. To this end, 
we let pi, p2,..-, Pn denote all of the distinct primes less than or equal to p. Consider 
the integer 


m= Pipr-**Pna+l. 


Notice that when m is divided by p;, the remainder is 1: 


m=piq+\, q = P1P2 +++ Pi-1Pi+1 +++ Pn- 


Therefore, for all i = 1 ton, p; does not divide m. Let p’ be a prime factor of m (m may 
or may not itself be prime; see Exercise 33). Then p’ is not equal to any of p;,i = 1 ton. 
Since pi, p2,--., Pn is a list of all of the primes less than or equal to p, we must have 
p’ > p. The proof is complete. < 


Show how the proof of Theorem 5.1.12 produces a prime larger than 11. 


SOLUTION We list the primes less than or equal to 11: 2,3,5,7,11. We let m = 
2-3-5-7-11 + 1 = 2311. Using Algorithm 5.1.8, we find that 2311 is prime. We 
have found a prime, namely 2311, larger than each of 2, 3,5, 7, 11. (If 2311 had turned 
out not to be prime, Algorithm 5.1.8 would have found a factor of 2311, which would 
necessarily be larger than each of 2, 3,5, 7, 11.) < 


The greatest common divisor of two integers m and n (not both zero) is the largest 
positive integer that divides both m and n. For example, the greatest common divisor of 4 
and 6 is 2, and the greatest common divisor of 3 and 8 is 1. We use the notion of greatest 
common divisor when we check to see if a fraction m/n, where m and n are integers, 
is in lowest terms. If the greatest common divisor of m and n is 1, m/n is in lowest 
terms; otherwise, we can reduce m/n. For example, 4/6 is not in lowest terms because 
the greatest common divisor of 4 and 6 is 2, not 1. (We can divide both 4 and 6 by 2.) 
The fraction 3/8 is in lowest terms because the greatest common divisor of 3 and 8 is 1. 


Definition 5.1.14 > Let m and n be integers with not both m and n zero. A 
common divisor of m and n is an integer that divides both m and n. The greatest common 
divisor, written 

gcd(m, n), 


is the largest common divisor of m and n. < 


The positive divisors of 30 are 
1; 2,:3,.96,;.10;, 15,30, 
and the positive divisors of 105 are 
1;3;.5,°7, 15,21, 35,105; 
thus the positive common divisors of 30 and 105 are 
1;3;:5,.19. 


It follows that the greatest common divisor of 30 and 105, gcd(30, 105), is 15. < 


Example 5.1.16 


Theorem 5.1.17 


Example 5.1.18 
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We can also find the greatest common divisor of two integers m and n by looking 
carefully at their prime factorizations. We illustrate with an example and then explain 
the technique in detail. 


We find the greatest common divisor of 30 and 105 by looking at their prime 
factorizations 


30 = 2-3-5 105 = 3-5-7. 
Notice that 3 is acommon divisor of 30 and 105 since it occurs in the prime factorization 
of both numbers. For the same reason, 5 is also a common divisor of 30 and 105. Also, 
3-5 = 15 is also a common divisor of 30 and 105. Since no larger product of primes is 


common to both 30 and 105, we conclude that 15 is the greatest common divisor of 30 
and 105. < 


We state the method of Example 5.1.16 as Theorem 5.1.17. 


Let m and n be integers, m > 1, n > 1, with prime factorizations 


ay a2 


ax 
IDG Joey eye 

and 
tht by 
N=P, Py ***Pr- 


Uf the prime p; is not a factor of m, we let a; = 0. Similarly, if the prime p; is not a 
factor of n, we let b; = 0.) Then 


min(a;,b,) _ min(a2,b2) min(ay bx) 


gced(m, n) = py P2 2B Ojafe 


Proof Let g = gcd(m, n). We note that if a prime p appears in the prime factorization 


of g, p must be equal to one of pi, ... , px; otherwise, g would not divide m or n (or both). 
Therefore g = pj! ---p;' for some cy, ..., cy. Now 
pn pene a eee (5.1.6) 


divides both m and n and if any exponent, min(q;, b;), is increased, the resulting integer 
will fail to divide m or n (or both). Therefore (5.1.6) is the greatest common divisor of 
mand n. < 


Using the notation of Theorem 5.1.17, we have 
82320 = 2*.3'.5'-7°.11° 
and 
950796 = 2°-3° «5°. 74-111. 
By Theorem 5.1.17, 


gcd(82320 950796) = gmin(4,2) 2 gmin(1,2) z 5min(1,0) /7minG.4) . ypmin@.D 
as 2? 3! -59.73 -11° 
= 4116. < 
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Example 5.1.20 


Example 5.1.21 


Theorem 5.1.22 


Neither the “list all divisors” method of Example 5.1.15 nor use of prime factoriza- 
tion as in Example 5.1.18 is an efficient method of finding the greatest common divisor. 
The problem is that both methods require finding the prime factors of the numbers in- 
volved and no efficient algorithm is known to compute these prime factors. However, 
in Section 5.3, we will present the Euclidean algorithm, which does provide an efficient 
way to compute the greatest common divisor. 

A companion to the greatest common divisor is the least common multiple. 


Definition 5.1.19 >» Let mandn be positive integers. A common multiple of m 
and n is an integer that is divisible by both m and n. The least common multiple, written 


Icm(m, n), 
is the smallest positive common multiple of m and n. < 
The least common multiple of 30 and 105, Ilem(30, 105), is 210 because 210 is divisible 


by both 30 and 105 and, by inspection, no positive integer smaller than 210 is divisible 
by both 30 and 105. < 


We can find the least common multiple of 30 and 105 by looking at their prime 
factorizations 
30 = 2-3-5 105 = 3-5-7. 


The prime factorization of lcem(30, 105) must contain 2, 3, and 5 as factors [so that 30 
divides Icm(30, 105)]. It must also contain 3, 5, and 7 [so that 105 divides lem(30, 105)]. 
The smallest number with this property is 


2+3-+5+-7= 210. 
Therefore, lcm(30, 105) = 210. < 


We state the method of Example 5.1.21 as Theorem 5.1.22. 


Let m and n be integers, m > 1, n > 1, with prime factorizations 
m = pi'py pi 

and 
avy ery 


(Uf the prime p; is not a factor of m, we let a; = 0. Similarly, if the prime p; is not a 
factor of n, we let b; = 0.) Then 


Icm(m, n) = fel AU es ay ue gy 


Proof Let / = Icm(m, n). We note that if a prime p appears in the prime factorization 
of /, p must be equal to one of p),..., px; otherwise, we could eliminate p and obtain 
a smaller integer that is divisible by both m and n. Therefore / = p{' --- pj‘ for some 
Cl, +++, ¢%. Now 


sb 12, bo cD 
| i Dee b>) a po Ke) (5.1.7) 


Example 5.1.23 


Example 5.1.24 


Theorem 5.1.25 
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is divisible by both m and n and if any exponent, max(a;, b;), is decreased, the resulting 
integer will fail to be divisible by m or n (or both). Therefore (5.1.7) is the least common 
multiple of m and n. < 


Using the notation of Theorem 5.1.22, we have 
82320 = 2? «3' 5" «776119 


and 


950796 = 27-37 -5°.74 111, 
By Theorem 5.1.22, 


lem(82320 950796) = qmax(4,2) .3max(1,2) . 5max(1,0) 2 7maxG.4) Pail ymax0.)) 
ap ae sil 
= 19015920. < 


In Example 5.1.15, we found that gcd(30, 105) = 15, and in Example 5.1.21, we found 
that lcm(30, 105) = 210. Notice that the product of the gcd and Icm is equal to the 
product of the pair of numbers; that is, 


gcd(30, 105) -Icm(30, 105) = 15-210 = 3150 = 30-105. 


This formula holds for any pair of numbers as we will show in Theorem 5.1.25. < 


For any positive integers m and n, 


gcd(m, n) +lem(m, n) = mn. 


Proof If m= 1, then gcd(m, n) = 1 and Iem(m, n) = n, so 
gcd(m, n)-Icm(m, n) = 1-n = mn. 

Similarly, if nm = 1, then gcd(m, n) = | and Icm(m, n) = m, so 
gcd(m, n)-Icm(m, n) = 1-m = mn. 


Thus, we may assume that m > 1 andn > 1. 
The proof combines the formulas for the gcd (Theorem 5.1.17) and Icm 
(Theorem 5.1.22) (which require that m > 1 and n > 1) with the fact that 


min(x, y) + max(x, y) =x+y for all x and y. 


This latter formula is true because one of {min(x, y), max(x, y)} equals x and 
the other equals y. We now put this all together to produce a proof. 
Write the prime factorizations of m and n as 
a) a2 ak 
m= Py Po -** Px 
and 


bi b> by 
N=P) Py ***PK- 


222 


Chapter 5 @ Introduction to Number Theory 


(if the prime p; is not a factor of m, we let a; = 0. Similarly, if the prime p; is not a factor 
of n, we let b; = 0.) By Theorem 5.1.17, 


gcd(m, n) _ pee . Cae 
and by Theorem 5.1.22, 
Icm(m, n) _ fea aa 3 ger 
Therefore, 
ged(m, n) «lem(m, n) = [prin@r?) ... prin@e??). 


max(a),b)) max(az,bx) 
[P| De ] 


_ ,min(a;,b;)+max(a;,b)) min(ax,b¢)-+max (ay, b¢) 
=Ppi Dy 


— paythy ath 
=p} pt 
a ap b by 
= [py Pe py ++ Pe] = mn. < 


If we have an algorithm to compute the greatest common divisor, we can compute 
the least common multiple by using Theorem 5.1.25: 
mn 
Icm(m, n) = ————. 
gcd(m, n) 
In particular, if we have an efficient algorithm to compute the greatest common divisor, 
we can efficiently compute the least common multiple as well. 


5.1 Problem-Solving Tips 


The straightforward way to determine whether an integer > 1 is prime is to test whether 
any of 2,3,..., |./n] divides n. While this technique becomes too time-consuming as 
n grows larger, it suffices for relatively small values of n. This technique can be iterated 
to find the prime factorization of n, again for relatively small values of n. 

Two ways of finding the greatest common divisor of a and b were presented. The 
first way was to list all of the positive divisors of a and all of the positive divisors of 
b and then, among all of the common divisors, choose the largest. This technique is 
time consuming and was shown mainly to illustrate exactly what is meant by common 
divisors and the greatest common divisor. 

The second technique was to compare the prime factorizations of a and b. If p’ 
appears in a and p/ appears in b, include p™"“” in the prime factorization of the greatest 
common divisor. This technique works well if the numbers a and b are relatively small 
so that the prime factorizations of each can be found, or if the prime factorizations of 
each are given. In Section 5.3, we present the Euclidean algorithm that efficiently finds 
the greatest common divisor even for large values of a and b. 

If you compute the gcd(a, b), you can immediately compute the least common 
multiple using the formula 


ab 


Icm(a, b) = ————_. 
gcd(a, b) 


The least common multiple can also be computed by comparing the prime factor- 
izations of a and b. If p' appears in a and p/ appears in b, include p™*“) in the prime 
factorization of the least common multiple. 
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5.1 Review Exercises 


— 


. Define d divides n. 

. Define d is a divisor of n. 
Define quotient. 

. Define n is prime. 


. Define n is composite. 


A vn & WwW N 


. Explain why, when testing whether an integer n > 1 is prime 
by looking for divisors, we need only check whether any of 2 
to |./n] divides n. 


I 


. Explain why Algorithm 5.1.8 is not considered to be a 
polynomial-time algorithm. 


ow 


. What is the Fundamental Theorem of Arithmetic? 


9. Prove that the number of primes is infinite. 
10. What is a common divisor? 
11. What is the greatest common divisor? 


12. Explain how to compute the greatest common divisor of m and 
n, not both zero, given their prime factorizations. 


13. What is a common multiple? 
14. What is the least common multiple? 


15. Explain how to compute the least common multiple of positive 
integers m and n, given their prime factorizations. 


16. How are the greatest common divisor and least common multi- 
ple related? 


In Exercises 1-8, trace Algorithm 5.1.8 for the given input. 


1. n=9 2. n= 209 

3. n=47 4. n= 637 

5. n= 4141 6. n = 1007 

7. n = 3738 8. n = 1050703 

9. Which of the integers in Exercises 1-8 are prime? 


10. Find the prime factorization of each integer in Exercises 1-8. 


11. Find the prime factorization of 11!. 


Find the greatest common divisor of each pair of integers in Exer- 
cises 12-24. 


12. 0,17 13. 60,90 14, 5,25 

15. 110, 273 16. 315, 825 17. 220, 1400 
18. 20, 40 19. 2091, 4807 20. 331, 993 
21. 13, 13? 22. 15, 15° 


23. 37-79-11, 23-5-7 
24, 32-79-11, 32-73-11 


25. Find the least common multiple of each pair of integers in 
Exercises 13-24. 


26. For each pair of integers in Exercises 13-24, verify that 
gced(m, n) -lcm(m, n) = mn. 


27. Let m,n, and d be integers. Show that if d | m and d | n, then 
d|(m—n). 


28. Let m,n, and d be integers. Show that if d | m, then d | mn. 


29. Let m,n, d;, and d2 be integers. Show that if d; | mand d2 | n, 
then d,d> | mn. 


30. Let n, c, and d be integers. Show that if dc | nc, then d | n. 


31. Let a, b, and c be integers. Show that if a | b and b | c, then 
alc. 


32. Suggest ways to make Algorithm 5.1.8 more efficient. 


33. Give an example of consecutive primes pj = 2,p2,..-,DPn 
where 
Pip2++*Pn +1 
is not prime. 


Exercises 34 and 35 use the following definition: A subset 
{a1,...,4n} of Z* is a *-set of size n if (a; — qj) | a for alli 
and j, where i # j, 1 <i<n,and1 <j <n. These exercises are 
due to Martin Gilchrist. 


34. Prove that for all n > 2, there exists a *-set of size n. Hint: 
Use induction on n. For the Basis Step, consider the set {1, 2}. 
For the Inductive Step, let bp = [[j_, 4% and bj = bo + a; for 
1l<i<n. 

35. Using the hint in Exercise 34, construct *-sets of sizes 3 
and 4. 


The Fermat numbers Fo, F, ... are defined as F, = 27" +1. 
36. Prove that 


n—1 
[[% =" -2. foralln, n= 1. 
i=0 

37. Using Exercise 36 or otherwise, prove that 


gced(Fin, Fy) = 1 forallm,n, O<m<n. 


38. Use Exercise 37 to prove that the number of primes is infinite. 


39. Recall that a Mersenne prime (see the discussion before 
Example 2.2.14) is a prime of the form 2? — 1, where 
p is prime. Prove that if m is composite, 2” — 1 is also 
composite. 
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Exercises 40-49 use the following notation and terminology. We 
let E denote the set of positive, even integers. Ifn € E can be writ- 
ten as a product of two or more elements in E, we say that n is 
E-composite; otherwise, we say that n is E-prime. As examples, 
4 is E-composite and 6 is E-prime. 


40. 
41. 
42. 
43. 
44. 


Is 2 E-prime or E-composite? 
Is 8 E-prime or E-composite? 
Is 10 E-prime or E-composite? 
Is 12 E-prime or E-composite? 


Show that the number 36 can be written as a product of 
E-primes in two different ways, which shows that factoring 
into E-primes is not necessarily unique. 
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45. 


46. 
47. 


48. 


49. 


Find a necessary and sufficient condition for an integer to be 
an E-prime. Prove your statement. 


Show that the set of E-primes is infinite. 


Show that there are no twin E-primes, that is, two E-primes 
that differ by 2. 


Show that there are infinitely many pairs of E-primes that dif- 
fer by 4. 


Give an example to show that the following is false: If an 
E-prime p divides mn € E, then p divides m or p divides n. 
“Divides” means “divides in E.” That is, if p,q € E, we say 
that p divides g in E if g = pr, where r € E. (Compare this 
result with Exercise 27, Section 5.3.) 


Representations of Integers 


and Integer Algorithms 


A bit is a binary digit, that is, a 0 or a 1. In a digital computer, data and instructions are 
encoded as bits. (The term digital refers to the use of the digits 0 and 1.) Technology 
determines how the bits are physically represented within a computer system. Today’s 


Go Online 

For more on 
representations of 
integers, see 
goo.gl/WyqJp9 


hardware relies on the state of an electronic circuit to represent a bit. The circuit must be 
capable of being in two states—one representing 1, the other 0. In this section we discuss 
the binary number system, which represents integers using bits, and the hexadecimal 
number system, which represents integers using 16 symbols. The octal number sys- 
tem, which represents integers using eight symbols, is discussed before Exercise 42. 


In the decimal number system, to represent integers we use the 10 symbols 0, 1, 
2, 3, 4, 5, 6, 7, 8, and 9. In representing an integer, the symbol’s position is significant; 
reading from the right, the first symbol represents the number of 1’s, the next symbol 
the number of 10’s, the next symbol the number of 100’s, and so on. For example, 


3854 = 3-10°+ 8-107 +5-10' +4-10° 


(see Figure 5.2.1). In general, the symbol in position n (with the rightmost symbol being 
in position 0) represents the number of 10”’s. Since 10° = 1, the symbol in position 0 
represents the number of 10°’s or 1’s; since 10! = 10, the symbol in position 1 represents 


the number of 10!’s or 10’s; since 107 = 


100, the symbol in position 2 represents the 


number of 107’s or 100’s; and so on. We call the value on which the system is based (10 
in the case of the decimal system) the base of the number system. 


100’s place (107) 


1000’s place (107) —_, 


Symbol 3. —— 


Symbol 2 


10’s place (10!) 


1’s place (10°) 


po 
3 8 5 4 
| seit 


hood 
Symbol 1 


Figure 5.2.1 The decimal number system. 


Example 5.2.1 
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In the binary (base 2) number system, to represent integers we need only two 
symbols, 0 and 1. In representing an integer, reading from the right, the first symbol 
represents the number of 1’s, the next symbol the number of 2’s, the next symbol the 
number of 4’s, the next symbol the number of 8’s, and so on. For example, in base si 


1011015 = 1-2°4+0-24 41-23 4+1-274+0-2! +1.2° 


(see Figure 5.2.2). In general, the symbol in position n (with the rightmost symbol be- 
ing in position 0) represents the number of 2”’s. Since 2° = 1, the symbol in position 0 
represents the number of 2°’s, or 1’s; since 2! = 2, the symbol in position 1 represents 
the number of 2!’s or 2’s; since 2? = 4, the symbol in position 2 represents the number 
of 22’s or 4’s; and so on. 


8’s place (27) 4’s place (27) 


16’s place (24) — ________ 2’s place (2!) 


32’s place (2°) —— 
y ey  F | 


1 
bot ff} 
Symbol 5 —— - Symbol 0 


Symbol 4 ———_ Symbol 1 


1’s place (2°) 


Symbol 3 Symbol 2 


Figure 5.2.2 The binary number system. 


Computer Representation of Integers Computer systems represent integers in bi- 
nary. Compute the number of bits necessary to represent a positive integer n. Deduce that 
Algorithm 5.1.8, which determines whether an integer is prime, is not a polynomial-time 
algorithm. 


SOLUTION Suppose that the binary representation of the positive k-bit integer n is 
n= 102! + dy_p2*? + ++ +b 92°, 


Now 


Qk! <n 


and 
n= 1-2k-b a by_o2*? as ++ + 92° 
< 1 Qeb a] he oes de 1 0S OF a 1 ee OF. 


(The last equality follows from the formula for the geometric sum; see Example 2.4.4.) 
Therefore 


When <2 


Without knowing which number system is being used, a representation is ambiguous; for example, 101101 
represents one number in decimal and quite a different number in binary. Often the context will make clear 
which number system is in effect; but when we want to be absolutely clear, we subscript the number to specify 
the base—the subscript 10 denotes the decimal system and the subscript 2 denotes the binary system. 
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Example 5.2.2 


Algorithm 5.2.3 


Taking logs, we obtain 
k—1<l|gn<k. 


Adding one gives 
k<l+lgn<k+l1. 


Therefore, k = |1+1gn]. Since k is the number of bits required to represent n, we have 
proved that the number of bits necessary to represent n is [1 + Ign]. 

The size s of an integer n input to Algorithm 5.1.8 is the number of bits necessary 
to represent n. Thus s satisfies 


s=|l+lgn] <14+lgn=1lg2+l]gn = lg(2n). 


Raising to the power 2 gives 2° < 2n. Dividing by 2 and taking square roots yields 


v3 < Jn. (5.2.1) 


The worst-case time of Algorithm 5.1.8 is ©(,/7). Thus its worst-case time T satisfies 
T>CJn (5.2.2) 


for some constant C. Combining inequalities (5.2.1) and (5.2.2), we obtain 
T > (C/ V2) (/2)*. Therefore, in the worst case, Algorithm 5.1.8 runs in exponential 
time in the input size s. We say that Algorithm 5.1.8 is not a polynomial-time algorithm. 4 


Binary to Decimal The binary number 101101, represents the number consisting of 
one I, no 2’s, one 4, one 8, no 16’s, and one 32 (see Figure 5.2.2). This representation 
may be expressed 


101101, = 1-2° 4+ 0-244 1-23 4+1-274+0-2! 4+1.-2°. 
Computing the right-hand side in decimal, we find that 


101101, = 1-324+0-164+1-8+1-4+0-2+4+1-1 
= 324+84+441=45). < 


We turn the method of Example 5.2.2 into an algorithm. We generalize by allowing 
an arbitrary base b. 


Converting an Integer from Base b to Decimal 
This algorithm returns the decimal value of the base b integer cyCy_) +++ 1Co.- 


Input: c,n,b 
Output: dec_val 


base_b_to_dec (c,n, b) { 
dec=vai—10) 
b_to_the_i= 1 
fori = Oton { 
dec_val = dec_val +c; * b_to_the_i 
b_to_the_i = b_to_the_ixb 
} 


return dec_val 


Example 5.2.4 
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Algorithm 5.2.3 runs in time ©(n). 


Show how Algorithm 5.2.3 converts the binary number 1101 to decimal. 
SOLUTION Here n = 3, b = 2, and 


c3 = 1, C= 1, c; = 0, co = 1. 


First, dec_val is set to 0, and b_to_the_i is set to 1. We then enter the for loop. 
Since i = 0 and b_to_the_i = 1, 


ci * b_to_thei=1*1=1. 


Thus dec_val becomes 1. Executing 
b_to_the_i = b_to_the_i*b 

sets b_to_the_i to 2. We return to the top of the for loop. 
Since i = | and b_to_the_i = 2, 


c; * b_to_the.i=0*«2=0. 


Thus dec_val remains |. Executing 
b_to_the_i = b_to_the_i*b 

sets b_to_the_i to 4. We return to the top of the for loop. 
Since i = 2 and b_to_the_i = 4, 


c; * b_to_thei=1*4=4. 


Thus dec_val becomes 5. Executing 
b_to_the_i = b_to_the_i* b 


sets b_to_the_i to 8. We return to the top of the for loop. 
Since i = 3 and b_to_the_i = 8, 


c, * b_to_thei=1*8=8. 


Thus dec_val becomes 13. Executing 
b_to_the_i = b_to_the_ixb 


sets b_to_the_i to 16. The for loop terminates and the algorithm returns 13, the decimal 
value of the binary number 1101. < 


Other important bases for number systems in computer science are base 8 or octal 
and base 16 or hexadecimal (sometimes shortened to hex). We will discuss the hexa- 
decimal system and leave the octal system to the exercises (see Exercises 45-50). 

In the hexadecimal number system, to represent integers we use the symbols 0, 
1, 2, 3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, and F. The symbols A-F are interpreted as 
decimal 10-15. (In general, in the base N number system, N distinct symbols, repre- 
senting 0, 1,2,...,N — 1 are required.) In representing an integer, reading from the 
right, the first symbol represents the number of 1’s, the next symbol the number of 
16’s, the next symbol the number of 167’s, and so on. For example, in 
base 16, 


B4F = 11-167 +4-16! + 15-16° 
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Example 5.2.5 


16’s place (161) 


256’s place (167) —_, ——— 1’s place (16°) 


Symbol 2. —— '_— Symbol 0 


Symbol 1 
Figure 5.2.3 The hexadecimal number system. 


(see Figure 5.2.3). In general, the symbol in position n (with the rightmost symbol being 
in position 0) represents the number of 16”’s. 


Hexadecimal to Decimal Convert the hexadecimal number B4F to decimal. 
SOLUTION We obtain 


B4F\6 = 11-167 +4-16' + 15+ 16° 
= 11-2564 4-16+ 15 = 2816 + 64 +4 15 = 2895 jo. < 
Algorithm 5.2.3 shows how to convert an integer in base b to decimal. Consider 


the reverse problem—converting a decimal number to base b. Suppose, for example, that 
we want to convert the decimal number 91 to binary. If we divide 91 by 2, we obtain 


45 
2)91 
8 
ll 
10 
1 


This computation shows that 
91=2-45+1. (5.2.3) 
We are beginning to express 91 in powers of 2. If we next divide 45 by 2, we find 
45 =2-22+1. (5.2.4) 
Substituting this expression for 45 into (5.2.3), we obtain 
91=2-4541 
=2-(2-2241)4+1 
=27-2242+1. (5.2.5) 
If we next divide 22 by 2, we find 
22 =2-11. 
Substituting this expression for 22 into (5.2.5), we obtain 
91 =27-224+2+41 
=2?.(2-11) +241 
=23-1142+1. (5.2.6) 


Example 5.2.6 


Algorithm 5.2.7 
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If we next divide 11 by 2, we find 
11=2-5+4+1. 
Substituting this expression for 11 into (5.2.6), we obtain 
91= 27-542 4241. (5.2.7) 
If we next divide 5 by 2, we find 
5=2-2+4+1. 
Substituting this expression for 5 into (5.2.7), we obtain 


91=2°-24244+2742+4+1 
= 294244274241 
= 1011011». 
The preceding computation shows that the remainders, as N is successively di- 
vided by 2, give the bits in the binary representation of N. The first division by 2 in 


(5.2.3) gives the 1’s bit; the second division by 2 in (5.2.4) gives the 2’s bit; and so on. 
We illustrate with another example. 


Decimal to Binary Write the decimal number 130 in binary. 


SOLUTION The computation shows the successive divisions by 2 with the remainders 
recorded at the right. 


2)130 remainder = 0 1’s bit 
2)65, remainder = | 2’s bit 
2)32 remainder = 0 4’s bit 
2)16 remainder = 0 8’s bit 


2)8 remainder = 0 16’s bit 

2)4 remainder = 0 32’s bit 

2)2 remainder = 0 64’s bit 

2)1 remainder = | 128’s bit 
0 


We may stop when the quotient is 0. Remembering that the first remainder gives the 
number of 1’s, the second remainder gives the number of 2’s, and so on, we obtain 


< 
13019 = 100000105. 
We turn the method of Example 5.2.6 into an algorithm. We generalize by allowing 
an arbitrary base b. 


Converting a Decimal Integer into Base b 


This algorithm converts the positive integer m into the base b integer ¢nCy_1 +++ C1Co- 
The variable n is used as an index in the sequence c. The value of m mod b is the 
remainder when m is divided by b. The value of |m/b]| is the quotient when m is 
divided by b. 
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Example 5.2.8 


Example 5.2.9 


Input: m,b 


Output: c,n 


dec_to_base_b(m, b,c, n) { 
n=-1 
while (m > 0) { 
n=n+1 
Crm smodip 
m= |m/b| 
} 
} 


Just as a binary integer m has |1 + 1g m| bits, a base b integer m has |1 + log, m| 
digits (see Exercise 55). Thus Algorithm 5.2.7 runs in time © (log, m). 


Show how Algorithm 5.2.7 converts the decimal number m = 11 to binary. 


SOLUTION The algorithm first sets n to —1. The first time we arrive at the while loop, 
m = 11 and the condition m > 0 is true; so we execute the body of the while loop. The 
variable n is incremented and becomes 0. Since m mod b = 11 mod 2 = 1, co is set to 
1. Since |m/b] = [11/2] = 5, mis set to 5. We return to the top of the while loop. 
Since m = 5, the condition m > 0 is true; so we execute the body of the while 
loop. The variable n is incremented and becomes 1. Since m mod b = 5 mod 2 = 1, c 
is set to 1. Since |m/b| = [5/2] = 2, mis set to 2. We return to the top of the while loop. 
Since m = 2, the condition m > 0 is true; so we execute the body of the while 
loop. The variable 1 is incremented and becomes 2. Since m mod b = 2 mod 2 = 0, c2 
is set to 0. Since |m/b| = |2/2| = 1, mis set to 1. We return to the top of the while loop. 
Since m = 1, the condition m > 0 is true; so we execute the body of the while 
loop. The variable n is incremented and becomes 3. Since m mod b = 1 mod 2 = 1, c3 
is set to 1. Since |m/b| = [1/2] = 0, mis set to 0. We return to the top of the while loop. 
Since m = 0, the algorithm terminates. The value 11 has been converted to the 
binary number c3c2¢\cy = 1011. < 


Decimal to Hexadecimal Convert the decimal number 20385 to hexadecimal. 


SOLUTION The computation shows the successive divisions by 16 with the remainders 
recorded at the right. 


16)20385 remainder = | 1’s place 
16)1274 remainder = 10 16’s place 
16)79 remainder = 15 16?’s place 
16)4 remainder = 4 16°’s place 

0 


We stop when the quotient is 0. The first remainder gives the number of 1’s, the second 
remainder gives the number of 16’s, and so on; thus we obtain 2038519 = 4FAli5. <4 


Next we turn our attention to addition of numbers in arbitrary bases. The same 
method that we use to add decimal numbers can be used to add binary numbers; however, 
we must replace the decimal addition table with the binary addition table 
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Sete | 


(In decimal, 1 + 1 = 2, and 2;9 = 10y; thus, in binary, 1 + 1 = 10.) 


Example 5.2.10 Binary Addition Add the binary numbers 10011011 and 1011011. 
SOLUTION We write the problem as 


10011011 
ele 1011011 


As in decimal addition, we begin from the right, adding 1 and 1. This sum is 10); thus 
we write 0 and carry 1. At this point the computation is 


1 
10011011 
+ _1011011 
0 


Next, we add 1 and 1 and 1, which is 11. We write 1 and carry 1. At this point, the 
computation is 


1 
10011011 
a 1011011 
10 


Continuing in this way, we obtain 


10011011 
a 1011011 
11110110 


Example 5.2.11 The addition problem of Example 5.2.10, in decimal, is 


155 
+ 91 
246 

< 


We turn the method of Example 5.2.10 into an algorithm. If the numbers to add 
are b,b,_1 --- bi bo and b),b),_, --- b.bp, at the iteration i > 0 the algorithm adds bj, bj, 
and the carry bit from the previous iteration. When adding three bits, say B), Bz, and B3, 
we obtain a two-bit binary number, say cb. For example, if we compute 1 + 0 + 1, the 
result is 102; in our notation, c = 1 and b = 0. By checking the various cases, we can 
verify that we can compute the binary sum B; + Bz + B3 by first computing the sum in 


decimal and then recovering c and b from the formulas 


b = (B, + By + B3) mod 2, c= |(B; + Bp + B3)/2]. 
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Algorithm 5.2.12 | Adding Binary Numbers 
This algorithm adds the binary numbers b,b,—1 «++ by bo and bi,b’_, -- - bby and stores 


n?n-1~ 


the sum in $,415)5,—1 +++ 5189. (Leading zeros can be appended to b or b’ so that this 
algorithm can be used to add integers with different numbers of bits.) 


Input: b,b',n 
Output: s 


binary_addition(b, b’,n, s) { 
carry = 0) 
fori = Oton { 
S; = (b; + b; + carry) mod 2 
carry = |(b; +b; +. carry) /2] 
} 
San Scary 


} 


Algorithm 5.2.12 runs in time @(n). 
Our next example shows that we can add hexadecimal numbers in the same way 
that we add decimal or binary numbers. 


Example 5.2.13 Hexadecimal Addition Add the hexadecimal numbers 84F and 42EA. 
SOLUTION The problem may be written 


84F 
+ 42EA 


We begin in the rightmost column by adding F and A. Since F is 15j9 and A is 1010, 
F+A = 15j9 + 109 = 2519 = 19}6. We write 9 and carry 1: 


1 
84F 

+ 42EA 
9 


Next, we add 1, 4, and E, obtaining 1316. We write 3 and carry 1: 


Continuing in this way, we obtain 


84F 
+ 42EA 
4B39 < 


Example 5.2.14 The addition problem of Example 5.2.13, in decimal, is 


2127 
+ 17130 
19257 < 


Example 5.2.15 
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We can multiply binary numbers by modifying the standard algorithm for multi- 
plying decimal numbers (see Exercise 67). 

We conclude by discussing a special algorithm, which we will need in Section 5.4, 
to compute powers mod z. We first discuss an algorithm to compute a power a” (without 
dealing with mod z). The straightforward way to compute this power is to repeatedly 
multiply by a 


a-d-+:a, 
SS 


nas 
which uses n — | multiplications. We can do better using repeated squaring. 

As a concrete example, consider computing a”®. We first compute a2 = a-a, 
which uses | multiplication. We next compute at = a?-a’, which uses | additional 
multiplication. We next compute a’ = a*-a*, which uses 1 additional multiplication. 
We next compute a! = a®-a®, which uses | additional multiplication. So far, we have 
used only 4 multiplications. Noting that the expansion of 29 in powers of 2, that is the 
binary expansion, is 


29=1+4+8+ 16, 


we see that we can compute a”? as 


@ =a'.a! da", 

which uses 3 additional multiplications for a total of 7 multiplications. The straightfor- 
ward technique uses 28 multiplications. 

In Example 5.2.6, we saw that the remainders when n is successively divided by 
2 give the binary expansion of n. If the remainder is 1, the corresponding power of 2 is 
included; otherwise, it is not included. We can formalize the repeated squaring technique 
if, in addition to repeated squaring, we simultaneously determine the binary expansion 
of the exponent. 


Figure 5.2.4 shows how a”? is calculated using repeated squaring. Initially x is set to a, 
and n is set to the value of the exponent, 29 in this case. We then compute n mod 2. Since 
this value is 1, we know that 1 = 2° is included in the binary expansion of 29. Therefore 
a’ is included in the product. We track the partial product in Result; so Result is set to 
a. We then compute the quotient when 29 is divided by 2. The quotient 14 becomes the 


new value of n. We then repeat this process. 


Current Value Quotient When n 
x of n n mod 2 Result Divided by 2 
a 29 1 a 14 
ae 14 0 Unchanged 7 
at ri 1 a-at=a 3 
a 3 1 a-a® = als 1 
a6 1 1 axqlé = 4? 0 


Figure 5.2.4 Computing a” using repeated squaring. 


We square x to obtain a”. We then compute n mod 2. Since this value is 0, we know 
that 2 = 2! is not included in the binary expansion of 29. Therefore a? is not included in 
the product, and Result is unchanged. We then compute the quotient when 14 is divided 
by 2. The quotient 7 becomes the new value of n. We then repeat this process. 


234 


Chapter 5 @ Introduction to Number Theory 


Algorithm 5.2.16 


Theorem 5.2.17 


We square x to obtain a+. We then compute n mod 2. Since this value is 1, we 
know that 4 = 2? is included in the binary expansion of 29. Therefore a* is included 
in the product. Result becomes a>. We then compute the quotient when 7 is divided 
by 2. The quotient 3 becomes the new value of n. The process continues until n 
becomes 0. < 


We state the method of repeated squaring as Algorithm 5.2.16. 


Exponentiation by Repeated Squaring 


This algorithm computes a” using repeated squaring. The algorithm is explained in 
Example 5.2.15. 


Input: a,n 
Output: a” 
exp_via_repeated_squaring(a, n) { 

result = 1 

P= ea 

while (n > 0) { 
if (imod’2:——ml)) 

result = result * x 

Te RES 
n= |n/2| 

} 


return result 


The number of times that the while loop executes is determined by n. The variable 
n is repeatedly halved 


n= |n/2| 


and when n becomes 0, the loop terminates. Example 4.3.14 shows that it takes time 
© (1g) to reduce n to 0 by repeated halving. In the body of the while loop, at most two 
multiplications are performed. Thus, the number of multiplications is at most © (1g 7), 
an improvement over the straightforward algorithm that uses ©(n) multiplications. The 
bottleneck in Algorithm 5.2.16 is the size of the numbers involved. The value returned 
a” requires lg a” = nlga bits in its representation. Thus, simply to copy the final value 
into Result takes time, at least Q(nlga), which is exponential in the size of n (see 
Example 5.2.1). 

In Section 5.4, we will need to compute a” mod z for large values of a and n. In this 
case, a” will be huge; so it is impractical to compute a” and then compute the remainder 
when a” is divided by z. We can do much better. The key idea is to compute the remainder 
after each multiplication thereby keeping the numbers relatively small. The justification 
for this technique is given in our next theorem. 


If a, b, and z are positive integers, 


ab mod z = [(a mod z)(b mod z)] mod z. 


Example 5.2.18 
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Proof Let w=ab mod z, x=a mod z, and y=b mod z. Since w is the remainder 
when ab is divided by z, by the quotient-remainder theorem, there exists g; such that 


ab =q\z+w. 
Thus 
w = ab— qiz. 
Similarly, there exists gz and q3 such that 
a=Qzt+x, b=qzty. 
Now 
w=ab—qz 
= (qr + x)(Q3z + Y) — HZ 


= (q2qaz + qoy + qax — qi)z + xy 
= qzt+xy, 


where g = qoq3z + q2y + q3x — qi. Therefore, 
xy = —qz+w; 


that is, w is the remainder when xy is divided by z. Thus, w= xy mod z, which trans- 
lates to 


ab mod z = [(a mod z)(b mod z)] mod z. 4 


Show how to compute 572”? mod 713 using Algorithm 5.2.16 and Theorem 5.2.17. 
SOLUTION To compute a??, we successively computed 
a, a@=a-a, a> =a -a, a” =a->-a 


(see Example 5.2.15). To compute a”’ mod z, we successively compute 


a mod z, @ mod Z a® mod z, @° mod z. 


Each multiplication is performed using Theorem 5.2.17. We compute a” using the 


formula 


a’ mod z = [(a mod z)(a mod z)] mod z. 


We compute a* using the formula 


4 


a mod z= a" 


a’ mod z = (a mod z)(a* mod z)] mod z, 


and so on. 
We compute a? using the formula 


@ mod z = aa* mod z = [(a mod z)(a* mod z)] mod z. 
We compute a!? using the formula 


5 


a? mod z = aa’ mod z = [(a> mod z)(aé mod z)] mod z, 


and so on. 
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The following shows the computation of 572”? mod 713: 


5727 mod 713 = (572 mod 713)(572 mod 713) mod 713 = 5727 


mod 713 = 630 
572* mod 713 = (572? mod 713)(572? mod 713) mod 713 = 6307 
mod 713 = 472 
572° mod 713 = (572+ mod 713)(572* mod 713) mod 713 = 4727 
mod 713 = 328 
572!© mod 713 = (5728 mod 713)(572® mod 713) mod 713 = 3287 
mod 713 = 634 
572° mod 713 = (572 mod 713)(572* mod 713) mod 713 = 572 +472 
mod 713 = 470 
572!3 mod 713 = (572° mod 713)(5728 mod 713) mod 713 = 470-328 
mod 713 = 152 
572” mod 713 = (572!3 mod 713)(572'° mod 713) mod 713 = 152-634 
mod 713 = 113. 


The number 572”? has 80 digits, so Theorem 5.2.17 indeed simplifies the 
computation. < 


The technique demonstrated in Example 5.2.18 is formalized as Algorithm 5.2.19. 


Algorithm 5.2.19 | Exponentiation Mod z by Repeated Squaring 


This algorithm computes a” mod z using repeated squaring. The algorithm is 
explained in Example 5.2.18. 


Input: a,n,z 
Output: a” modn 


exp_mod_z_via_repeated_squaring(a, n, Z) { 


Go Online ce ae 
For a C++ program a ; 
implementing this kaos om ee ee 1 
algorithm, see sais Gen 7 ) mod 
ai y PU. result = (resu x me 
goo.gl/mYpqPU x = (x * x) mod z 
} 


return result 


The key difference between Algorithms 5.2.16 and 5.2.19 is the size of the num- 
bers that are multiplied. In Algorithm 5.2.19, the numbers multiplied are remainders 
after division by z and so have magnitude less than z. If we modify the usual method of 
multiplying base 10 integers for base 2, it can be shown (see Exercise 68) that the time 
required to multiply a and b is O(galgb). Since the while loop in Algorithm 5.2.19 
executes @(lg 7) times, the total time for Algorithm 5.2.19 is O(gn lg? z). 
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5.2 Problem-Solving Tips 


= To convert the base b number c,b” + cy_1b"-! +--+ + c,b! + cob® to decimal, 
carry out the indicated multiplications and additions in decimal. 


= To convert the decimal number n to base b, divide by b, divide the resulting quo- 
tient by b, divide the resulting quotient by b, and so on, until obtaining a zero 
quotient. The remainders give the base b representation of n. The first remainder 
gives the 1’s coefficient, the next remainder gives the b’s coefficient, and so on. 


= When multiplying modulo z, compute the remainders as soon as possible to min- 
imize the sizes of the numbers involved. 


5.2 Review Exercises 


iL 


What is the value of the decimal number d,d,_ 1 ...djdo? 
(Each d; is one of 0-9.) 


. What is the value of the binary number b,b,—1 ...b1b9? (Each 


b; is 0 or 1.) 


. What is the value of the hexadecimal number hyhyp_| ... hy ho? 


(Each h; is one of 0-9 or A-F.) 


. How many bits are required to represent the positive integer n? 


. Explain how to convert from binary to decimal. 


. Explain how to convert from decimal to binary. 

. Explain how to convert from hexadecimal to decimal. 
. Explain how to convert from decimal to hexadecimal. 
. Explain how to add binary numbers. 

. Explain how to add hexadecimal numbers. 

. Explain how to compute a” using repeated squaring. 


. Explain how to compute a” mod z using repeated squaring. 


In Exercises 29-34, express each hexadecimal number in decimal. 


How many bits are needed to represent each integer in Exercises 


1-10? 
1. 60 2. 63 3. 64 
4. 127 5. 128 6. 21000 
ce 31000 8. g!000 9. 3481 
10. 6.87 x 1048 


In Exercises 11-16, express each binary number in decimal. 


11. 
13. 
15. 


1001 12. 11011 
11011011 14. 100000 
11111111 16. 110111011011 


In Exercises 17-22, express each decimal number in binary. 


17. 
20. 


19. 223 
22. 12,340 


34 18. 61 
400 21. 1024 


In Exercises 23-28, add the binary numbers. 


- 1001+ 1111 

- 110110 + 101101 

- 110110101 + 1101101 

- 1101 + 101100 + 11011011 


24. 11011 +1101 
26. 101101 + 11011 


3A 30. 1E9 31. 3E7C 

. A03 33. 209D 34. 4BO7A 

. Express each binary number in Exercises 11-16 in 
hexadecimal. 

. Express each decimal number in Exercises 17-22 in 
hexadecimal. 


. Express each hexadecimal number in Exercises 29, 30, and 32 


in binary. 


In Exercises 38-42, add the hexadecimal numbers. 


. 4A + B4 39. 195 + 76E 

. 49F7 + C66 41. 349CC + 922D 

. 82054 + AEFA3 

. Does 2010 represent a number in binary? in decimal? in 
hexadecimal? 

- Does 1101010 represent a number in binary? in decimal? in 
hexadecimal? 
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In the octal (base 8) number system, to represent integers we use 
the symbols 0, 1, 2, 3, 4, 5, 6, and 7. In representing an integer, 
reading from the right, the first symbol represents the number of 
1’s, the next symbol the number of 8’s, the next symbol the num- 
ber of 87's, and so on. In general, the symbol in position n (with 
the rightmost symbol being in position 0) represents the number of 
8”’s. In Exercises 45—50, express each octal number in decimal. 
45. 63 46. 7643 47. 7711 

48. 10732 49. 1007 50. 537261 

51. Express each binary number in Exercises | 1—16 in octal. 

52. Express each decimal number in Exercises 17—22 in octal. 
53. Express each hexadecimal number in Exercises 29-34 in octal. 
54. Express each octal number in Exercises 45—50 in hexadecimal. 
55. Does 1101010 represent a number in octal? 


56. Does 30470 represent a number in binary? in octal? in deci- 
mal? in hexadecimal? 


57. Does 9450 represent a number in binary? in octal? in decimal? 
in hexadecimal? 


58. Prove that a base b integer m has |1 + log, m|] digits. 


Chapter 5 @ Introduction to Number Theory 


In Exercises 59-61, trace Algorithm 5.2.16 for the given value 
of n. 


59. n= 16 60. n= 15 61. n= 80 


In Exercises 62-64, trace Algorithm 5.2.19 for the given values of 
a, n, and z. 

62. a=5,n=10,z=21 

63. a= 143,n=.10, 27 = 230 

64. a = 143,n = 100, z = 230 


65. Let T,, denote the highest power of 2 that divides n. Show that 
Tin = Tn + Ty for all m,n > 1. 


66. Let S,, denote the number of 1’s in the binary representation 
of n. Use induction to prove that T,, = n — S, for alln > 1. 
(T,, is defined in the previous exercise.) 


67. Modify the usual method of multiplying base 10 integers for 
base 2 to produce an algorithm to multiply binary numbers 
bmbm—1 +++ bi bo and bib «++ bi bo. 

68. Show that the time required by the algorithm of Exercise 67 
to multiply a and b is O(igalgb). 


5.3 The Euclidean Algorithm 


Go Online 
For more on the 
Euclidean algorithm, see 


goo.gl/WyqJp9 integers. 


In Section 5.1, we discussed some methods of computing the greatest common divi- 
sor of two integers that turned out to be inefficient. The Euclidean algorithm is an 
old, famous, and efficient algorithm for finding the greatest common divisor of two 


The Euclidean algorithm is based on the fact that if r = a mod b, then 


gcd(a, b) = gcd(b, r). (5.3.1) 


Before proving (5.3.1), we illustrate how the Euclidean algorithm uses it to find the 


greatest common divisor. 


Example 5.3.1 


Since 105 mod 30 = 15, by (5.3.1) 


gcd(105, 30) = gcd(30, 15). 


Since 30 mod 15 = 0, by (5.3.1) 


gcd(30, 15) = ged(15, 0). 


By inspection, gcd(15, 0) = 15. Therefore, 


gcd(105, 30) = gced(30, 15) = gcd(15, 0) = 15. < 


We next prove equation (5.3.1). 


Theorem 5.3.2 


If a is a nonnegative integer, b is a positive integer, and r = a mod J, then 


gcd(a, b) = gcd(b, r). 


Algorithm 5.3.3 
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Proof By the quotient-remainder theorem, there exist g and r satisfying 
C= bq +r 0 <r< b. 


We show that the set of common divisors of a and b is equal to the set of common divisors 
of b and r, thus proving the theorem. 

Let c be a common divisor of a and b. By Theorem 5.1.3(c), c | bg. Since c | a and 
c| bq, by Theorem 5.1.3(b), c|a — bg (= r). Thus c is a common divisor of b and r. 
Conversely, if c is a common divisor of b and r, then c | bg and c| bq + r (= a) and c is 
a common divisor of a and b. Thus the set of common divisors of a and b is equal to the 
set of common divisors of b and r. Therefore, gcd(a, b) = gcd(b, r). < 


We next formally state the Euclidean algorithm as Algorithm 5.3.3. 


Euclidean Algorithm 


This algorithm finds the greatest common divisor of the nonnegative integers a and 
b, where not both a and b are zero. 


Input: a and b (nonnegative integers, not both zero) 
Output: Greatest common divisor of a and b 


1. gcd(a, b) { 
2 // make a largest 
3 if (a < b) 
4 swap(a, b) 
5, while (b —= 0) { 
6. r=amodb 
W =D) 
8 p= 
9. } 

10. return a 

A eee 


We note that the while loop in the Euclidean algorithm (lines 5—9) always termi- 
nates since at the bottom of the loop (lines 7 and 8), the values of a and b are updated 
to smaller values. Since nonnegative integers cannot decrease indefinitely, eventually b 
becomes zero and the loop terminates. 

Let G = gcd(a, b), where a and b are the values input to Algorithm 5.3.3. We can 
prove that Algorithm 5.3.3 is correct by verifying that G = gcd(a, b) is a loop invariant, 
where now a and b denote the variables in the pseudocode. 

By definition, the loop invariant is true the first time we arrive at line 5. Sup- 
pose that G = gcd(a, b) is true prior to another iteration of the loop and that b # 0. 
Theorem 5.3.2 then tells us that after line 6 executes, gcd(a, b) = gcd(b, r). At lines 7 
and 8, a becomes b and b becomes r. Therefore, G = gcd(a, b) is true for the new values 
of a and b. It follows that G = gcd(a, b) is a loop invariant. The while loop terminates 
when b becomes 0. At this point, the loop invariant becomes G = gcd(a, 0). The algo- 
rithm then returns a [= gcd(a, 0)]. Thus the value that the algorithm returns is G, which 
by definition is the greatest common divisor of the input values. Therefore, Algorithm 
5.3.3 is correct. 

Algorithm 5.3.3 correctly finds the greatest common divisor if lines 3 and 4 are 
omitted (see Exercise 15). We include these lines because it simplifies the analysis of 
Algorithm 5.3.3 in the next subsection. 
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Example 5.3.4 Show how Algorithm 5.3.3 finds gcd(504, 396). 


SOLUTION Let a = 504 and b = 396. Since a> b, we move to line 5. Since b 4 0, 
we proceed to line 6, where we set r to 


a mod b = 504 mod 396 = 108. 


We then move to lines 7 and 8, where we set a to 396 and b to 108. We then return to 
line 5. 
Since b £ 0, we proceed to line 6, where we set r to 


a mod b = 396 mod 108 = 72. 


We then move to lines 7 and 8, where we set a to 108 and b to 72. We then return to 
line 5. 
Since b # 0, we proceed to line 6, where we set r to 


amod b = 108 mod 72 = 36. 


We then move to lines 7 and 8, where we set a to 72 and b to 36. We then return to line 5. 
Since b £ 0, we proceed to line 6, where we set r to 


amod b = 72 mod 36 = 0. 


We then move to lines 7 and 8, where we set a to 36 and b to 0. We then return to line 5. 
This time b = 0, so we skip to line 10, where we return a (36), the greatest common 
divisor of 396 and 504. < 


Analysis of the Euclidean Algorithm 


We analyze the worst-case performance of Algorithm 5.3.3. We define the time required 
to be the number of modulus operations executed at line 6. Table 5.3.1 lists the number 
of modulus operations required for some small input values. 


TABLE 5.3.1 m Number of Modulus Operations Required by the Euclidean 
Algorithm for Various Values of the Input 


b| 0 1 2 3 4 =55 6 #67 8 9 10 11 12 13 
a 
0 — 0 0 0 0 0 0 0 0 0 0 0 0 0 
1 0 1 1 1 1 1 1 1 1 1 1 1 1 1 
2 0 1 1 2 1 2 1 2 1 2 1 2 1 2 
3 0 1 2 1 2 3 1 2 3 1 2 3 1 2 
4 0 1 1 2 1 2 2 %3 1 2 Z 3 1 2 
5 0 1 2 3 2 1 2 3 4 3 1 2 | 4 
6 0 1 1 1 2 @ 1 2 ae 3 3 1 2 
7 0 1 2 2 3 ) 2 1 2 3 3 4 4 3 
8 0 1 1 3 1 4 2 2 1 2 2 4 2 5 
9 0 1 2 1 2 3 2. 3 2 1 2 3 2 3 
10 0 1 1 2 2 1 3 3 2 2 1 2 2 3 
11 0 1 2 3 3 2 3 4 4 3 2 1 2 3 
12 0 1 1 1 1 3 1 4 2 2 2 2 1 2 
13 0 1 2 2 2 4 2 3 5 3 3 3 2 1 


TABLE 5.3.2 @ Smallest 
Input Pair That Requires n 
Modulus Operations in the 
Euclidean Algorithm 


n 
a b (= number of 
modulus operations) 
1 O 0 
2 1 1 
3 2 2 
> 3 3 
8 5 4 
13. 8 5 


Theorem 5.3.5 
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The worst case for the Euclidean algorithm occurs when the number of modulus 
operations is as large as possible. By referring to Table 5.3.1, we can determine the 
input pair a,b, a > b, with a as small as possible, that requires n modulus operations 
forn = 0,...,5. The results are given in Table 5.3.2. 

Recall that the Fibonacci sequence {f,,} (see Section 4.4) is defined by the equations 


AHl fhp=l, fr=fr-itfr2 123. 


The Fibonacci sequence begins 1, 1,2, 3,5,8,.... A surprising pattern develops in 
Table 5.3.2: The a column is the Fibonacci sequence starting with f2 and, except for 
the first value, the b column is also the Fibonacci sequence starting with f2! We are led 
to conjecture that if the pair a,b, a > b, when input to the Euclidean algorithm requires 
n > 1 modulus operations, then a > f42 and b > f,41. As further evidence of our 
conjecture, if we compute the smallest input pair that requires six modulus operations, 
we obtain a = 21 and b = 13. Our next theorem confirms that our conjecture is correct. 
The proof of this theorem is illustrated in Figure 5.3.1. 


Suppose that the pair a,b, a > b, requires n > 1 modulus operations when input 
to the Euclidean algorithm. Then a > f,42 and b > fn+1, where {f,} denotes the 
Fibonacci sequence. 


Proof = The proof is by induction on n. 


Basis Step (n = 1) 


We have already observed that the theorem is true if n = 1. 


Inductive Step 
Assume that the theorem is true for n > 1. We must show that the theorem is true for 
n+1. 

Suppose that the pair a,b, a > b, requires n + 1 modulus operations when input 
to the Euclidean algorithm. At line 6, we compute r = a mod b. Thus 


a=bq+r O0<r<b. (5.3.2) 


The algorithm then repeats using the values b and r, b > r. These values require n 
additional modulus operations. By the inductive assumption, 


b> fr42 and r> fnti- (5.3.3) 
Combining (5.3.2) and (5.3.3), we obtain 
a= bq +re= b+r = fr42 t+ frtt = fr43- (5.3.4) 


[The first inequality in (5.3.4) holds because g > 0; q cannot equal 0, because a > b.] 
Inequalities (5.3.3) and (5.3.4) give 


a > fn+3 and b > frt2- 
The inductive step is finished and the proof is complete. < 


We may use Theorem 5.3.5 to analyze the worst-case performance of the Euclidean 
algorithm. 
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Theorem 5.3.6 


34 = 91 mod 57 (1 modulus operation) 

57, 34 requires 4 modulus operations (to make a total of 5) 

57 = fe and 34 = fs (by inductive assumption) 
91 =57-14+ 342574342 f6t+h=f7 


Figure 5.3.1 The proof of Theorem 5.3.5. The pair 91, 57, which requires 
n+ 1=5 modulus operations, is input to the Euclidean algorithm. 


If integers in the range 0 to m,m > 8, not both zero, are input to the Euclidean 
algorithm, then at most 


2m 
log3/2 ou 


modulus operations are required. 


Proof Let n be the maximum number of modulus operations required by the Eu- 
clidean algorithm for integers in the range 0 to m, m > 8. Let a, b be an input pair in the 
range O to m that requires n modulus operations. Table 5.3.1 shows that n > 4 and that 
a # b. We may assume that a > b. (Interchanging the values of a and b does not alter 
the number of modulus operations required.) By Theorem 5.3.5, a > fr42. Thus 


Sny2 Sm. 


By Exercise 27, Section 4.4, since n + 2 > 6, 


3 n+l 
(5) < fny2- 


Combining these last inequalities, we obtain 


3 n+l 
owe 


Taking the logarithm to the base 3/2, we obtain 
n+1 < log). m. 


Therefore, 


3 2m 
n< (log3/5 m)—-1= log3/9 m— log3/9 3 = log3/ = < 


Because the logarithm function grows so slowly, Theorem 5.3.6 tells us that the 
Euclidean algorithm is quite efficient, even for large values of the input. For example, 
since 

2(1,000,000 
egg I aks 
3 
the Euclidean algorithm requires at most 33 modulus operations to compute the greatest 
common divisor of any pair of integers, not both zero, in the range 0 to 1,000,000. 


Theorem 5.3.7 


Example 5.3.8 
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A Special Result 


The following special result will be used to compute inverses modulo an integer (see the 
following subsection). Such inverses are used in the RSA cryptosystem (see Section 5.4). 
However, this special result is also useful in other ways (see Exercises 27 and 29 and the 
following Problem-Solving Corner). 


If a and b are nonnegative integers, not both zero, there exist integers s and ¢ such that 
gcd(a, b) = sa+ tb. 
The method of the Euclidean algorithm can be used to prove Theorem 5.3.7 and to 


compute s and ¢. Before proving the theorem, we first illustrate the proof with a specific 
example. 


Consider how the Euclidean algorithm computes gcd(273, 110). We begin with a = 273 
and b = 110. The Euclidean algorithm first computes 


r = 273 mod 110 = S53. (5.3.5) 


It then sets a = 110 and b = 53. 
The Euclidean algorithm next computes 


r= 110 mod 53 = 4. (5.3.6) 


It then sets a = 53 and b = 4. 
The Euclidean algorithm next computes 


r=53mod4=1. (5.3.7) 


It then sets a = 4 andb=1. 
The Euclidean algorithm next computes 


r=4mod1=0. 


Since r = 0, the algorithm terminates, having found the greatest common divisor of 273 
and 110 to be 1. 

To find s and t, we work back, beginning with the last equation [equation (5.3.7)] 
in which r ¥ 0. Equation (5.3.7) may be rewritten as 


1=53—4-13 (5.3.8) 


since the quotient when 53 is divided by 4 is 13. 
Equation (5.3.6) may be rewritten as 


4=110— 53-2. 
We then substitute this formula for 4 into equation (5.3.8) to obtain 
1 =53 —4-13 = 53 — (110 — 53 -2)13 = 27-53 — 13-110. (5.3.9) 
Equation (5.3.5) may be rewritten as 


53 = 273 — 110-2. 
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We then substitute this formula for 53 into equation (5.3.9) to obtain 
1 = 27-53 — 13-110 = 27(273 — 110-2) — 13-110 = 27-273 — 67-110. 
Thus, if we take s = 27 and t = —67, we obtain 


gcd(273, 110) = 1 = 5-273 + t-110. < 


Proof of Theorem 5.3.7. Givena > b > 0, let m = a, rr; = b, and r; equal the 

value of r after the (i — 1)st time the while loop is executed in Algorithm 5.3.3 (e.g., 

r2 = a mod b). Suppose that r, is the first r-value that is zero so that gcd(a, b) = rp_1. 
In general, 


Yi = Vit Gi42 + Vi42- (5.3.10) 
Taking i = n — 3 in (5.3.10), we obtain 
Tn-3 = Tn-29n-1 + Vn-1, 


which may be rewritten as 


Th-1 = —n-1!n-2 +1 °Ty-3- 
We may take f,-3 = —gn—1 and s,_3 = | to obtain 
Tn-1 = tn—-3!n-2 + Sn—3ln-3- (5.3.11) 


Taking i = n — 4 in (5.3.10), we obtain 
Tr—4 = Tn—39n—-2 T+ Vn-2 
or 
Tn-2 = —Qn-2"n—-3 + n-4- (5.3.12) 
Substituting (5.3.12) into (5.3.11), we obtain 


4-1 tr—31—-n—2T n—-3 a Tn—4] + Sp—3ln-3 


= [-th-3n—2 + Sn—3]Pn—3 + tn—3T na. 
Setting t,-4 = —t—-39n—2 + Sn—3 and S,_4 = t,-3, We obtain 
Tn-1 = tn—aln—3 + Sn—4Tn—4- 
Continuing in this way, we ultimately obtain 
gced(ro, ri) = Mn—1 = tori + Soro = tob + soa. 
Taking s = so and t = fo, we have 


gcd(ro, r)) = sa + tb. < 


We next give an algorithm to compute s and ¢ satisfying gcd(a, b) = sa + tb, 
where a and b are nonnegative integers not both zero. To compute s and ¢, the proof 
of Theorem 5.3.7, which is illustrated in Example 5.3.8, first finds gcd(a, b) and then 
works backward from the last remainder obtained during the computation of gcd(a, b) 
to the first remainder obtained. Since recursion elegantly handles such a backward com- 
putation, we will write a recursive algorithm to compute s and f. We begin by writing 


Algorithm 5.3.9 
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a recursive version of the Euclidean algorithm (Algorithm 5.3.9). We can then modify 
this recursive algorithm to obtain a recursive algorithm to compute s and f. 


Recursive Euclidean Algorithm 


This algorithm recursively finds the greatest common divisor of the nonnegative in- 
tegers a and b, where not both a and b are zero. 


Input: a and b (nonnegative integers, not both zero) 
Output: Greatest common divisor of a and b 


gcdr(a, b) { 

// make a largest 

if (a < b) 
swap(a, b) 

its (Di ——10) 
return a 

r=amodb 

return gcdr(b, r) 


Algorithm 5.3.9 first makes a largest. If b is zero, it correctly returns a. Otherwise, 
Algorithm 5.3.9 computes r = a mod b and returns the greatest common divisor of b 
and r, which is correct since Theorem 5.3.2 tells us that gcd(b, r) = gced(a, b). 

To compute s and t, we modify Algorithm 5.3.9. We call the modification STgcdr. 
The idea is that whenever we compute the greatest common divisor, we also compute 
the values of s and t. These values are stored in added parameters named s and f. 

Consider first the case when b is zero. Then gcd(a, b) = a. Here we must set 
s = 1. Since b is zero, t could be assigned any value; we choose t = 0. The first part of 
the modification of Algorithm 5.3.9 looks like: 


STgcdr(a, b, s, t) { 

// make a largest 

if (a < Db) 
swap(a, b) 

if (b == 0) { 
s=1 
t=0 
//now a= sa+tb 
return a 


Next, Algorithm 5.3.9 computes r = a mod b and gcdr(b, r). Our modified algo- 
rithm will compute r = amodb and STgcdr(b, r,s’, t'). Thus s’ and f’ satisfy 
g=s'b+fr, where g = gcd(b,r). We must compute s and ¢ in terms of the avail- 
able values. If we let g be the quotient of a divided by b, we have a = bq +r. Therefore, 
using the fact that r = a — bq, we have 


g=sb+tr 
=sb+t'(a— bq) 
=ta+(s' —t'q)b. 


Thus if we set s = ¢’ and t = s’—t'g, we have g = sa+tb. The formal algorithm follows. 
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Algorithm 5.3.10 


Computing s and t of Theorem 5.3.7 


This algorithm computes s and ¢ satisfying gcd(a, b) = sa + tb, where a and b are 
nonnegative integers not both zero, and returns gcd(a, b). 


Input: a and b (nonnegative integers, not both zero) 


Output: s and t of Theorem 5.3.7 (stored in parameters s and t) and the 


greatest common divisor of a and b (which is returned) 


STgcdr(a, b, s, t) { 
// make a largest 
if (a < b) 
swap(a, b) 
if (b == 0) { 
Sl 
i— 0) 
// now a = sa+ tb 
return a 
} 
q = |a/b| 
r=amod b 
Ha=bq+r 
ign Siligcdr, (Duras eit) 
Ng=s'b+tr 
Mo.g=tat+(s' —tq)b 
csi 
t=s'—ft «xq 
return g 


Computing an Inverse Modulo an Integer 


Suppose that we have two integers n > 0 and @ > 1| such that gcd(n,@) = 1. We 
show how to efficiently compute an integer s,0 < s < @ such that ns mod ¢ = 1. We 
call s the inverse of n mod @. Efficiently computing this inverse is required by the RSA 
cryptosystem in Section 5.4. 

Since gcd(n, 6) = 1, we use the Euclidean algorithm, as explained previously, to 
find numbers s’ and fr’ such that s’n + t'@ = 1. Then ns’ = —1'¢ + 1, and, since ¢ > 1, 
1 is the remainder. Thus 

ns’ mod ¢ = 1. (5.3.13) 

Note that s’ is almost the desired value; the problem is that s’ may not satisfy 0 < s’ < ¢. 
However, we can convert s’ to the proper value by setting s = s’ mod ¢. Now 0 < 
s < @.In fact s £ 0 since, if s = 0, then @ | s’, which contradicts (5.3.13). Since 
s = s’ mod @, there exists g such that s’ = gd + s. Combining the previous equations, 
we have 


to+l1 


ns =ns' — ong = ong = ¢(-t — nq) +1. 


Therefore 


ns mod ¢ = 1. (5.3.14) 
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Example 5.3.11 Letn = 110 and ¢ = 273. In Example 5.3.8, we showed that gcd(n, @) = 1 and that 
—— ———_ ‘n+ t'@ = 1, where s’ = —67 and ¢’ = 27. Thus, 


110(—67) mod 273 = ns’ mod ¢ = 1. 


Here s = s’ mod ¢ = —67 mod 273 = 206. Therefore, the inverse of 110 modulo 273 
is 206. < 


We conclude by showing that the number s in equation (5.3.14) is unique. 
Suppose that 


ns mod ¢ = 1 = ns’ mod ¢, 0<s<4, O<s' <¢. 


We must show that s’ = s. Now 
s’ = (s’ mod ¢)(ns mod ¢) = s‘ns mod ¢ = (s'n mod ¢)(s mod ¢) = s. 


Therefore, the number s in equation (5.3.14) is unique. 


5.3 Problem-Solving Tips 


The Euclidean algorithm for computing the greatest common divisor of nonnegative 
integers a and b, not both zero, is based on the equation 


gcd(a, b) = gced(b, r), 


where r=a mod b. We replace the original problem, compute gcd(a, b), with the prob- 
lem, compute gcd(b, r). We then replace a by b and b by r, and repeat. Eventually r = 0, 
so the solution is gcd(b, 0) = b. 

The Euclidean algorithm is quite efficient. If integers in the range 0 to m, m > 8, 
not both zero, are input to the Euclidean algorithm, then at most 


2m 


log3/2 ca 


modulus operations are required. 
If a and b are nonnegative integers, not both zero, there exist integers s and f 
such that 


gcd(a, b) = sa + tb. 


To compute s and f, use the Euclidean algorithm. In a problem that involves the greatest 
common divisor, the preceding equation may be helpful. (Try Exercises 27 and 29.) 

Suppose that we have two integers n > 0 and @ > | such that gcd(n, @) = 1. To 
efficiently compute an integer s,0 < s < @ such that ns mod ¢ = 1, first compute s’ 
and ?’ satisfying 


gcd(n,d) =s' n+ 


(see the subsection Computing an Inverse Modulo an Integer). Then set s = s’ mod ¢. 


5.3 Review Exercises 


1. State the Euclidean algorithm. 3. Ifthe pair a, b, a > b, requires n > | modulus operations when 


:-Wihnt wey tice tthe Baxiutonthe Budidean alvonitiint input to the Euclidean algorithm, how are a and b related to the 
: : Fibonacci sequence? 
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. Integers in the range 0 to m, m > 8, not both zero, are input to 


the Euclidean algorithm. Give an upper bound for the number 
of modulus operations required. 


. Theorem 5.3.7 states that there exist integers s and t such that 


gcd(a, b) = sa + tb. Explain how the Euclidean algorithm can 
be used to compute s and f. 
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6. 
as 


Explain what it means for s to be the inverse of n modulo ¢. 


Suppose that gcd(n,@)=1. Explain how to compute the 
inverse of n modulo ¢. 


Use the Euclidean algorithm to find the greatest common divisor 
of each pair of integers in Exercises 1-12. 


1. 60, 90 2. 110,273 3. 220, 1400 
4. 315, 825 5. 20, 40 6. 331, 993 

7. 2091, 4807 8. 2475, 32670 9. 67942, 4209 
10. 490256, 337 11. 27, 27 


17. 


18. 


19. 


20. 


21. 
22. 


23. 


24. 


25. 
26. 


« 57853125, 555111200 


. For each number pair a, b in Exercises 1-12, find integers s 


and ¢t such that sa + th = gcd(a, b). 


. Find two integers a and b, each less than 100, that maximize 


the number of iterations of the while loop of Algorithm 5.3.3. 


. Show that Algorithm 5.3.3 correctly finds gcd(a, b) even if 


lines 3 and 4 are deleted. 


. Write a recursive version of the Euclidean algorithm that ex- 


ecutes the check for a < b and the call to the swap function 
one time. Hint: Use two functions. 


If a and b are positive integers, show that gcd(a, b) 
gcd(a,a+b). 


Show that if a > b > 0, then 
gcd(a, b) = gcd(a — b, b). 


Using Exercise 18, write an algorithm to compute the greatest 
common divisor of two nonnegative integers a and b, not both 
zero, that uses subtraction but not the modulus operation. 


How many subtractions are required by the algorithm of 
Exercise 19 in the worst case for numbers in the range 0 
tom? 


Extend Tables 5.3.1 and 5.3.2 to the range 0 to 21. 


Exactly how many modulus operations are required by the 
Euclidean algorithm in the worst case for numbers in the 
range 0 to 1,000,000? 

Prove that when the pair fn+2,fn41 is input to the Euclidean 
algorithm, n > 1, exactly n modulus operations are required. 
Show that for any integer k > 1, the number of modulus 
operations required by the Euclidean algorithm to compute 
gcd(a, b) is the same as the number of modulus operations 
required to compute gcd(ka, kb). 

Show that ged (fp, fn41) = 1, n> 1. 

Suppose that d > 0 is a common divisor of nonnegative inte- 
gers a and b, not both zero. Prove that d | gcd(a, b). 


«27, 


28. 


29. 


Show that if p is a prime number, a and b are positive integers, 
and p| ab, then p|a or p|b. 


Give an example of positive integers p, a, and b where p | ab, 
pia, and p jb. 
Let m and n be positive integers. Let f be the function from 


X= {0,1,...,m— 1} 
to X defined by 
f(x) = nx mod m. 


Prove that f is one-to-one and onto if and only if 
gcd(m, n) = 1. 


Exercises 30-34 show another way to prove that if a and b are non- 
negative integers, not both zero, there exist integers s and t such that 


gcd(a, b) = sa+ tb. 


However, unlike the Euclidean algorithm, this proof does not lead 
to a technique to compute s and t. 


30. 


31. 
32. 
33. 


34. 


Let 
X = {sa+ tb | sa+ tb > O ands and ¢ are integers}. 


Show that X is nonempty. 
Show that X has a least element. Let g denote the least element. 
Show that if c is a common divisor of a and b, then c divides g. 


Show that g is a common divisor of a and b. Hint: Assume 
that g does not divide a. Then a = qg+r, 0 <r < g. Obtain 
a contradiction by showing that r € X. 


Show that g is the greatest common divisor of a and b. 


In Exercises 35-41, show that gcd(n, ¢) = 1, and find the inverse 
s of n modulo @ satisfying 0 < s < ¢. 


35, 
37. 
39. 
41. 
42. 


43. 


n=2,¢=3 36. n=1,6 =47 
n=7,¢@=20 38. n=11,¢=47 
n= 50, ¢ = 231 40. n= 100, @ = 231 
n= 100, @ = 243 


Show that 6 has no inverse modulo 15. Does this contradict 
the result preceding Example 5.3.11? Explain. 


Show that n > 0 has an inverse modulo ¢ > 1 if and only if 
gcd(n, @) = 1. 
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Making Postage 


Problem 


Let p and q be positive integers satisfying 
gcd(p, q) = 1. Show that there exists n such that for 
all k > n, postage of k cents can be achieved by using 
only p-cent and q-cent stamps. 


Attacking the Problem 


Does this type of problem sound familiar? Example 
2.5.1 used mathematical induction to show that postage 
of four cents or more can be achieved by using only 
2-cent and 5-cent stamps. This result illustrates the 
problem for p = 2 and g = S. In this case, if we take 
n = 4, forall k > 4, postage of k cents can be achieved 
by using only 2-cent and 5-cent stamps. At this point, 
you should review the induction proof of this result. 

The induction proof of the p = 2, g = 5 problem 
can be summarized as follows. We first proved the base 
cases (k = 4, 5). In the inductive step, we assumed that 
we could make postage of k — 2 cents. We then added 
a 2-cent stamp to achieve k cents postage. We will imi- 
tate this inductive proof to prove that, if gcd(p, q) = 1 
for arbitrary p and gq, there exists n such that for all 
k > n, postage of k cents can be achieved by using 
only p-cent and q-cent stamps. 


Finding a Solution 


Let’s first take care of a trivial case. If either p or g is 1, 
we can make k cents postage for all k > 1 by using k 
1-cent stamps. Thus, we assume that p > | andgq > 1. 

Let’s first develop some notation. For a particular 
amount of postage, we’ll let n, denote the number of 
p-cent stamps used and n, denote the number of g-cent 
stamps used. Then the amount of postage is 


NpP + Ngq- 
Does this expression remind you of anything? Theo- 
rem 5.3.7 states that there exist integers s and f such that 


1 = gced(p, gq) = sp + tq. (1) 


This last equation suggests that we can make postage 
of 1 cent by using s p-cent stamps and ¢ g-cent stamps. 
The problem is that one of s or tf must be negative in 
order for the sum sp + tq to be 1 (since p and q are 
greater than 1). In fact, since p and g are both greater 
than 1, one of s or fis positive and the other is negative. 
We assume that s > 0 andt < 0. 

Let’s see how the inductive step should work 
and then see what basis steps we need. Imitating the 


specific case discussed previously, we would like to as- 
sume that we can make k — p cents postage and then 
add a p-cent stamp to make k cents postage. Nothing 
to it! In order for this inductive step to work, our basis 
steps must ben,n+1,...,n+p— 1 for some n that 
we get to choose. 
Suppose that we can make n-cents postage: 
N = Npp + Ngqq.- 


Because of equation (1), we can then make (n+ 1)- 
cents postage 
n+ 1= (pp +ngq) + (sp + tq) = (Mp + 5)p + (ng + 1g 
using m, + s p-cent stamps and n, + t q-cent stamps. 
Of course, this last statement is meaningful only if 
Ny +s = Oandn, +t = 0. However, ny +5 >= 0 
holds because n, = 0 and s > 0. We can arrange for 
ng +t = 0 to hold by choosing ng = —t. 
Similarly, we can make (n + 2)-cents postage 
n+2 = (Mp +ngq) +2(sp + tq) 
= (np +25)p + (nq + 21)4 
using 1, + 2s p-cent stamps and ng + 2t q-cent stamps. 
As before, np + 2s > 0 holds because n, > 0 and 
s > 0. We can arrange for ng + 2t = 0 to hold by 
choosing ng = —2t. Notice that for this choice of ny, 
Ng = —t also holds [so that we can still make (n + 1) 
cents postage, too]. 

In general, we can make (n + i)-cents postage 

n+i = (mp+ngq) + i(sp + tg) 

= (np +is)p + (ng + it)g 
using n, + is p-cent stamps and n, + it g-cent stamps. 
As before, n, +is => O holds because n, > Oands > 0. 
We can arrange for ng + it = 0 to hold by choosing 
Ng = —it. Notice that for this choice of ng, ng = —jt 
also holds forj = 0, ..., i—1 [so that we can stil] make 
(n +/) cents postage, too]. 

It follows that we can make postage for n, 
n+1,...,n+p—1 provided that we choose ng = 
— (p—1)t. Any value for n, = 0 will do, so we take 
Ny = 0. This makes n = ngg = —(p — 1)tq. 


Formal Solution 


If either p or g equals 1, we may take n = 1; so as- 
sume that p > | and g > 1. By Theorem 5.3.7, there ex- 
ist integers s and ¢ such that sp + tg = 1. Since p> 1 
and qg>1,s # Oandtr 4 O. Furthermore, either s or 
t is negative. We may suppose that t<0. Then s > 0. 
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Let n = —t(p — 1)q. We next show that we can make 
postage forn,n+ 1,...,n-+p— 1 using only p-cent 
and q-cent stamps. 

Now 


n+j = —t(p— 1)q+J(sp + tq) 
= (js)p + (-t(p — 1) + jt)q. 


If0 <j < p—1, then 


tp — 1) + jt >= tp — 1) + (p— 1) =0. 


Therefore we can make postage forn+j,0 <j < 
p-—1, by using js p-cent stamps and —f(p — 1) + jt q- 
cent stamps. 

Finally, we use induction to show that we can 
make postage of n cents or more using only p-cent and 
q-cent stamps. The basis steps aren, n+1,...,n+p— 
1. Suppose that k > n+p and that we can make postage 
for m satisfying n < m < k. In particular, we can make 
postage for k — p. Add a p-cent stamp to make postage 
for k. The inductive step is complete. 


Summary of Problem-Solving Techniques 


= Look for a similar problem. We already encoun- 
tered a postage problem in Example 2.5.1. 


5.4 The RSA Public-Key Cryptosystem 
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= Try to use some of the ideas in a similar problem. 
To solve our problem, we were able to modify the 
induction proof in Example 2.5.1. 


= Sometimes notation, setting, or context will 
suggest something useful. In our problem, the 
postage-amount equation npp + ngq was similar 
in form to the greatest common divisor formula 
gcd(p, q) = sp + tq. Combining these equations 
was crucial to proving the base cases. 

= Don’t be afraid to make assumptions. If an as- 
sumption turns out to be unwarranted, it can 
sometimes be modified so that the modified as- 
sumption is correct. In our problem, we assumed 
that if we could make n-cents postage using np 
p-cent stamps and n, g-cent stamps, we could 
then make (n + 1)-cents postage by using np + 5 
p-cent stamps and n, +t g-cent stamps. We were 
able to force this latter statement to be true by 
choosing ng = —t. 


Exercise 


1. Show that if gcd(p, g) > 1, it is false that there ex- 
ists n such that for all k > n, postage of k cents can 
be achieved by using only p-cent and g-cent stamps. 


Cryptology is the study of systems, called cryptosystems, for secure communications. 
In acryptosystem, the sender transforms the message before transmitting it, hoping that 
only authorized recipients can reconstruct the original message (i.e., the message before 
it was transformed). The sender is said to encrypt the message, and the recipient is said to 
decrypt the message. If the cryptosystem is secure, unauthorized persons will be unable 
to discover the decryption technique, so even if they read the encrypted message, they 
will be unable to decrypt it. Cryptosystems are important for large organizations (e.g., 
government and military), all internet-based businesses, and individuals. For example, 
if a credit card number is sent over the internet, it is important for the number to be read 
only by the intended recipient. In this section, we look at some algorithms that support 


secure communication. 


In one of the oldest and simplest systems, the sender and receiver each have a key 
that defines a substitute character for each potential character to be sent. Moreover, the 
sender and receiver do not disclose the key. Such keys are said to be private. 


Example 5.4.1 Ifa key is defined as 


character: 
replaced by: 


~ABCDEFGHI JKLMNOPQRSTUVWXYZ 
EI JFUAXVHWP_GSRKOBTQYDMLZNC 


the message SEND_MONEY would be encrypted as QARUESKRAN. The encrypted message 
SKRANEKRELIN would be decrypted as MONEY_ON_WAY. 


< 


Go Online 

For more on the 
RSA public-key 
cryptosystem, see 
goo.gl/WyqJp9 


Example 5.4.2 
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Simple systems such as that in Example 5.4.1 are easily broken since certain letters 
(e.g., Ein English) and letter combinations (e.g., ER in English) appear more frequently 
than others. Also, a problem with private keys in general is that the keys have to be 
securely sent to the sender and recipient before messages can be sent. We devote the re- 
mainder of this section to the RSA public-key cryptosystem, named after its inventors, 
Ronald L. Rivest, Adi Shamir, and Leonard M. Adleman, that is believed to be secure. In 
the RSA system, each participant makes public an encryption key and hides a decryption 
key. To send a message, all one needs to do is look up the recipient’s encryption key in 
a publicly distributed table. The recipient then decrypts the message using the hidden 
decryption key. The RSA system is used in all major secure internet transactions, for 
example, for secure financial transactions and secure e-mail exchanges. 

In the RSA system, messages are represented as numbers. For example, each char- 
acter might be represented as a number. If a blank space is represented as 1, A as 2, B as 
3, and so on, the message SEND_MONEY would be represented as 20, 6, 15, 5, 1, 14, 16, 
15, 6, 26. If desired, the integers could be combined into the single integer 


20061505011416150626 


(note that leading zeros have been added to all single-digit numbers). 

We next describe how the RSA system works, present a concrete example, and 
then discuss why it works. Each prospective recipient chooses two primes p and qg and 
computes z = pq. Next, the prospective recipient computes ¢ = (p — 1)(g — 1) 
and chooses an integer n such that gcd(n, @) = 1. The pair z,n is then made public. 
Finally, the prospective recipient computes the unique number s, 0 < s < @, satisfying 
ns mod @ = 1. (An efficient way to compute s is given in Section 5.3.) The number s is 
kept secret and used to decrypt messages. 

To send the integer a, 0 < a < z—1, to the holder of public key z, n, the sender 
computes c = a" mod z and sends c. (Algorithm 5.2.19 provides an efficient way to 
compute a” mod z.) To decrypt the message, the recipient computes c* mod z, which 
can be shown to be equal to a. 


Suppose that we choose p=23,qg=31, and n=29. Then z=pq=713 and ¢= 
(p — 1)(q — 1) = 660. Now s = 569 since ns mod ¢ = 29 -569 mod 660 = 16501 mod 
660 = 1. The pair z, n = 713, 29 is made publicly available. 

To transmit a = 572 to the holder of public key 713, 29, the sender computes 
c =a" mod z = 572” mod 713 = 113 and sends 113. The receiver computes c’ mod 
z= 113° mod 713 = 572 in order to decrypt the message. < 


The main result that makes encryption and decryption work is that 
a“ modz=a for allO <a < zandumod¢=1 


(for a proof, see [Rivest]). Using this result and Theorem 5.2.17, we may show that 
decryption produces the correct result. Since ns mod ¢ = 1, 


c’ mod z = (a” mod z)* mod z = (a”)* mod z = a” mod z= a. 


The security of the RSA encryption system relies mainly on the fact that currently 
there is no efficient algorithm known for factoring integers; that is, currently no algorithm 
is known for factoring d-bit integers in polynomial time, O(d*). Thus if the primes p and 
q are chosen large enough, it is impractical to compute the factorization z = pq. If the 
factorization could be found by a person who intercepts a message, the message could 
be decrypted just as the authorized recipient does. At this time, no efficient algorithm 
is known for factoring an arbitrary integer with 1024 or more bits. Thus, if p and p 
are chosen so that z = pq is at least 1024 bits long and other technical requirements are 


252 Chapter 5 @ Introduction to Number Theory 


implemented (e.g., p and g should not be chosen “too close” together), RSA would seem 
to be secure. (An integer having 1024 bits has over 300 decimal digits.) 

The first description of the RSA encryption system was in Martin Gardner’s 
February 1977 Scientific American column (see [Gardner, 1977]). Included in this col- 
umn were an encoded message using the key z, n, where z was the product of 64- and 
65-digit primes, and n = 9007, and an offer of $100 to the first person to crack the code. 
At the time the article was written, it was estimated that it would take 40 quadrillion 
years to factor z. In fact, in April 1994, Arjen Lenstra, Paul Leyland, Michael Graff, and 
Derek Atkins, with the assistance of 600 volunteers from 25 countries using over 1600 
computers, factored z (see [Taubes]). The work was coordinated on the internet. 

Another possible way a message could be intercepted and decrypted would be to 
compute the nth root of c mod z, that is, compute an integer a satisfying c = a" mod z. 
This computation would give a, the decrypted value. Again, currently there is no efficient 
algorithm known for computing nth roots mod z. It is also conceivable that a message 
could be decrypted by some means other than factoring integers or taking nth roots mod 
z. For example, Paul Kocher proposed a way to break RSA based on the time it takes 
to decrypt messages (see [English]). The idea is that distinct secret keys require distinct 
amounts of time to decrypt messages and, by using this timing information, an unau- 
thorized person might be able to unveil the secret key and thus decrypt the message. To 
thwart such attacks, implementors of RSA have taken steps to alter the observed time to 


decrypt messages. 


5.4 Review Exercises 


1. To what does “cryptology” refer? 
2. What is a cryptosystem? 
3. What does it mean to “encrypt a message”? 


4. What does it mean to “decrypt a message”? 


5. In the RSA public-key cryptosystem, how does one encrypt a 
and send it to the holder of public key z, n? 


6. In the RSA public-key cryptosystem, how does one decrypt c? 


7. On what does the security of the RSA encryption system rest? 


1. Encrypt the message COOL_BEAVIS using the key of Exam- 
ple 5.4.1. 


2. Decrypt the message UTWR_ENKDTEKMIGYWRA using the key 
of Example 5.4.1. 


3. Encrypt the message I_AM_NOT_A_CROOK using the key of 
Example 5.4.1. 


4. Decrypt the message JDQHLHIF_AU using the key of Exam- 
ple 5.4.1. 


. Encrypt 333 using the public key 713, 29 of Example 5.4.2. 
. Decrypt 411 using s = 569 as in Example 5.4.2. 
. Encrypt 241 using the public key 713, 29 of Example 5.4.2. 
. Decrypt 387 using s = 569 as in Example 5.4.2. 


ror~aIr_nn 


In Exercises 9-13, assume that we choose primes p = 17, q = 23, 
andn = 31. 


9. Compute z. 10. Compute ¢. 
12. Encrypt 101 using the public key z, n. 


13. Decrypt 250. 


11. Compute s. 


In Exercises 14-18, assume that we choose primes p = 59, 


q= 101, andn=4l. 
14. Compute z. 15. Compute ¢. 
17. Encrypt 584 using the public key z, n. 


18. Decrypt 250. 


16. Compute s. 


Chapter 5 Notes 


An accessible introduction to elementary number theory is [Niven, 1980]. An extended dis- 
cussion of the greatest common divisor, including historical background, and other elemen- 
tary number theory topics are in [Knuth, 1998a]. 

Full details of the RSA cryptosystem may be found in [Rivest]. [Pfleeger] is devoted 


to computer security. 
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Chapter 5 Review 


Section 5.3 


Section 5.1 


ean 


. ddividesn:d|n 
. d does not divide n: d {n 


d is a divisor or factor of n 
Prime 


. Composite 
. Fundamental theorem of arithmetic: any integer greater 


than | can be written as the product of primes 
Common divisor 


. Greatest common divisor 
. Common multiple 
10. 


Least common multiple 


Section 5.2 


11. 
12. 
13. 
14. 


15. 
16. 
17. 
18. 
19. 
20. 
21. 
22. 
23. 
24. 
25. 


Bit 

Decimal number system 

Binary number system 

Computer representation of integers: when represented in 
binary, the positive integer n requires [1 + lgnJ bits 
Hexadecimal number system 

Base of a number system 

Convert binary to decimal 

Convert decimal to binary 

Convert hexadecimal to decimal 

Convert decimal to hexadecimal 

Add binary numbers 

Add hexadecimal numbers 

Compute a” using repeated squaring 

ab mod z = [(a mod z)(b mod z)] mod z 

Compute a” mod z using repeated squaring 


Chapter 5 Self-Test 


kw bn = 


rrntnn 


. Trace Algorithm 5.1.8 for the input n = 539. 

. Write the binary number 10010110 in decimal. 

. Find the prime factorization of 539. 

. Use the Euclidean algorithm to find the greatest common 


divisor of the integers 396 and 480. 


. Find ged(2 +5? +7? «134, 74-13? +17). 
. Find lem(2 +57 +7? - 134, 74-13? +17). 
. Write the decimal number 430 in binary and hexadecimal. 


. Given that log3/2 100 = 11.357747, provide an upper bound 


for the number of modulus operations required by the Eu- 
clidean algorithm for integers in the range 0 to 100,000,000. 


. Trace Algorithm 5.2.16 for the value n = 30. 


26. 
27. 


28. 


29. 


30. 


31. 


Euclidean algorithm 

If the pair a,b, a > b, requires n > 1 modulus oper- 
ations when input to the Euclidean algorithm, then a > 
Snt2 and b > fnii, where {f,} denotes the Fibonacci 
sequence. 

If integers in the range 0 to m,m > 8, not both zero, are 
input to the Euclidean algorithm, then at most 


2m 
logs). 3 


modulus operations are required. 

Ifa and b are nonnegative integers, not both zero, there exist 
integers s and ¢ such that gcd(a, b) = sa + tb. 

Compute s and ¢ such that gcd(a, b) = sa + tb using the 
Euclidean algorithm 

Compute an inverse modulo an integer 


Section 5.4 


32. 
33. 
34. 
35. 
36. 


37. 


12. 


. Trace Algorithm 5.2.19 for the values a = 


Cryptology 

Cryptosystem 

Encrypt a message 

Decrypt a message 

RSA public key cryptosystem: To encrypt a and send it to 
the holder of public key z,n, compute c = a" mod z and 
send c. To decrypt the message, compute c* mod z, which 
can be shown to be equal to a. 

The security of the RSA encryption system relies mainly on 
the fact that currently there is no efficient algorithm known 
for factoring integers. 


z=11, 


. Use the Euclidean algorithm to find integers s and t satisfy- 


ing s +396 + t-480 = gcd(396, 480). 


Show that gcd(196, 425) = | and find the inverse s of 196 
modulo 425 satisfying 0 < s < 425. 


In Exercises 13-16, assume that we choose primes p = 13,q = 


17, 


13. 
14. 
15. 
16. 


and n = 19. 


Compute z and @. 

Compute s. 

Encrypt 144 using public key z, n. 
Decrypt 28. 
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Chapter 5 Computer Exercises 


an bh WwW 


. Implement Algorithm 5.1.8, testing whether a positive 


integer is prime, as a program. 


. Write a program that converts among decimal, hexadeci- 


mal, and octal. 


. Write a program that adds binary numbers. 

. Write a program that adds hexadecimal numbers. 

. Write a program that adds octal numbers. 

. Implement Algorithm 5.2.16, exponentiation by repeated 


squaring, as a program. 


. Implement Algorithm 5.2.19, exponentiation mod z, as a 


program. 


8. 


10. 


11. 


Write recursive and nonrecursive programs to compute the 
greatest common divisor. Compare the times required by 
the programs. 


. Implement Algorithm 5.3.10, computing s and f satisfying 


gcd(a, b) = sa + th, as a program. 


Write a program that, given integers n > 0 and @¢ > 1, 
gcd(n, #) = 1, computes the inverse of n mod @. 


Implement the RSA public-key cryptosystem. 


Chapter 6 


> COUNTING METHODS 
<0) AND THE PIGEONHOLE 
PRINCIPLE 


6.1 Basic Principles In many discrete problems, we are confronted with the problem of counting. For exam- 
6.2 Permutations and ple, in Section 4.3 we saw that in order to estimate the run time of an algorithm, we 
Combinations needed to count the number of times certain steps or loops were executed. Counting also 
6.3 Generalized plays a crucial role in probability theory. Because of the importance of counting, a va- 
Permutations and riety of useful aids, some quite sophisticated, have been developed. In this chapter we 
Combinations develop several tools for counting. These techniques can be used to derive the binomial 
6.4 Algorithms for theorem. The chapter concludes with a discussion of the Pigeonhole Principle, which 
Generating often allows us to prove the existence of an object with certain properties. 
Permutations 
and Combinations 
16.5 Introduction to 
Discrete Probability 
16.6 Discrete Probability 
Theory 
6.7 Binomial Coefficients 
and Combinatorial 
Identities 
6.8 The Pigeonhole 
Principle . : : 
6.1 Basic Principles 
Go Online The menu for Kay’s Quick Lunch is shown in Figure 6.1.1. As you can see, it features 
For more on basic two appetizers, three main courses, and four beverages. How many different dinners 
principles, see consist of one main course and one beverage? 
goo.g1/WROK11 If we list all possible dinners consisting of one main course and one beverage, 


HT, HM, HC, HR, CT, CM, CC, CR, FT, FM, FC, FR, 


we see that there are 12 different dinners. (The dinner consisting of a main course whose 
first letter is X and a beverage whose first letter is Y is denoted XY. For example, CR 
refers to the dinner consisting of a cheeseburger and root beer.) Notice that there are 
three main courses and four beverages and 12 = 3 -4. 

There are 24 possible dinners consisting of one appetizer, one main course, and 
one beverage: 


| These sections can be omitted without loss of continuity. 


255 


256 Chapter 6 ® Counting Methods and the Pigeonhole Principle 


APPETIZERS 
Nachos 


Salad... . 


MAIN COURSES 


Hamburger 


Cheeseburger... 
Fish Filet 


Figure 6.1.1 Kay’s Quick Lunch 
menu. 


NHT, NHM, NHC, NHR, NCT, NCM, NCC, NCR, 
NFT, NFM, NFC, NFR, SHT, SHM, SHC, SHR, 
SCT, SCM, SCC, SCR, SFT, SFM, SFC, SFR. 


(The dinner consisting of an appetizer whose first letter is X, a main course whose first 
letter is Y, and a beverage whose first letter is Z is denoted XYZ.) Notice that there are 
two appetizers, three main courses, and four beverages and 24 = 2-3-4. 

In each of these examples, we found that the total number of dinners was equal to 
the product of numbers of each of the courses. These examples illustrate the Multipli- 
cation Principle. 


Multiplication If an activity can be constructed in t successive steps and step | can be done in ny; 
Principle ways, step 2 can then be done in m2 ways, ..., and step f can then be done in n,; ways, 
then the number of different possible activities is ny +z ---n;. 


In the problem of counting the number of dinners consisting of one main course 
and one beverage, the first step is “select the main course” and the second step is “select 
the beverage.” Thus n} = 3 and nm) = 4 and, by the Multiplication Principle, the total 
number of dinners is 3-4 = 12. Figure 6.1.2 shows why we multiply 3 times 4—we 
have three groups of four objects. 

We may summarize the Multiplication Principle by saying that we multiply to- 
gether the numbers of ways of doing each step when an activity is constructed in 
successive steps. 


Example 6.1.1 How many dinners are available from Kay’s Quick Lunch consisting of one main course 
and an optional beverage? 


Example 6.1.2 


Example 6.1.3 
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Hamburger Cheeseburger Fish Filet 
Tea | | Milk| |Cola| |R°°) | Tea | |Milk| |Cola| /R°°") | Tea | |Mitk| | Cola] |ROot 
Beer Beer Beer 


HT HM HC HR CT CM CC CR FT FM FC FR 


Figure 6.1.2 An illustration of the Multiplication Principle. 


SOLUTION We may construct a dinner consisting of one main course and an optional 
beverage by a two-step process. The first step is “select the main course” and the second 
step is “select an optional beverage.” There are nj =3 ways to select the main course 
(hamburger, cheeseburger, fish filet) and ny = 5 ways to select the optional beverage 
(tea, milk, cola, root beer, none). By the Multiplication Principle, there are 3-5 = 15 
dinners. As confirmation, we list the 15 dinners (N = no beverage): 


HT, HM, HC, HR, HN, CT, CM, CC, CR, CN, FT, FM, FC, FR, FN. < 


Melissa Virus In the late 1990s, a computer virus named Melissa wreaked havoc by 
overwhelming system resources. The virus was spread by an e-mail message containing 
an attached word processor document with a malicious macro. When the word processor 
document was opened, the macro forwarded the e-mail message and the attached word 
processor document to the first 50 addresses obtained from the user’s address book. 
When these forwarded copies were received and opened, the macro again forwarded the 
e-mail message and the attached word processor document, and so on. The virus caused 
problems by creating messages faster than they could be sent. The to-be-sent messages 
were temporarily stored on a disk. If the disk got full, the system could deadlock or 
even crash. 

After the virus sent the e-mail to the first 50 addresses, each of those recipients then 
sent e-mail to 50 addresses. By the Multiplication Principle, there were then 50-50 = 
2500 additional recipients. Each of these recipients, in turn, sent e-mail to SO addresses. 
Again, by the Multiplication Principle, there were then 50 - 50 -50 = 125,000 additional 
recipients. After one more iteration, there were then 50-50-50 -50 = 6,250,000 addi- 
tional recipients. Thus after just four iterations, 


6,250,000 + 125,000 + 2500 + 50 + 1 = 6,377,551 


copies of the message had been sent. 
Notice that we could have used the geometric sum (Example 2.4.4) 


1-(505 — 1 
1-50°+1-50°+1-507+1-50+1= ee = 6,377,551 


to calculate how many copies of the message were sent. < 


(a) How many strings of length 4 can be formed using the letters ABCDE if repeti- 
tions are not allowed? 


(b) How many strings of part (a) begin with the letter B? 
(c) How many strings of part (a) do not begin with the letter B? 
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Example 6.1.4 


Example 6.1.5 


Example 6.1.6 


SOLUTION 

(a) We use the Multiplication Principle. A string of length 4 can be constructed in 
four successive steps: Choose the first letter; choose the second letter; choose 
the third letter; and choose the fourth letter. The first letter can be selected in 
five ways. Once the first letter has been selected, the second letter can be se- 
lected in four ways. Once the second letter has been selected, the third letter 
can be selected in three ways. Once the third letter has been selected, the fourth 
letter can be selected in two ways. By the Multiplication Principle, there are 
5-4-3-+2 = 120 strings. 

(b) The strings that begin with the letter B can be constructed in four successive 
steps: Choose the first letter; choose the second letter; choose the third letter; 
and choose the fourth letter. The first letter (B) can be chosen in one way, the 
second letter in four ways, the third letter in three ways, and the fourth letter in 
two ways. Thus, by the Multiplication Principle, there are 1-4-3 -2 = 24 strings 
that start with the letter B. 


(c) Part (a) shows that there are 120 strings of length 4 that can be formed using 
the letters ABCDE, and part (b) shows that 24 of these start with the letter B. 
It follows that there are 120 — 24 = 96 strings that do not begin with the 
letter B. < 


In a digital picture, we wish to encode the amount of light at each point as an eight-bit 
string. How many values are possible at one point? 


SOLUTION An eight-bit encoding can be constructed in eight successive steps: Se- 
lect the first bit; select the second bit; ...; select the eighth bit. Since there are two 
ways to select each bit, by the Multiplication Principle the total number of eight-bit 
encodings is 


2+2+2+2+2-2-2-2 = 28 = 256. 4 


We next give a proof using the Multiplication Principle that a set with n elements 
has 2” subsets. We previously gave a proof of this result using mathematical induction 
(Theorem 2.4.6). 


Use the Multiplication Principle to prove that a set {x),...,x,} containing n elements 
has 2” subsets; that is, the cardinality of the power set of an n-element set is 2”. 


SOLUTION A subset can be constructed in n successive steps: Pick or do not pick x1; 
pick or do not pick x2; ...; pick or do not pick x,. Each step can be done in two ways. 
Thus the number of possible subsets is 


2+2---2=2", 
a < 
n factors 


Let X be an n-element set. How many ordered pairs (A, B) satisfy A C B C X? 


SOLUTION Given an ordered pair (A, B) satisfying A C B C X, we see that each 
element in X is in exactly one of A, B — A, or X — B. Conversely, if we assign each 
element of X to one of the three sets A (and, by assumption, also to B and X), B—A (and, 
by assumption, also to X), or X — B, we obtain a unique ordered pair (A, B) satisfying 
A C BC X. Thus the number of ordered pairs (A, B) satisfying A C B C X is equal to 


Example 6.1.7 


Example 6.1.8 


Example 6.1.9 
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the number of ways to assign the elements of X to the three sets A, B— A, and X — B. We 
can make such assignments by the following n-step process: Assign the first element of 
X to one of A, B—A, X —B; assign the second element of X to one of A, B—A, X—B;...; 
assign the nth element of X to one of A, B — A, X — B. Since each step can be done in 
three ways, the number of ordered pairs (A, B) satisfying A C B C X is 


3303 = 3", 
ae 


n factors < 


How many reflexive relations are there on an n-element set? 


SOLUTION We count the number of n x n matrices that represent reflexive relations 
on an n-element set X. Since (x, x) is in the relation for all x € X, the main diagonal of 
the matrix must consist of 1’s. There is no restriction on the remaining entries; each can 
be 0 or 1. Ann x n matrix has n? entries and the diagonal contains n entries. Thus there 
are n? — n off-diagonal entries. Since each can be assigned values in two ways, by the 
Multiplication Principle there are 


De Dane? — 2-1 


n° — n factors 


4 F ¥ 2 
matrices that represent reflexive relations on an n-element set. Therefore there are 2” ~” 
reflexive relations on an n-element set. < 


Internet Addresses The internet is a network of interconnected computers. Each com- 
puter interface on the internet is identified by an internet address. In the IPv4 (Internet 
Protocol, Version 4) addressing scheme, the addresses are divided into five 
classes—Class A through Class E. Only Classes A, B, and C are used to identify com- 
puters on the internet. Class A addresses are used for large networks; Class B addresses 
are used for medium-size networks; and Class C addresses are used for small networks. 
In this example, we count the number of Class A addresses. Exercises 80-82 deal with 
Class B and C addresses. 

A Class A address is a bit string of length 32. The first bit is 0 (to identify it as a 
Class A address). The next 7 bits, called the netid, identify the network. The remaining 
24 bits, called the hostid, identify the computer interface. The netid must not consist of 
all 1’s. The hostid must not consist of all 0’s or all 1’s. Arguing as in Example 6.1.4, 
we find that there are 2’ 7-bit strings. Since 1111111 is not allowed as a netid, there 
are 27 — | netids. Again, arguing as in Example 6.1.4, we find that there are 27+ 24-bit 
strings. Since the two strings consisting of all 0’s or all 1’s are not allowed as a hostid, 
there are 274 — 2 hostids. By the Multiplication Principle, there are 


Qr= 1)(274 — 2) = 127- 16,777,214 = 2,130,706,178 
Class A internet addresses. Because of the tremendous increase in the size of the in- 


ternet, IPv6 (Internet Protocol, Version 6) uses 128-bit addresses rather than 32-bit ad- 
dresses. < 


Next, we illustrate the Addition Principle by an example and then present the 
principle. 


How many eight-bit strings begin either 101 or 111? 
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Addition Principle 


Example 6.1.10 


Example 6.1.11 


SOLUTION An eight-bit string that begins 101 can be constructed in five successive 
steps: Select the fourth bit; select the fifth bit; ...; select the eighth bit. Since each of 
the five bits can be selected in two ways, by the Multiplication Principle, there are 


2+2-2-2-2=2) = 32 


eight-bit strings that begin 101. The same argument can be used to show that there are 
32 eight-bit strings that begin 111. Since there are 32 eight-bit strings that begin 101 and 
32 eight-bit strings that begin 111, there are 32 + 32 = 64 eight-bit strings that begin 
either 101 or 111. < 


In Example 6.1.9 we added the numbers of eight-bit strings (32 and 32) of each 
type to determine the final result. The Addition Principle tells us when to add to com- 
pute the total number of possibilities. 


Suppose that X1, ... , X; are sets and that the ith set X; has n; elements. If {X1, ..., X;} 
is a pairwise disjoint family (i.e., if i A j, X; 1 X; = @), the number of possible 
elements that can be selected from X; or X2 or... or X; is 


il SPUR sp 20° aR bis 


(Equivalently, the union X; U X2 U--- UX; contains n; + nz + --- +7, elements.) 


In Example 6.1.9 we could let X; denote the set of eight-bit strings that begin 101 
and X> denote the set of eight-bit strings that begin 111. Since X, is disjoint from X2, 
according to the Addition Principle, the number of eight-bit strings of either type, which 
is the number of elements in X; U X2, is 32 + 32 = 64. 

We may summarize the Addition Principle by saying that we add the numbers 
of elements in each subset when the elements being counted can be decomposed into 
pairwise disjoint subsets. 

If we are counting objects that are constructed in successive steps, we use the 
Multiplication Principle. If we have disjoint sets of objects and we want to know the 
total number of objects, we use the Addition Principle. It is important to recognize when 
to apply each principle. This skill comes from practice and careful thinking about each 
problem. 

We close this section with examples that illustrate both counting principles. 


In how many ways can we select two books from different subjects among five dis- 
tinct computer science books, three distinct mathematics books, and two distinct art 
books? 


SOLUTION Using the Multiplication Principle, we find that we can select two books, 
one from computer science and one from mathematics, in 5-3 = 15 ways. Similarly, we 
can select two books, one from computer science and one from art, in 5-2 = 10 ways, 
and we can select two books, one from mathematics and one from art, in 3 -2 = 6 ways. 
Since these sets of selections are pairwise disjoint, we may use the Addition Principle 
to conclude that there are 15 + 10 + 6 = 31 ways of selecting two books from different 
subjects among the computer science, mathematics, and art books. < 


A six-person committee composed of Alice, Ben, Connie, Dolph, Egbert, and Francisco 
is to select a chairperson, secretary, and treasurer. 
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(a) In how many ways can this be done? 
(b) In how many ways can this be done if either Alice or Ben must be chairperson? 
(c) In how many ways can this be done if Egbert must hold one of the offices? 


(d) In how many ways can this be done if both Dolph and Francisco must hold office? 


SOLUTION 

(a) We use the Multiplication Principle. The officers can be selected in three succes- 
sive steps: Select the chairperson; select the secretary; select the treasurer. The 
chairperson can be selected in six ways. Once the chairperson has been selected, 
the secretary can be selected in five ways. After selection of the chairperson and 
secretary, the treasurer can be selected in four ways. Therefore, the total number 
of possibilities is 6-5-4 = 120. 

(b) Arguing as in part (a), if Alice is chairperson, we have 5-4 = 20 ways to se- 
lect the remaining officers. Similarly, if Ben is chairperson, there are 20 ways to 
select the remaining officers. Since these cases are disjoint, by the Addition 
Principle, there are 20 + 20 = 40 possibilities. 


(c) [First solution] Arguing as in part (a), if Egbert is chairperson, we have 20 ways 
to select the remaining officers. Similarly, if Egbert is secretary, there are 20 pos- 
sibilities, and if Egbert is treasurer, there are 20 possibilities. Since these three 
cases are pairwise disjoint, by the Addition Principle, there are 20+-20+20 = 60 
possibilities. 

{Second solution] Let us consider the activity of assigning Egbert and two 
others to offices to be made up of three successive steps: Assign Egbert an office; 
fill the highest remaining office; fill the last office. There are three ways to assign 
Egbert an office. Once Egbert has been assigned, there are five ways to fill the 
highest remaining office. Once Egbert has been assigned and the highest remain- 
ing office filled, there are four ways to fill the last office. By the Multiplication 
Principle, there are 3 -5 -4 = 60 possibilities. 


(d) Let us consider the activity of assigning Dolph, Francisco, and one other per- 
son to offices to be made up of three successive steps: Assign Dolph; assign 
Francisco; fill the remaining office. There are three ways to assign Dolph. Once 
Dolph has been assigned, there are two ways to assign Francisco. Once Dolph 
and Francisco have been assigned, there are four ways to fill the remaining office. 
By the Multiplication Principle, there are 3-2-4 = 24 possibilities. < 


Example 6.1.12 We revisit Example 6.1.3(c), which asked how many strings of length 4 that do not begin 
With the letter B can be formed using the letters ABCDE (repetitions not allowed)? 


SOLUTION We could have solved this problem using a direct approach by counting 
the number of strings that begin with A, with C, with D, and with E, and then, using the 
Addition Principle, summed these values. Since 24 strings begin with A [see the solution 
to Example 6.1.3(b) with B replaced by A], and 24 strings begin with C, and so on, we 
obtain 


24424424424 = 96. 


However, the solution given in Example 6.1.3(c), which subtracted the number of strings 
that do start with B from the total number of strings, was easier. It would have been much 
easier if more letters had been involved or if the numbers in the sum were not identical 
so that each involved a distinct calculation. 

A variation of the direct approach might also have been used: Choose the first letter 
(4 ways—choose A, C, D, or E); choose the second letter (4 ways—any of the remaining 


262 Chapter 6 ® Counting Methods and the Pigeonhole Principle 


XY 


Figure 6.1.3 |X| counts the 
number of elements in X — Y and 
XY, and |Y| counts the number 
of elements in Y — X and XN Y. 
Since |X| + |Y| double-counts the 
elements in XN Y, |X| +|Y| = 
|IXUY|+|XNY|. 


Theorem 6.1.13 


letters); choose the third letter (3 ways—any of the remaining letters); and choose the 
fourth letter (2 ways—either of the remaining letters). By the Multiplication Principle, 
there are 


AAS 9.9 = 06 


strings that do not begin with the letter B. < 


Inclusion-Exclusion Principle 


Suppose that we want to count the number of eight-bit strings that start 10 or end 011 or 
both. Let X denote the set of eight-bit strings that start 10 and Y denote the set of eight-bit 
strings that end 011. The goal then is to compute |X U Y|. We cannot use the Addition 
Principle and add |X| and |Y| to compute |X U Y| because the Addition Principle requires 
X and ¥ to be disjoint. Here X and Y are not disjoint; for example, 10111011 € XNY. The 
Inclusion-Exclusion Principle generalizes the Addition Principle by giving a formula 
to compute the number of elements in a union without requiring the sets to be pairwise 
disjoint. 

Continuing the discussion in the previous paragraph, suppose that we compute 
|X| + |Y|. We have counted the elements in X — Y (eight-bit strings that start 10 but do 
not end 011) once and the elements in Y — X (eight-bit strings that end 011 but do not 
start 10) once, but we have counted the elements in X /O Y (eight-bit strings that start 
10 and end 011) twice (see Figure 6.1.3). Thus if we subtract |X  Y| from |X| + |Y|, 
to compensate for the double-counting, we will obtain the number of strings in X U Y; 
that is, 


IXUY| = [X|+|¥|-|XN YI. 


Arguing as in Example 6.1.7, we find that |X| = 2°, |Y| = 2°, and |XN Y| = 23. 
Therefore, the number of eight-bit strings that start 10 or end 011 or both is equal to 


IXUY| = [|X] + |¥]-—[Xn Y| = 2°42 -23. 


We state the Inclusion-Exclusion Principle for two sets as Theorem 6.1.13. 


Inclusion-Exclusion Principle for Two Sets 
If X and Y are finite sets, then 


IXUY| = |X|+|¥|-[Xn YI. 


Proof Since X = (X—Y)U(XNY) and X—Y and XNY are disjoint, by the Addition 
Principle 


IX] = |X-Y¥|+ |X Y|. (6.1.1) 
Similarly, 
IY] =|Y-X|+|xnY|. (6.1.2) 


Since X UY = (X — Y)U(XN Y) U(¥Y —X) and X — Y, XN Y, and Y — X are pairwise 
disjoint, by the Addition Principle 


[XU Y) =X —¥| + [XN Y| +|¥—Xi. (6.1.3) 


Example 6.1.14 
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Combining equations (6.1.1)—(6.1.3), we obtain 


IX| + (¥| = |X —YI4IXNY|+1¥ —X|/4+ [XN Y| = |XUY| 4+ [XN YI. 


Subtracting |X M Y| from both sides of the preceding equation gives the desired result. ¢ 


A committee composed of Alice, Ben, Connie, Dolph, Egbert, and Francisco is to select 
a chairperson, secretary, and treasurer. How many selections are there in which either 
Alice or Dolph or both are officers? 


SOLUTION Let X denote the set of selections in which Alice is an officer and let Y 
denote the set of selections in which Dolph is an officer. We must compute |X U Y]. 
Since X and Y are not disjoint (both Alice and Dolph could be officers), we cannot use 
the Addition Principle. Instead we use the Inclusion-Exclusion Principle. 

We first count the number of selections in which Alice is an officer. Alice can be 
assigned an office in three ways, the highest remaining office can be filled in five ways, 
and the last office can be filled in four ways. Thus the number of selections in which 
Alice is an officer is 3-5-4 = 60, that is, |X| = 60. Similarly, the number of selections 
in which Dolph is an officer is 60, that is, |Y| = 60. 

Now XY is the set of selections in which both Alice and Dolph are officers. Alice 
can be assigned an office in three ways, Dolph can be assigned an office in two ways, 
and the last office can be filled in four ways. Thus the number of selections in which 
both Alice and Dolph are officers is 3-2-4 = 24, that is, |X N Y| = 24. 

The Inclusion-Exclusion Principle tells us that 


IXUY| = |X| +|¥|— [XM ¥| = 60 + 60 — 24 = 96. 


Thus there are 96 selections in which either Alice or Dolph or both are officers. < 


The name “inclusion-exclusion” in Theorem 6.1.13 results from including |X NY| 
twice when computing |X U Y| as |X| + |Y| and then excluding it by subtracting |X N Y| 
from |X| + |Y]. 

We leave the Inclusion-Exclusion Principle for three or more sets to the exercises 
(see Exercises 98-105). 


6.1 Problem-Solving Tips 


The key to solving problems in this section is determining when to use the Multiplication 
Principle and when to use the Addition Principle. Use the Multiplication Principle when 
using a step-by-step process to construct an activity. For example, to construct a dinner 
from Kay’s Quick Lunch menu (Figure 6.1.1) consisting of one appetizer, one main 
course, and one beverage, we use a three-step process: 


1. Choose one appetizer. 
2. Choose one main course. 


3. Choose one beverage. 


The number of different possible activities is the product of the number of ways each 
step can be done. Here we can select one appetizer in 2 ways, one main course in 3 ways, 
and one beverage in 4 ways. Thus, the number of dinners is 2-3-4 = 24. 

Use the Addition Principle when you want to count the number of elements in a 
set and you can divide the set into nonoverlapping subsets. Suppose, for example, that 
we want to count the total number of items available at Kay’s Quick Lunch. Since there 
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are 2 appetizers, 3 main course items, and 4 beverages, and no item belongs to two 
categories, the total number of items available is 


24344=9. 


Notice the difference between the two examples. To construct a dinner consisting 
of one appetizer, one main course, and one beverage at Kay’s Quick Lunch, we use a 
step-by-step process. The size of the set of dinners is not counted by dividing the set 
of dinners into nonoverlapping subsets. To count the number of dinners, we use the 
Multiplication Principle. To count the number of items available at Kay’s Quick Lunch, 
we just sum the number of items in each category since dividing the items by category 
naturally splits them into nonoverlapping subsets. We are not counting the individual 
items available by constructing them using a step-by-step process. To count the total 


number of items available, we use the Addition Principle. 

To count the number of items not having a given property, it is sometimes easier 
to count the number of items that do have the property and subtract that value from the 
total number of items [see Example 6.1.3(c)]. 

The Inclusion-Exclusion Principle (Theorem 6.1.13) is a variant of the Addition 
Principle that can be used when the sets involved are not pairwise disjoint. 


6.1 Review Exercises 


1. State the Multiplication Principle and give an example of its use. 


2. State the Addition Principle and give an example of its use. 


3. State the Inclusion-Exclusion Principle for two sets and give an 


example of its use. 


Use the Multiplication Principle to solve Exercises 1-12. 8. A restaurant chain advertised a special in which a customer 
could choose one of five appetizers, one of 14 main dishes, 
1. H di t Kay’ ick Lunch (Fi 6.1.1 - : oe . 
a ra ene . 7 . : i i eee yen and one of three desserts. The ad said that there were 210 
BE a eae GE Eeeeee Se OR eee ae possible dinners. Was the ad correct? Explain. 
2. He di t Kay’ ick Lunch (Fi a ea - : : . 
SUN RY cree Quite mass Diese ) carn 9. How many different car license plates can be constructed if 
sist of an optional appetizer, one main course, and an optional : : ae 
the licenses contain three letters followed by two digits if rep- 
beverage? tas ap 43 
etitions are allowed? if repetitions are not allowed? 
3. How many dinners at Kay’s Quick Lunch (Figure 6.1.1) 
consist of one appetizer, one main course, and an optional Exercises 10-12 ask about strings of length 5 formed using the let- 
beverage? ters ABCDEFG without repetitions. 
4. A man has eight shirts, four pairs of pants, and five pairs of 10. How many strings begin with the letter F and end with the 
shoes. How many different outfits are possible? letter A? 
5. The Braille system of representing characters was developed 11. How many strings begin with the letter F and do not end with 
early in the nineteenth century by Louis Braille. The charac- EB in that order? 
ters, used by the blind, consist of raised dots. The positions 12. How many strings contain CEG together in that order? 
for the dots are selected from two vertical columns of three 
dots each. At least one raised dot must be present. How many Use the Addition Principle to solve Exercises 13-24. 
sae - fa ; 
distinct Braille characters are possible’ 13. Three departmental committees have 6, 12, and 9 members 
6. The options available on a particular model of a car are five with no overlapping membership. In how many ways can 
interior colors, six exterior colors, two types of seats, three these committees send one member to meet with the 
types of engines, and three types of radios. How many differ- president? 
ent possibilities are available to the consumer! 14. In how many ways can a diner choose one item from among 
7. Two dice are rolled, one blue and one red. How many out- the appetizers and main courses at Kay’s Quick Lunch 


comes are possible? 


(Figure 6.1.1)? 


15. In how many ways can a diner choose one item from among 
the appetizers and beverages at Kay’s Quick Lunch (Figure 
6.1.1)? 


16. How many times are the print statements executed? 


fori=1tom 
printin(i) 

forj = 1ton 
println()) 


17. How many times is the print statement executed? 


fori=1tom 
forj =1ton 
println(i, j) 


18. Given that there are 32 eight-bit strings that begin 101 and 16 
eight-bit strings that begin 1101, how many eight-bit strings 
begin either 101 or 1101? 


19. Two dice are rolled, one blue and one red. How many out- 
comes give the sum of 2 or the sum of 12? 


20. A committee composed of Morgan, Tyler, Max, and Leslie is 
to select a president and secretary. How many selections are 
there in which Tyler is president or not an officer? 


21. A committee composed of Morgan, Tyler, Max, and Leslie is 
to select a president and secretary. How many selections are 
there in which Max is president or secretary? 


Exercises 22-24 ask about strings of length 5 formed using the let- 
ters ABCDEFG without repetitions. 


22. How many strings begin with AC or DB in that order? 


23. How many strings do not begin with AB (in that order) or D? 
24. How many strings contain B and D together in either order 
(i.e., BD or DB)? 

25. Comment on the following item from the New York Times: 
Big pickups also appeal because of the 
seemingly infinite ways they can be personalized; 
you need the math skills of Will Hunting to 
total the configurations. For starters, there are 32 
combinations of cabs (standard, Club Cab, Quad 
Cab), cargo beds (6.5 or 8 feet), and engines (3.9-liter 
V6, 5.2-liter V8, 5.9-liter V8, 5.9-liter turbo-diesel 
inline 6, 8-liter V10). 


In Exercises 26-33, two dice are rolled, one blue and one red. 


26. How many outcomes give the sum of 4? 


27. How many outcomes are doubles? (A double occurs when 
both dice show the same number.) 


28. How many outcomes give the sum of 7 or the sum of 11? 
29. How many outcomes have the blue die showing 2? 
30. How many outcomes have exactly one die showing 2? 


31. How many outcomes have at least one die showing 2? 


32. 
33 
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. How many outcomes have neither die showing 2? 


. How many outcomes give an even sum? 


In Exercises 34-36, suppose there are 10 roads from Oz to Mid 


Ear 


34, 
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36. 


37. 
38. 
39. 


40. 
41. 
42. 
43. 


th and five roads from Mid Earth to Fantasy Island. 


. How many routes are there from Oz to Fantasy Island passing 
through Mid Earth? 


How many round-trips are there of the form Oz—Mid Earth— 
Fantasy Island—Mid Earth-Oz? 


How many round-trips are there of the form Oz—Mid Earth— 
Fantasy Island—Mid Earth-Oz in which on the return trip 
we do not reverse the original route from Oz to Fantasy 
Island? 

How many eight-bit strings begin 1100? 

How many eight-bit strings begin and end with 1? 

How many eight-bit strings have either the second or the 
fourth bit 1 (or both)? 

How many eight-bit strings have exactly one 1? 

How many eight-bit strings have exactly two 1’s? 

How many eight-bit strings have at least one 1? 


How many eight-bit strings read the same from either end? 
(An example of such an eight-bit string is 01111110. Such 
strings are called palindromes.) 


In Exercises 44-49, a six-person committee composed of Alice, 
Ben, Connie, Dolph, Egbert, and Francisco is to select a chair- 
person, secretary, and treasurer. 


44, 
45. 


46. 


47. 


48. 


49. 


How many selections exclude Connie? 

How many selections are there in which neither Ben nor 
Francisco is an officer? 

How many selections are there in which both Ben and 
Francisco are officers? 

How many selections are there in which Dolph is an officer 
and Francisco is not an officer? 

How many selections are there in which either Dolph is chair- 
person or he is not an officer? 

How many selections are there in which Ben is either chair- 
person or treasurer? 


In Exercises 50-57, the letters ABCDE are to be used to form 
strings of length 3. 


50. 


51. 


52. 
53. 
54. 


55. 


. How many strings can be formed if we allow repetitions? 
How many strings can be formed if we do not allow 
repetitions? 

How many strings begin with A, allowing repetitions? 

How many strings begin with A if repetitions are not allowed? 
How many strings do not contain the letter A, allowing 
repetitions? 

How many strings do not contain the letter A if repetitions 
are not allowed? 
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56. How many strings contain the letter A, allowing repetitions? 
57. How many strings contain the letter A if repetitions are not 
allowed? 


Exercises 58-68 refer to the integers from 5 to 200, inclusive. 


58. How many numbers are there? 

59. How many are even? 

60. How many are odd? 

61. How many are divisible by 5? 

62. How many are greater than 72? 

63. How many consist of distinct digits? 

64. How many contain the digit 7? 

65. How many do not contain the digit 0? 

66. How many are greater than 101 and do not contain the 
digit 6? 

67. How many have the digits in strictly increasing order? 
(Examples are 13, 147, 8.) 

68. How many are of the form xyz, where 0 4 x < yand y > z? 


69. (a) In how many ways can the months of the birthdays of 
five people be distinct? 


(b) How many possibilities are there for the months of the 
birthdays of five people? 


(c) In how many ways can at least two people among five 
have their birthdays in the same month? 


Exercises 70-74 refer to a set of five distinct computer science 
books, three distinct mathematics books, and two distinct art 
books. 


70. In how many ways can these books be arranged on a shelf ? 


71. In how many ways can these books be arranged on a shelf if 
all five computer science books are on the left and both art 
books are on the right? 


72. In how many ways can these books be arranged on a shelf if 
all five computer science books are on the left? 
73. In how many ways can these books be arranged on a shelf if 
all books of the same discipline are grouped together? 
x74, In how many ways can these books be arranged on a shelf if 
the two art books are not together? 


75. In some versions of FORTRAN, an identifier consists of a 
string of one to six alphanumeric characters beginning with 
a letter. (An alphanumeric character is one of A to Z or 0 to 
9.) How many valid FORTRAN identifiers are there? 


76. If X is an n-element set and Y is an m-element set, how many 
functions are there from X to Y? 


x77. There are 10 copies of one book and one copy each of 10 
other books. In how many ways can we select 10 books? 


78. How many terms are there in the expansion of 


(x+y\(atb+ejet+ft+g)(ht+i)? 


x79. How many subsets of a (2n+ 1)-element set have n elements 
or less? 
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80. A Class B internet address, used for medium-sized networks, 
is a bit string of length 32. The first bits are 10 (to identify 
it as a Class B address). The netid is given by the next 14 
bits, which identifies the network. The hostid is given by the 
remaining 16 bits, which identifies the computer interface. 
The hostid must not consist of all 0’s or all 1’s. How many 
Class B addresses are available? 


81. A Class C internet address, used for small networks, is a bit 
string of length 32. The first bits are 110 (to identify it as 
a Class C address). The netid is given by the next 21 bits, 
which identifies the network. The hostid is given by the re- 
maining 8 bits, which identifies the computer interface. The 
hostid must not consist of all 0’s or all 1’s. How many Class C 
addresses are available? 


82. Given that the IPv4 internet address of a computer interface is 
either Class A, Class B, or Class C, how many IPv4 Internet 
addresses are available? 


83. How many symmetric relations are there on an n-element 
set? 

84. How many antisymmetric relations are there on an n-element 
set? 

85. How many reflexive and symmetric relations are there on an 
n-element set? 


86. How many reflexive and antisymmetric relations are there on 
an n-element set? 


87. How many symmetric and antisymmetric relations are there 
on an n-element set? 


88. How many reflexive, symmetric, and antisymmetric relations 
are there on an n-element set? 

89. How many truth tables are there for an n-variable function? 

90. How many binary operators are there on {1,2,...,}? 


91. How many commutative binary operators are there on 
{15 2yea.sgnt}? 


Use the Inclusion-Exclusion Principle (Theorem 6.1.13) to solve 
Exercises 92-97. 


92. How many eight-bit strings either begin with 100 or have the 
fourth bit 1 or both? 


93. How many eight-bit strings either start with a | or end with 
a | or both? 


In Exercises 94 and 95, a six-person committee composed of Alice, 
Ben, Connie, Dolph, Egbert, and Francisco is to select a chairper- 
son, secretary, and treasurer. 


94. How many selections are there in which either Ben is chair- 
person or Alice is secretary or both? 


95. How many selections are there in which either Connie is 
chairperson or Alice is an officer or both? 


96. Two dice are rolled, one blue and one red. How many 
outcomes have either the blue die 3 or an even sum or 
both? 


97. How many integers from | to 10,000, inclusive, are multiples 
of 5 or 7 or both? 


98. 


Prove the Inclusion-Exclusion Principle for three finite sets: 


|\XUYUZ| = |X|+|Y|-41Z|-|XAY|—|XNZ|—|YAZ|+|XNYNZ|I. 


99. 


100. 


101. 


Hint: Write the Inclusion-Exclusion Principle for two finite 
sets as 


|A UB| = |A| + |B] — |AN BI 


and let A = X and B= Y UZ. 


In a group of 191 students, 10 are taking French, business, 
and music; 36 are taking French and business; 20 are taking 
French and music; 18 are taking business and music; 65 are 
taking French; 76 are taking business; and 63 are taking mu- 
sic. Use the Inclusion-Exclusion Principle for three finite sets 
(see Exercise 98) to determine how many students are not 
taking any of the three courses. 


Use the Inclusion-Exclusion Principle for three finite sets 
(see Exercise 98) to solve the problem in Example 1.1.21. 


Use the Inclusion-Exclusion Principle for three finite sets 
(see Exercise 98) to solve Exercise 66, Section 1.1. 


Problem-Solving Corner 


Problem 
Find the number of ordered triples of sets X,, X2, X3 
satisfying 
X UX, UX = {1, 2, 3, 4, 5, 6, 7, 8} 
and X;NX,NX3 =. 
By ordered triple, we mean that the order of the sets X,, 
Xp, X3 is taken into account. For example, the triples 
(25 Sit et, 2s sO, 74) 
and 


(Hs ti, 2 Hin 2 Sy 7 


are considered distinct. 


Attacking the Problem 


It would be nice to begin by enumerating triples, but 
there are so many it would be hard to gain much in- 
sight from staring at a few triples. Let’s simplify the 
problem by making it smaller. Let’s replace 


{1, 2, 3, 4, 5, 6, 7, 8} 


by {1}. What could be simpler than {1}? (Well, maybe 
©, but that’s too simple!) We can now enumerate all 
ordered triples of sets X;, X2, X3 satisfying X; U Xz U 
X3 = {1} and X; 1 X21 X3 = S. We must put | in 
at least one of the sets X;, Xz, X3 (so that the union 


102. 


*103. 


104. 
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Use the Inclusion-Exclusion Principle for three finite sets 
(see Exercise 98) to compute the number of integers between 
1 and 10,000, inclusive, that are multiples of 3 or 5 or 11 or 
any combination thereof. 

Use Mathematical Induction to prove the general Inclusion- 
Exclusion Principle for finite sets X1, X2,..., Xn: 


IX1UX2U-+-UXl = So IMI SD NX) 


l<i<n 


De 


l<i<j<k<n 


l<i<j<n 


IX; VX; Xe] — + 


$(-1)" XN X21 Xp 


Hint: In the Inductive Step, refer to the hint in Exercise 98. 


Using the previous exercise, write the Inclusion-Exclusion 


Principle for four finite sets. 
105. 


thereof? 


Counting 


will be {1}), but we must not put 1 in all three of the 
sets X,;, X2, X3 (otherwise, the intersection would not 
be empty). Thus 1 will be in exactly one or two of the 
sets X;, X2, X3. The complete list of ordered triples is 
as follows: 


Xi =), Go, 1 = 2: 
Xi=o, G{, ye; 
i= Oo So. el) 
ApS ee — a, 
At i Sa 
Ki=o, MGa={, t= 10). 


Thus there are six ordered triples of sets X;, Xz, X3 sat- 
isfying 
X, UX UX = {1} and XN NX3 = S. 


Let’s step up one level and enumerate all ordered 
triples of sets X;, X2, X3 satisfying X; UX,UX3 = {1, 2} 
and X; 1 X2 M X3 = ©. As before, we must put | in 
at least one of the sets X,, X2, X3 (so that 1 will be in 
the union), but we must not put | in all three of the sets 
X,, Xz, X3 (otherwise, the intersection would not be 
empty). This time we must also put 2 in at least one of 
the sets X;, X2, X3 (so that 2 will also be in the union), 
but we must not put 2 in all three of the sets X;, X2, X3 
(otherwise, the intersection would not be empty). Thus 
each of 1 and 2 will be in exactly one or two of the sets 


How many integers between 1 and 10,000, inclusive, are 
multiples of 3 or 5 or 11 or 13 or any combination 
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lisin 2 isin lisin 2 isin 1 isin 2 isin 
XxX XxX XxX Xo »¢ X3 
Xo XxX X2 Xo X2 X3 
X3 Xx X3 X2 X3 X3 
Xi, X2 XxX Xi, X2 Xo X, X X3 
Xi, X3 XxX Xi, X3 X2 XG 3G X3 
X, X3 XxX Xo, X3 X2 X, X3 X3 
XxX Xi, X2 XxX Xi, X3 xX Xo, X3 
Xo X, X2 Xo X, X3 X2 X, X3 
X3 Xi, X2 X3 Xi, X3 X3 X, X3 
Xi, X2 Xi, X2 Xi, X2 Xi, X3 X, Xo Xo, X3 
Xi, X3 Xi, X2 Xi, X3 X, X3 X, X3 X, X3 
X, X3 X, X2 Xo, X3 X, X3 Xo, X3 X, X3 


X1, X2, X3. We enumerate the sets in a systematic way 
so that we can recognize any patterns that appear. The 
complete list of ordered triples is shown in the table 
at the top of this page. For example, the top left entry, 
X, X,, specifies that 1 is in X; and 2 is in X;; therefore, 
this entry gives the ordered triple 


X,={1,2}, m=O, =. 


As shown, there are 36 ordered triples of sets X1, X2, 
X3 satisfying 
X, UX, UX; = {1, 2} 
and X,NX%1NX3 = S. 

We see that there are six ways to assign | to the sets 
X1, X2, X3, which accounts for six lines per block. Sim- 
ilarly, there are six ways to assign 2 to the sets X1, Xo, 
X3, which accounts for six blocks. 

Before reading on, can you guess how many 
ordered triples of sets X;, Xz, X3 satisfy 


XG UX UXG — 253} 
and XN XN X3 = @? 
The pattern has emerged. If X = {1,2,...,n}, 
there are six ways to assign each of 1, 2,..., to the 


sets X,, X2, X3. By the Multiplication Principle, the 
number of ordered triples is 6”. 


Finding Another Solution 


We have just found a solution to the problem by start- 
ing with a simpler problem and then discovering and 
justifying the pattern that emerged. 

Another approach is to look for a similar problem 
and imitate its solution. The problem of Example 6.1.6 
is similar to the one at hand in that it also is a counting 
problem that deals with sets: 


Let X be an n-element set. How many ordered 
pairs (A, B) satisfy A C B C X? 


(At this point, it would be a good idea to go 
back and reread Example 6.1.6.) The solution given in 
Example 6.1.6 counts the number of ways to assign el- 
ements of X to exactly one of the sets A, B — A, or 
X—B. 

We can solve our problem by taking a similar 
approach. Each element of X is in exactly one of 

XG MGM, XG MAG OIG, 2G MXQOLG, 
XiNX 1X3, XiNKNX, XiNXNK. 
Since each member of X can be assigned to one of these 
sets in six ways, by the Multiplication Principle, the 

number of ordered triples is 6°. 

Notice that while this approach to solving the 


problem is different than that of the preceding section, 
the final argument is essentially the same. 


Formal Solution 


Each element in X is in exactly one of 

XY =XNXNX, Yo =X1NX2NX3, 

¥3 =X, NX.NX, Ys = X11 X2NX3, 

¥s =X NX2NX3, Yo = X1 1X2 X3. 
We can construct an ordered triple by the following 
eight-step process: Choose j, | < j < 6, and put | in 
Y;; choose j, 1 <j < 6, and put 2 in Y;; ...; choose j, 
1 <j < 6, and put 8 in Y;. For example, to construct 
the triple 


(il 2 Be 


we first choose j = 3 and put 1 in ¥3 = X,) NX. X3. 
Next, we choose j = 2 and put 2 in Y2 = X; AEM, 
The remaining choices for j are j = 6, 5, 4, 4, 4, 5. 

Each choice for 7 can be made in six ways. By 
the Multiplication Principle, the number of ordered 
triples is 


6-6-6-6-6-6-6-6 = 6° = 1,679,616. 


{1,4, 8}, {2,5, 6, 7}, 
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Summary of Problem-Solving Techniques = Enumerate items systematically so that patterns 


m= Replace the original problem with a simpler 
problem. One way to do this is to reduce the size 
of the original problem. 


= Directly enumerate the items to be counted. 


6.2 


Go Online 

For more on 
permutations and 
combinations, see 
goo.gl/WRoK11 


Example 6.2.2 


emerge. 
= Look for patterns. 


= Look for a similar problem and imitate its 
solution. 


Permutations and Combinations 


Four candidates, Zeke, Yung, Xeno, and Wilma, are running for the same office. So that 
the positions of the names on the ballot will not influence the voters, it is necessary to 
print ballots with the names listed in every possible order. How many distinct ballots 
will there be? 

We can use the Multiplication Principle. A ballot can be constructed in four 
successive steps: Select the first name to be listed; select the second name to be listed; 
select the third name to be listed; select the fourth name to be listed. The first name 
can be selected in four ways. Once the first name has been selected, the second name can 
be selected in three ways. Once the second name has been selected, the third name 
can be selected in two ways. Once the third name has been selected, the fourth name 
can be selected in one way. By the Multiplication Principle, the number of ballots is 
4-3-2-1=24. 

An ordering of objects, such as the names on the ballot, is called a permutation. 


Definition 6.2.1 > A permutation of n distinct elements x), ... , x, is an order- 
ing of the n elements x1, ..., Xp. < 


There are six permutations of three elements. If the elements are denoted A, B, C, the 
six permutations are 


ABC, ACB, BAC, BCA, CAB, CBA. 4 


We found that there are 24 ways to order four candidates on a ballot; thus there 
are 24 permutations of four objects. The method that we used to count the number of 
distinct ballots containing four names may be used to derive a formula for the number 
of permutations of n elements. 

The proof of the following theorem for n = 4 is illustrated in Figure 6.2.1. 


B > A = D = CG 
Select Ist Select 2nd Select 3rd Select 4th 
element element element element 


Figure 6.2.1 The proof of Theorem 6.2.3 for n = 4. A permutation of ABCD is 
constructed by successively selecting the first element, then the second element, 
then the third element, and, finally, the fourth element. 
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Theorem 6.2.3 


Example 6.2.4 


Example 6.2.5 


mel VAI 


Figure 6.2.2 Four tokens to 
permute. 


Example 6.2.6 


Example 6.2.7 


There are n! permutations of n elements. 


Proof We use the Multiplication Principle. A permutation of n elements can be con- 
structed in n successive steps: Select the first element; select the second element; ...; 
select the last element. The first element can be selected in n ways. Once the first element 
has been selected, the second element can be selected in n — 1 ways. Once the second 
element has been selected, the third element can be selected in n — 2 ways, and so on. 
By the Multiplication Principle, there are 


n(n — 1)\(n—2)---2-l=n! 


permutations of n elements. < 


There are 
10! = 10-9-8-7-6-5-4-3+2+1 = 3,628,800 


permutations of 10 elements. < 


How many permutations of the letters ABCDEF contain the substring DEF? 


SOLUTION To guarantee the presence of the pattern DEF in the substring, these three 
letters must be kept together in this order. The remaining letters, A, B, and C, can be 
placed arbitrarily. We can think of constructing permutations of the letters ABCDEF 
that contain the pattern DEF by permuting four tokens—one labeled DEF and the others 
labeled A, B, and C (see Figure 6.2.2). By Theorem 6.2.3, there are 4! permutations of 
four objects. Thus the number of permutations of the letters ABCDEF that contain the 
substring DEF is 4! = 24. < 


How many permutations of the letters ABCDEF contain the letters DEF together in any 
order? 


SOLUTION We can solve the problem by a two-step procedure: Select an ordering of 
the letters DEF; construct a permutation of ABCDEF containing the given ordering of 
the letters DEF. By Theorem 6.2.3, the first step can be done in 3! = 6 ways and, ac- 
cording to Example 6.2.5, the second step can be done in 24 ways. By the Multiplication 
Principle, the number of permutations of the letters ABCDEF containing the letters DEF 
together in any order is 6-24 = 144. < 


In how many ways can six persons be seated around a circular table? If a seating is 
obtained from another seating by having everyone move n seats clockwise, the seatings 
are considered identical. 


SOLUTION Let us denote the persons as A, B, C, D, E, and F. Since seatings obtained 
by rotations are considered identical, we might as well seat A arbitrarily. To seat the 
remaining five persons, we can order them and then seat them in this order clockwise 
from A. For example, the permutation CDBFE would define the seating in the adjacent 
figure. Since there are 5! = 120 permutations of five elements, there are 120 ways that 
six persons can be seated around a circular table. 

The same argument can be used to show that there are (n — 1)! ways that n persons 
can be seated around a circular table. < 


Sometimes we want to consider an ordering of r elements selected from n available 
elements. Such an ordering is called an r-permutation. 


Example 6.2.9 


Theorem 6.2.10 


Example 6.2.11 


Example 6.2.12 
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Definition 6.2.8 >» An r-permutation of n (distinct) elements x,,..., x, is an 
ordering of an r-element subset of {x,,...,x,}. The number of r-permutations of a set 
of n distinct elements is denoted P(n, r). < 
Examples of 2-permutations of a, b, c are ab, ba, and ca. < 


If r = n in Definition 6.2.8, we obtain an ordering of all n elements. Thus an 
n-permutation of n elements is what we previously called simply a permutation. 
Theorem 6.2.3 tells us that P(n,n) = n!. The number P(n, r) of r-permutations of a 
n-element set when r < n may be derived as in the proof of Theorem 6.2.3. The proof 
of the theorem for n = 6 and r = 3 is illustrated in Figure 6.2.3. 


Cc = A > E 
Select Ist Select 2nd Select 3rd 
element element element 


Figure 6.2.3 The proof of Theorem 6.2.10 for n = 6 and 
r = 3. An r-permutation of ABCDEF is constructed by 
successively selecting the first element, then the second 
element, and, finally, the third element. 


The number of r-permutations of a set of n distinct objects is 


P(n,r) =n(n— 1)(n—2)---”M—rt+1) 
n! 


> Gan! 


1? ihe 


Proof We are to count the number of ways to order r elements selected from an 
n-element set. The first element can be selected in n ways. Once the first element 
has been selected, the second element can be selected in n — 1 ways. We continue se- 
lecting elements until, having selected the (r — 1)st element, we select the rth element. 
This last element can be chosen inn — r+ 1 ways. By the Multiplication Principle, the 
number of r-permutations of a set of n distinct objects is 


way n@— D+ @—rt+DI@-—na@-r-1)---2-1 
n(n—1)\(n—2)---(n-r+1)= (n—r(n—r—1)---2+1 


n! 


~ (n=l 4 


According to Theorem 6.2.10, the number of 2-permutations of X = {a, b, c}is P(3, 2) = 
3-2 = 6. These six 2-permutations are ab, ac, ba, be, ca, cb. < 


In how many ways can we select a chairperson, vice-chairperson, secretary, and treasurer 
from a group of 10 persons? 


SOLUTION We need to count the number of orderings of four persons selected from 
a group of 10, since an ordering picks (uniquely) a chairperson (first pick), a vice- 
chairperson (second pick), a secretary (third pick), and a treasurer (fourth pick). By 
Theorem 6.2.10, the solution is 


P(10, 4) = 10-9-8-7 = 5040 
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or 


eb aye: 10! 10! 
mo" d0-4)! 6L- < 


Example 6.2.13 In how many ways can seven distinct Martians and five distinct Jovians wait in line if no 
TT _——= two Jovians stand together? 


SOLUTION We can line up the Martians and Jovians by a two-step process: Line up the 
Martians; line up the Jovians. The Martians can line up in 7! = 5040 ways. Once we have 
lined up the Martians (e.g., in positions M;—M7), since no two Jovians can stand together, 
the Jovians have eight possible positions in which to stand (indicated by blanks): 


_M,—M>_M3_Ms_Ms—Mo_M7_. 


Thus the Jovians can stand in P(8,5) = 8-7-6-5-4 = 6720 ways. By the Multiplica- 
tion Principle, the number of ways seven distinct Martians and five distinct Jovians can 
wait in line if no two Jovians stand together is 5040 -6720 = 33,868,800. < 


We turn next to combinations. A selection of objects without regard to order is 
called a combination. 


Definition 6.2.14 >  GivenasetX = {x,, ...,x,} containing n (distinct) elements, 


(a) An r-combination of X is an unordered selection of r-elements of X (i.e., an 
r-element subset of X). 


(b) The number of r-combinations of a set of n distinct elements is denoted C(n, r) 
or ie > | 


Example 6.2.15 A group of five students, Mary, Boris, Rosa, Ahmad, and Nguyen, has decided to talk 
———e'--—-—{- with the Mathematics Department chairperson about having the Mathematics Depart- 
ment offer more courses in discrete mathematics. The chairperson has said that she will 
speak with three of the students. In how many ways can these five students choose three 

of their group to talk with the chairperson? 


SOLUTION In solving this problem, we must not take order into account. (For ex- 
ample, it will make no difference whether the chairperson talks to Mary, Ahmad, and 
Nguyen or to Nguyen, Mary, and Ahmad.) By simply listing the possibilities, we see 
that there are 10 ways that the five students can choose three of their group to talk to the 
chairperson: 


MBR, MBA, MRA, BRA, MBN, MRN, BRN, MAN, BAN, RAN. 


In the terminology of Definition 6.2.14, the number of ways the five students can choose 
three of their group to talk with the chairperson is C(5, 3), the number of 3-combinations 
of five elements. We have found that C(5, 3) = 10. < 


We next derive a formula for C(n, r) by counting the number of r-permutations of 
an n-element set in two ways. The first way simply uses the formula P(n, r). The second 
way of counting the number of r-permutations of an n-element set involves C(n, r). 
Equating the two values will enable us to derive a formula for C(n, r). 

We can construct r-permutations of an n-element set X in two successive steps: 
First, select an r-combination of X (an unordered subset of r items); second, order it. For 
example, to construct a 2-permutation of {a, b, c, d}, we can first select a 2-combination 
and then order it. Figure 6.2.4 shows how all 2-permutations of {a, b, c, d} are obtained 


Theorem 6.2.16 


Example 6.2.17 


Example 6.2.18 


Example 6.2.19 


6.2 @ Permutations and Combinations 273 


{a, b} {a, c} 


iwinie 


ab ba ac ca ad da be cb bd db cd dc 


b, c} {b, d} {c, d} 


Figure 6.2.4 2-permutations of {a, b, c, d}. 


in this way. The Multiplication Principle tells us that the number of r-permutations is 
the product of the number of r-combinations and the number of orderings of r elements. 
That is, 


P(n,r) = C(n, r)r!. 


Therefore, 


C(n,r) = = us 
r! 


Our next theorem states this result and gives some alternative ways to write C(n, r). 


The number of r-combinations of a set of n distinct objects is 


IAG.) _ AG Wyeca(@ asl) n! 
al r! ~ @=PA 


Cn) = 


Proof = The proof of the first equation is given before the statement of the theorem. 
The other forms of the equation follow from Theorem 6.2.10. < 


In how many ways can we select a committee of three from a group of 10 distinct per- 
sons? 


SOLUTION Since a committee is an unordered group of people, the answer is 


10-9°-8 
| 


c(10, 3) = = 120. 


< 


In how many ways can we select a committee of two women and three men from a group 
of five distinct women and six distinct men? 


SOLUTION As in Example 6.2.17, we find that the two women can be selected in 
C(5, 2) = 10 ways and that the three men can be selected in C(6, 3) = 20 ways. The 
committee can be constructed in two successive steps: Select the women; select the men. 
By the Multiplication Principle, the total number of committees is 10-20 = 200. < 


How many eight-bit strings contain exactly four 1’s? 


SOLUTION An eight-bit string containing four 1’s is uniquely determined once we tell 
which bits are 1. This can be done in C(8, 4) = 70 ways. < 
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Example 6.2.20 An ordinary deck of 52 cards consists of four suits 
clubs, diamonds, hearts, spades 
of 13 denominations each 


ace, 2-10, jack, queen, king. 


(a) How many (unordered) five-card poker hands, selected from an ordinary 52-card 
deck, are there? 


(b) How many poker hands contain cards all of the same suit? 


(c) How many poker hands contain three cards of one denomination and two cards 
of a second denomination? 


SOLUTION 
(a) The answer is given by the combination formula C(52, 5) = 2,598,960. 


(b) A hand containing cards all of the same suit can be constructed in two succes- 
sive steps: Select a suit; select five cards from the chosen suit. The first step can 
be done in four ways, and the second step can be done in C(13, 5) ways. By the 
Multiplication Principle, the answer is 4-C(13, 5) = 5148. 


(c) A hand containing three cards of one denomination and two cards of a second 
denomination can be constructed in four successive steps: Select the first denom- 
ination; select the second denomination; select three cards of the first denomi- 
nation; select two cards of the second denomination. The first denomination can 
be chosen in 13 ways. Having selected the first denomination, we can choose the 
second denomination in 12 ways. We can select three cards of the first denomi- 
nation in C(4, 3) ways, and we can select two cards of the second denomination 
in C(4, 2) ways. By the Multiplication Principle, the answer is 


13 -12-C(4, 3) -C(4, 2) = 3744. < 


Example 6.2.21 How many routes are there from the lower-left corner of an n x n square grid to the 
upper-right corner if we are restricted to traveling only to the right or upward? One such 
route is shown in a4 x 4 grid in Figure 6.2.5(a). 


(a) (b) 


Figure 6.2.5 (a) A4 x 4 grid with a route from the lower-left 
corner to the upper-right corner. (b) The route in (a) transformed 
to aroute inaS5 x 3 grid. 


Example 6.2.22 
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SOLUTION Each route can be described by a string of n R’s (right) and n U’s (up). For 
example, the route shown in Figure 6.2.5(a) can be described by the string RUURRURU. 
Any such string can be obtained by selecting n positions for the R’s, without regard to 
the order of selection, among the 27 available positions in the string and then filling the 
remaining positions with U’s. Thus there are C(2n, n) possible routes. < 


How many routes are there from the lower-left corner of an n x n square grid to the 
upper-right corner if we are restricted to traveling only to the right or upward and if 
we are allowed to touch but not go above a diagonal line from the lower-left corner to 
the upper-right corner? 


SOLUTION We call a route that touches but does not go above the diagonal a good 
route, and we call a route that goes above the diagonal a bad route. Our problem is to 
count the number of good routes. We let G,, denote the number of good routes and B,, 
denote the number of bad routes. In Example 6.2.21 we showed that G,, +B, = C(2n, n); 
thus it suffices to compute the number of bad routes. 

We call a route from the lower-left corner of an (n + 1) x (n — 1) grid to the 
upper-right corner (with no restrictions) an (1 + 1) x (nm — 1) route. A 5 x 3 route is 
shown in Figure 6.2.5(b). We show that the number of bad routes is equal to the number 
of (n+ 1) x (n— 1) routes by describing a one-to-one, onto function from the set of bad 
routes to the set of (1 + 1) x (n — 1) routes. 

Given a bad route, we find the first move (starting from the lower left) that takes 
it above the diagonal. Thereafter we replace each right move by an up move and each 
up move by a right move. For example, the route of Figure 6.2.5(a) is transformed to 
the route shown in Figure 6.2.5(b). This transformation can also be effected by rotating 
the portion of the route following the first move above the diagonal about the dashed line 
shown in Figure 6.2.5(b). We see that this transformation does indeed assign to each bad 
route an (n+ 1) x (n — 1) route. 

To show that our function is onto, consider any (n + 1) x (nm — 1) route. Since 
this route ends above the diagonal, there is a first move where it goes above the diag- 
onal. We may then rotate the remainder of the route about the dashed line shown in 
Figure 6.2.5(b) to obtain a bad route. The image of this bad route under our function is 
the (n+ 1) x (n— 1) route with which we started. Therefore, our function is onto. Our 
function is also one-to-one, as we can readily verify that the function transforms dis- 
tinct bad routes to distinct (1 + 1) x (n — 1) routes. Therefore, the number of bad routes 
equals the number of (n+ 1) x (n— 1) routes. 

An argument like that in Example 6.2.21 shows that the number of (n+ 1) x (n— 1) 
routes is equal to C(2n, n — 1). Thus the number of good routes is equal to 


C2 B, =C(. C2 1 = Gn! om) 
Sn ea nn ee se Ie 
Qn! (1 1-\_ a)! 1 
~ nl(n—1)! (- —) <a ae 
(2n)! _ C(2n, n) 


~ (a+ nin! ntl 


Exercises 75 and 76 outline an alternative proof using mathematical induction of 
the formula C(2n, n)/(n+ 1) for the number of good routes. Exercises 27-33, Section 9.8, 
outline yet another way to derive the formula C(2n, n)/(n + 1). < 
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Go Online 

For more on Catalan 
numbers, see 
goo.gl/WRoK11 


Example 6.2.23 


The numbers C(2n,n)/(n + 1) are called Catalan numbers in honor of the 
Belgian mathematician Eugéne-Charles Catalan (1814-1894), who discovered an ele- 
mentary derivation of the formula C(2n, n)/(n + 1). Catalan published numerous pa- 
pers in analysis, combinatorics, algebra, geometry, probability, and number theory. In 
1844, he conjectured that the only consecutive positive integers that are powers (i.e., #/, 
where j > 2) are 8 and 9. Over 150 years later, Preda Mihailescu proved the result (in 
2002). 

In this book, we denote the Catalan number C(2n, n)/(n + 1) as C,,n > 1, and 
we define Cp to be |. The first few Catalan numbers are 


Q=1, C=1, G=2, G=5, G=14, C=42. 


Like the Fibonacci numbers, the Catalan numbers have a way of appearing in unex- 
pected places (e.g., Exercises 73 and 80-82, this section, and Exercises 30-32, 
Section 7.1). 

Our next example illustrates a common error in counting—namely, counting some 
objects more than once. 


What is wrong with the following argument, which purports to show that there are 
C(8, 5)2? bit strings of length 8 containing at least five 0’s? 


SOLUTION We can construct bit strings of length 8 by filling each of eight slots 


with either 0 or 1. To ensure that there are at least five 0’s, we choose five slots and 
place a 0 in each of them. The five slots can be chosen in C(8, 5) ways. We then fill the 
remaining three slots with either 0 or 1. Since each of the three remaining slots can be 
filled in two ways, the remaining slots can be filled in 2° ways. Thus there are C(8, 5)2* 
bit strings of length 8 containing at least five 0’s. 

The problem is that some strings are counted more than one time. For example, 
suppose that we choose the first five slots and place 0’s in them 


00 0 0 0 


If we then place 0 1 0 in the last three slots, we obtain the string 


000000i1 0 (6.2.1) 


Now suppose that we choose the second through sixth slots and place 0’s in them 


000 0 0 


If we then place 0 in the first slot and | 0 in the last two slots, we obtain the string 


000000i1 0 (6.2.2) 


In the argument given, strings (6.2.1) and (6.2.2) are counted as distinct strings. 

A correct way to count the bit strings of length 8 containing at least five 0’s is to 
count the number of strings containing exactly five 0’s, the number of strings containing 
exactly six 0’s, the number of strings containing exactly seven 0’s, and the number of 
strings containing exactly eight 0’s and sum these numbers. Notice that here each string 
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is counted one time since no string can contain exactly i 0’s and exactly j 0’s when 
iY. 

To construct a bit string of length 8 containing exactly five 0’s, we choose five 
slots for the 0’s and put 1’s in the other three slots. Since we can choose five slots 
in C(8,5) ways, there are C(8,5) bit strings of length 8 containing exactly five 0’s. 
Similarly, there are C(8, 6) bit strings of length 8 containing exactly six 0’s, and so on. 
Therefore, there are 


C(8, 5) + C(8, 6) + C(8, 7) + C(8, 8) 


bit strings of length 8 containing at least five 0’s. < 


We close this section by providing another proof of Theorem 6.2.16 that gives 
a formula for the number of r-element subsets of an n-element set. The proof is illus- 
trated in Figure 6.2.6. Let X be an n-element set. We assume the formula P(n, r) = 
n(n—1)---(n—r-+1) that counts the number of orderings of r-element subsets chosen 
from X. To count the number of r-element subsets of X, we do not want to take order into 
account—we want to consider permutations of the same subset equivalent. Formally, we 
define a relation R on the set S$ of r-permutations of X by the following rule: p,;Rpo if 
pi and p2 are permutations of the same r-element subset of X. It is straightforward to 
verify that R is an equivalence relation on S. 

If p is an r-permutation of X, then p is a permutation of some r-element subset 
X, of X; thus, the equivalence class containing p consists of all permutations of X,. We 
see that each equivalence class has r! elements. An equivalence class is determined by 
the r-element subset of X that is permuted to obtain its members. Therefore, there are 
C(n, r) equivalence classes. Since the set S has P(n, r) elements, by Theorem 3.4.16, 
C(n, r) = P(n,r)/r!. 


ab ac ad be bd cd 
ba ca da cb db dc 


Figure 6.2.6 The alternative proof of Theorem 6.2.16 for n = 4 and 

r = 2. Each box contains an equivalence class for the relation R on the 
set of 2-permutations of X = {a, b, c, d} defined by p; Rp2 if p; and 

p2 are permutations of the same 2-element subset of X. There are 

P(4, 2) = 12 2-permutations of X and 2 ways to permute each 
2-permutation. Since each equivalence class corresponds to a subset of X, 
12/2 = C(4, 2). 


In this section, we have presented techniques for counting objects where repetition 
is not allowed. In Section 6.3, we will count objects where repetition is allowed. 


6.2 Problem-Solving Tips 


The key points to remember in this section are that a permutation takes order into ac- 
count and a combination does not take order into account. Thus, a key to solving counting 
problems is to determine whether we are counting ordered or unordered items. For ex- 
ample, a line of distinct persons is considered ordered. Thus six distinct persons can wait 
in line in 6! ways; the permutation formula is used. A committee is a typical example of 
an unordered group. For example, a committee of three can be selected from a set of six 
distinct persons in C(6, 3) ways; the combination formula is used. 
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6.2 Review Exercises 


1. 


n 


7) 


a 


. What is an r-permutation of x1, ... 


What is a permutation of x1, ..., Xn? 


. How many permutations are there of an n-element set? How is 
this formula derived? 


Xn? 


. How many r-permutations are there of an n-element set? How 
is this formula derived? 


5. 


. What is an r-combination of {x;, .. 


How do we denote the number of r-permutations of an 
n-element set? 


-+Xn}? 


. How many r-combinations are there of an n-element set? How 


is this formula derived? 


. How do we denote the number of r-combinations of an 


n-element set? 


Nnwn bw YN = 


wo 


. How many permutations are there of a, b, c, d? 

. List the permutations of a, b, c, d. 

. How many 3-permutations are there of a, b, c, d? 

. List the 3-permutations of a, b, c, d. 

. How many permutations are there of 11 distinct objects? 

. How many 5-permutations are there of 11 distinct objects? 


. In how many ways can we select a chairperson, vice- 


chairperson, and recorder from a group of 11 persons? 


. In how many ways can we select a chairperson, vice- 


chairperson, secretary, and treasurer from a group of 
12 persons? 


. In how many different ways can 12 horses finish in the order 


Win, Place, Show? 


In Exercises 10-18, determine how many strings can be formed 


by ordering the letters ABCDE subject to the conditions 
given. 
10. Contains the substring ACE 


. Contains the letters ACE together in any order 

. Contains the substrings DB and AE 

. Contains either the substring AE or the substring ZA or both 
. A appears before D. Examples: BCAED, BCADE 

. Contains neither of the substrings AB, CD 

. Contains neither of the substrings AB, BE 

. A appears before C and C appears before E 

. Contains either the substring DB or the substring BE or both 


. In how many ways can five distinct Martians and eight distinct 


Jovians wait in line if no two Martians stand together? 


. In how many ways can five distinct Martians, ten distinct 


Vesuvians, and eight distinct Jovians wait in line if no two 
Martians stand together? 


. In how many ways can five distinct Martians and five distinct 


Jovians wait in line? 


. In how many ways can five distinct Martians and five distinct 


Jovians be seated at a circular table? 


23. 


In how many ways can five distinct Martians and five distinct 
Jovians be seated at a circular table if no two Martians sit to- 
gether? 


. In how many ways can five distinct Martians and eight dis- 


tinct Jovians be seated at a circular table if no two Martians sit 
together? 


In Exercises 25-27, let X = {a, b,c, d}. 


32. 


. Compute the number of 3-combinations of X. 
. List the 3-combinations of X. 


. Show the relationship between the 3-permutations and the 


3-combinations of X by drawing a picture like that in Fig- 
ure 6.2.4. 


. In how many ways can we select a committee of three from a 


group of 11 persons? 


. In how many ways can we select a committee of four from a 


group of 12 persons? 


. At one point in the Illinois state lottery Lotto game, a per- 


son was required to choose six numbers (in any order) among 
44 numbers. In how many ways can this be done? The state 
was considering changing the game so that a person would be 
required to choose six numbers among 48 numbers. In how 
many ways can this be done? 


. Suppose that a pizza parlor features four specialty pizzas and 


pizzas with three or fewer unique toppings (no choosing an- 
chovies twice!) chosen from 17 available toppings. How many 
different pizzas are there? 


Suppose that the pizza parlor of Exercise 31 has a special price 
for four pizzas. How many ways can four pizzas be selected? 


Exercises 33-38 refer to a club consisting of six distinct men and 
seven distinct women. 


. In how many ways can we select a committee of five persons? 


. In how many ways can we select a committee of three men 


and four women? 


. In how many ways can we select a committee of four persons 


that has at least one woman? 


36. In how many ways can we select a committee of four persons 
that has at most one man? 


37. In how many ways can we select a committee of four persons 
that has persons of both sexes? 

38. In how many ways can we select a committee of four persons 
so that Mabel and Ralph do not serve together? 


39. In how many ways can we select a committee of four Republi- 
cans, three Democrats, and two Independents from a group of 
10 distinct Republicans, 12 distinct Democrats, and four dis- 
tinct Independents? 


40. How many eight-bit strings contain exactly three 0’s? 


41. How many eight-bit strings contain three 0’s in a row and 
five 1’s? 


How many eight-bit strings contain at least two 0’s in 
arow? 


*42, 


In Exercises 43-51, find the number of (unordered) five-card poker 
hands, selected from an ordinary 52-card deck, having the proper- 
ties indicated. 

43. Containing four aces 


44, Containing four of a kind, that is, four cards of the same 
denomination 


45. Containing all spades 

46. Containing cards of exactly two suits 
47. Containing cards of all suits 

48. Of the form A2345 of the same suit 


49. Consecutive and of the same suit (Assume that the ace is the 
lowest denomination.) 


50. Consecutive (Assume that the ace is the lowest denomination.) 


51. Containing two of one denomination, two of another denomi- 
nation, and one of a third denomination 


52. Find the number of (unordered) 13-card bridge hands selected 
from an ordinary 52-card deck. 


53. How many bridge hands are all of the same suit? 
54. How many bridge hands contain exactly two suits? 
55. How many bridge hands contain all four aces? 


56. How many bridge hands contain five spades, four hearts, three 
clubs, and one diamond? 


57. How many bridge hands contain five of one suit, four of an- 
other suit, three of another suit, and one of another suit? 


58. How many bridge hands contain four cards of three suits and 
one card of the fourth suit? 


59. How many bridge hands contain no face cards? (A face card 
is one of 10, J, Q, K, A.) 
In Exercises 60-64, a coin is flipped 10 times. 


60. How many outcomes are possible? (An outcome is a list of 
10 H’s and T’s that gives the result of each of 10 tosses. For 
example, the outcome 


HHTHTHHHTH 
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represents 10 tosses, where a head was obtained on the first 
two tosses, a tail was obtained on the third toss, a head was 
obtained on the fourth toss, etc.) 


61. How many outcomes have exactly three heads? 
62. How many outcomes have at most three heads? 
63. How many outcomes have a head on the fifth toss? 


64. How many outcomes have as many heads as tails? 


Exercises 65—68 refer to a shipment of 50 microprocessors of which 
four are defective. 


65. In how many ways can we select a set of four microproces- 
sors? 

66. In how many ways can we select a set of four nondefective 
microprocessors? 


67. In how many ways can we select a set of four microprocessors 
containing exactly two defective microprocessors? 


68. In how many ways can we select a set of four microprocessors 
containing at least one defective microprocessor? 


x69. Show that the number of bit strings of length n > 4 that con- 


tain exactly two occurrences of 10 is C(n + 1, 5). 


*70. Show that the number of n-bit strings having exactly k 0’s, 


with no two 0’s consecutive, is C(n — k + 1,k). 


x71. Show that the product of any positive integer and its k — 1 
successors is divisible by k!. 


72. Show that there are (2n — 1)(2n — 3)---3+1 ways to pick n 
pairs from 2n distinct items. 


Exercises 73-77 refer to an election in which two candidates 
Wright and Upshaw ran for dogcatcher. After each vote was tabu- 
lated, Wright was never behind Upshaw. This problem is known as 
the ballot problem. 


73. Suppose that each candidate received exactly r votes. Show 
that the number of ways the votes could be counted is C,, the 
rth Catalan number. 


74. Suppose that Wright received exactly r votes and Upshaw 
received exactly u votes, r > u > 0. Show that the number of 
ways the votes could be counted is C(r + u,r) — C(r + yu, 
r+1). 

75. Suppose that Wright received exactly r votes and Upshaw re- 
ceived exactly u votes, r > u > 0. Use induction onn = r+u 
to prove that the number of ways the votes could be counted is 


r—-u+l1 
————€ ar). 
a (r+u,r) 

76. Use Exercise 75 to derive the formula C(2n, n)/(n+ 1) for the 
number of good routes as defined in Example 6.2.22. 

77. Show that if exactly n votes were cast, the number of ways the 
votes could be counted is C(n, [n/2]). 

78. Suppose that we start at the origin in the xy-plane and take n 
unit steps (i.e., each step is of length one), where each step is 
either vertical (up or down) or horizontal (left or right). How 
many such paths never go strictly below the x-axis? 
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79. 


80. 


81. 


x82. 


83. 


84. 


85. 


86. 


Suppose that we start at the origin in the xy-plane and take n 
unit steps (i.e., each step is of length one), where each step is 
either vertical (up or down) or horizontal (left or right). How 
many such paths stay in the first quadrant (x > 0, y > 0)? 


Show that the number of ways that 2n persons, seated around 
a circular table, can shake hands in pairs without any arms 
crossing is C,, the nth Catalan number. 


A photo of 2n students, no two of which are the same height, 
is to be taken subject to the following rules: 

(a) There will be two rows each containing n students, all of 
whom are standing. 

Each student in the back row must be taller than the stu- 
dent standing directly in front of him. 

In each row, the students must be arranged in increasing 
order of height. 


YS 


(b) 
(c) 


Show that the number of ways to arrange the students is C;,, 
the nth Catalan number. 


Show that the print statement in the pseudocode 


fori; = lton 
for i2 = 1 to min(i,,n — 1) 
for i3 = 1 to min(i2, n — 2) 


for in—-; = 1 to min(i,_2, 2) 
fori, = 1tol 


printin(i,, i2,..-, in) 


is executed C, times, where C,, denotes the nth Catalan 
number. 


Suppose that we have n objects, r distinct and n — r identical. 
Give another derivation of the formula 


P(n,r) =r! C(n, r) 


by counting the number of orderings of the n objects in two 
ways: 
™ Count the orderings by first choosing positions for the 
r distinct objects. 


™ Count the orderings by first choosing positions for the 
n—r identical objects. 


What is wrong with the following argument, which purports 
to show that 4C(39, 13) bridge hands contain three or fewer 
suits? 

There are C(39, 13) hands that contain only clubs, di- 
amonds, and spades. In fact, for any three suits, there are 
C(39, 13) hands that contain only those three suits. Since there 
are four 3-combinations of the suits, the answer is 4C(39, 13). 


What is wrong with the following argument, which purports to 
show that there are 13‘ - 48 (unordered) five-card poker hands 
containing cards of all suits? 

Pick one card of each suit. This can be done in 
13-13-13-13=134 ways. Since the fifth card can be 
chosen in 48 ways, the answer is 134-48. 


What is wrong with the following argument, which purports 
to show that there are P(n, m)m"~ onto functions from the 
n-element set X to the m-element set Y,n > m? 


87. 


88 


89 


90 
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Let Y = {y1,...,¥m}. To ensure that a function from 
X to Y is onto Y, we select an m-permutation of X, say 
5 ees Xm, and assign x; the value y;, x2 the value y2,..., and 
Xm the value y,,. We can select the m-permutation in P(n, m) 
ways. The remainder of the n — m elements in X may be as- 
signed values in Y arbitrarily. The first remaining element in 
X can be assigned a value in Y in m ways. The next remaining 
element in X can also be assigned a value in Y in m ways, and 
so on. Thus the remaining n —m elements in X can be assigned 
values in Y in m"~" ways. Thus the number of functions from 
X onto Y is P(n, m)m'". 


. How many times is the string 10100001 counted in the 
erroneous argument given in Example 6.2.23? 


. How many times is the string 10001000 counted in the 
erroneous argument given in Example 6.2.23? 


. How many times is the string 00000000 counted in the 
erroneous argument given in Example 6.2.23? 


. Let s,,; denote the number of ways to seat n persons at A round 
tables, with at least one person at each table. (The numbers 5p, x 
are called Stirling numbers of the first kind.) The ordering of 
the tables is not taken into account. The seating arrangement 
at a table is taken into account except for rotations. Examples: 


The following pair is not distinct: 
Cc 
D 
©. C) 


A 
D 
©. C) 
The following pair is not distinct: 
A D 
Cc 
The following pair is distinct: 
A D 
Cc 
The following pair is distinct: 


Xe) 


@ ©. 


On, 


= 


@-6 


(a) Show that s,, = Oifk >n. 
(b) Show that s,,, = 1 for alln > 1. 
(c) Show that s,,; = (2 — 1)! for alln > 1. 


(d) Show that sy,—-1 = C(n, 2) for all n > 2. 
(e) Show that 
1 1 1 
Sn,2 = (n pi(i ba 4 had —) 


for alln > 2. 
(f) Show that 


n 
: Sik =n! for alln > 1. 
k=l 


(g) Find a formula for s;,,,-2, n => 3, and prove it. 


91. Let S,,,% denote the number of ways to partition an n-element 
set into exactly k nonempty subsets. The order of the subsets 
is not taken into account. (The numbers S,,, are called Stirling 
numbers of the second kind.) 

(a) Show that S,.; =O if k > n. 


(b) Show that S,,, = 1 for alln > 1. 
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(c) Show that S,,; = 1 for alln > 1. 

(d) Show that $3.2 = 3. 

(e) Show that S42 = 7. 

(f) Show that S43 = 6. 

(g) Show that S,2 = 2"-! _ | forall n > 2. 

(h) Show that S;,—-1 = C(n, 2) for all n > 2. 

(i) Find a formula for S;,,-2, n => 3, and prove it. 


92. Show that there are 


n 
> Snik 
k=1 


equivalence relations on an n-element set. [The numbers S,,x 
are Stirling numbers of the second kind (see Exercise 91).] 


93. If X is ann-element set and Y is an m-element set, n < m, how 
many one-to-one functions are there from X to Y? 


94. If X and Y are n-element sets, how many one-to-one, onto 
functions are there from X to Y? 


95. Show that (n/k)* < C(n, k) < n*/k!. 


Combinations 


Problem 


(a) How many routes are there from the lower-left 
corner to the upper-right corner of an m xn grid 
in which we are restricted to traveling only to 
the right or upward? For example, the follow- 
ing figure is a3 x 5 grid and one route is shown. 


n=5 


(b) Divide the routes into classes based on when 
the route first meets the top edge to derive the 
formula 


So Chk+m— 1,k) = C(m+n,m). 
k=0 
Attacking the Problem 


Example 6.2.21 asked how many paths there were from 
the lower-left corner to the upper-right corner of an 


n X n grid in which we are restricted to traveling only 
to the right or upward. The solution to that problem 
encoded each route as a string of n R’s (right) and n 
U’s (up). The problem then became one of counting 
the number of such strings. Any such string can be ob- 
tained by selecting n positions for the R’s, without re- 
gard to the order of selection, among the 2n available 
positions in the string and then filling the remaining po- 
sitions with U’s. Thus the number of strings and num- 
ber of routes are equal to C(2n, n). 

In the present problem, we can encode each route 
as a string of n R’s (right) and m U’s (up). As in the 
previous problem, we must count the number of such 
strings. Any such string can be obtained by selecting 
n positions for the R’s, without regard to the order of 
selection, among the n + m available positions in the 
string and then filling the remaining positions with U’s. 
Thus the number of strings and number of routes are 
equal to C(n + m, n). We have answered part (a). 

In part (b) we are given a major hint: Divide the 
routes into classes based on when the route first meets 
the top edge. A route can first meet the top edge at 
any one of n + | positions. In the previous figure, 
the route shown first meets the top edge at the third 
position from the left. Before reading on, you might 
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think about why we might divide the routes into 
classes. 

Notice that when we divide the routes into classes 
based on when the route first meets the top edge: 


= The classes are disjoint. 


(A route cannot first meet the top edge in two or more 
distinct positions.) Notice also that every route meets 
the top edge somewhere, so 


= Every route is in some class. 


In the terminology of Section 1.1 (see Example 1.1.25 
and the discussion that precedes it), the classes par- 
tition the set of routes. Because the classes partition 
the set of routes, the Addition Principle applies and the 
sum of the numbers of routes in each class is equal to 
the total number of routes. (No route is counted twice 
since the classes do not overlap, and every route is 
counted once since each route is in some class.) Ev- 
idently, the equation we’re supposed to prove results 
from equating the sum of the number of routes in each 
class to the total number of routes. 


Finding a Solution 


We have already solved part (a). For part (b), let’s look 
at the 3 x 5 grid. There is exactly one route that first 
meets the top edge at the first position from the left. 
There are three routes that first meet the top edge at 
the second position from the left: 


Bas 


Notice that the only variation in the preceding figures 
occurs between the start and the circled dot. To put it 
another way, after a route meets the circled dot, there 
is only one way to finish the trip. Therefore, it suffices 
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to count the number of routes from the lower-left cor- 
ner to the upper-right corner of a 2 x | grid. But we 
already solved this problem in part (a)! The number of 
routes from the lower-left corner to the upper-right cor- 
ner of a 2 x 1 grid is equal to C(2 + 1,1) = 3. Ina 
similar way, we find that the number of routes that first 
meet the top edge at the third position from the left is 
equal to the number of routes from the lower-left cor- 
ner to the upper-right corner of a 2 x 2 grid—namely, 
C(2+2, 2) = 6. By summing we obtain all the routes: 


GG-- 3,5) =e 2,0) ed 2.1) 
+C(2+2,2)+ C3 +2, 3) 
OC He 264) C(o 452 5): 


If we replace each term C(k + 3 — 1, k) by its value, 
we obtain 


56=1+3+6+10+15+21. 


You should verify the preceding formula, find the six 
routes that first meet the top edge at the third position 
from the left, and see why the number of such routes is 
equal to the number of routes from the lower-left cor- 
ner to the upper-right corner of a 2 x 2 grid. 


Formal Solution 


(a) We can encode each route as a string of n R’s 
(right) and m U’s (up). Any such string can 
be obtained by selecting n positions for the 
R’s, without regard to the order of selection, 
among the n + m available positions in the 
string and then filling the remaining positions 
with U’s. Thus the number of routes is equal to 
C(n+m,n). 

(b) Each route can be described as a string contain- 
ing n R’s and m U’s. The last U in such a string 
marks the point at which the route first meets 
the top edge. We count the strings by dividing 
them into classes consisting of strings that end 
U, UR, URR, and so on. There are 


C(n+m-— 1,n) 


strings that end U, since we must choose n slots 
among the first n + m — 1 slots for the n R’s. 
There are 


C(n— 1) +m—1,n—1) 


strings that end UR, since we must choose 
n—1 slots from among the first (2— 1)-+m—1 
slots for the n — 1 R’s. In general, there are 
C(k + m — 1,k) strings that end UR", Since 
there are C(m + n,m) strings altogether, the 
formula follows. 
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Summary of Problem-Solving Techniques ple of a partition of X, we could let X; be the set of 


five-bit strings that contain exactly 7 zeros. 


= Look for a similar problem and imitate its 


Exercises 


solution. 
1. Divide the routes into classes based on when the 
= Counting the number of members of a set in two route first meets a vertical line, and use the Addi- 
different ways leads to an equation. In particular, tion Principle to derive a formula like that proved 
if {X1, X2,..., X,} is a partition of X, the Addi- in this section. 


tion Principle applies and 
IX] = 7 IX. 
i=1 


= Directly enumerate some of the items to be \ I 


counted. 


= Look for patterns. 


Comments 


It’s important to verify that an alleged partition is truly { { IN 
a partition before using the Addition Principle. If X is 
the set of five-bit strings and X; is the set of five-bit 


2. Divide the routes into classes based on when the 
route crosses the slanted line shown. 


for 


SS 


strings that contain 7 consecutive zeros, the Addition 
Principle does not apply; the sets X; are not pairwise Use the Addition Principle to derive a formula like 
disjoint. For example, 00001 € Xz 1X3. As an exam- that proved in this section. 


6.3 


Example 6.3.1 


Go Online 

For more generalized 
permutations and 
combinations, see 
goo.gl/WRoK11 


Generalized Permutations 
and Combinations 


In Section 6.2, we dealt with orderings and selections without allowing repetitions. In 
this section we consider orderings of sequences containing repetitions and unordered 
selections in which repetitions are allowed. 


How many strings can be formed using the following letters? 
MISSISSIPPI 
SOLUTION Because of the duplication of letters, the answer is not 11!, but some num- 


ber less than 11!. 
Let us consider the problem of filling 11 blanks, 


’ 


with the letters given. There are C(11, 2) ways to choose positions for the two P’s. Once 
the positions for the P’s have been selected, there are C(9, 4) ways to choose positions 
for the four S’s. Once the positions for the S’s have been selected, there are C(5, 4) ways 
to choose positions for the four /’s. Once these selections have been made, there is one 
position left to be filled by the M. By the Multiplication Principle, the number of ways 
of ordering the letters is 


Hi! 9! 5S! oI! 
QOl4lst 4rd! 2'4r4r! 


C11, 2)CY9, 4)C(5, 4) = = 34,650. 4 
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Theorem 6.3.2 


Example 6.3.3 


The solution to Example 6.3.1 assumes a nice form. The number 11 that appears 
in the numerator is the total number of letters. The values in the denominator give the 
numbers of duplicates of each letter. This method can be used to establish a general 
formula. 


Suppose that a sequence S of n items has n; identical objects of type 1, nz identical 


objects of type 2, ..., and n, identical objects of type t. Then the number of orderings 
of S is 
n! 
ny! nz! BiG “ny! 


Proof We assign positions to each of the n items to create an ordering of S. We 
may assign positions to the n; items of type | in C(n, 1) ways. Having made these 
assignments, we may assign positions to the nz items of type 2 in C(n — m, nz) ways, 
and so on. By the Multiplication Principle, the number of orderings is 


C(n, ny)C(n — ny, n2)C(n — ny — nz, n3)---C(n— ny — +++ — Ny-1, Mt) 
_ n! (n—n,)! (n—n, —-++—n_1)! 
mi(n—n)!m!(n—ny —m)! n,!0! 
n! 
~ n!no!---m!) 


< 


In how many ways can eight distinct books be divided among three students if Bill gets 
four books and Shizuo and Marian each get two books? 


SOLUTION Put the books in some fixed order. Now consider orderings of four B’s, 
two S’s, and two M’s. An example is 


BBBSMBMS. 


Each such ordering determines a distribution of books. For the example ordering, Bill 
gets books 1, 2, 3, and 6, Shizuo gets books 4 and 8, and Marian gets books 5 and 7. 
Thus the number of ways of ordering BBBBSSMM is the number of ways to distribute 
the books. By Theorem 6.3.2, this number is 


8! 
qaiat 7? < 
We can give an alternate proof of Theorem 6.3.2 by using relations. Suppose that 
a sequence S of n items has n; identical objects of type i fori = 1,..., ¢. Let X denote 
the set of n elements obtained from S by considering the n; objects of type i distinct for 
i=1,...,¢f. For example, if S is the sequence of letters 


MIS S$ 1S Sf PP ff, 
X would be the set 
{M, 1), S1, S2, 12, 83, Sa, 3, P1, Po, Ly}. 


We define a relation R on the set of all permutations of X by the rule: p, Rp if po is 
obtained from p; by permuting the order of the objects of type 1 (but not changing 
their location) and/or permuting the order of the objects of type 2 (but not changing 


Example 6.3.4 


Theorem 6.3.5 
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their location) ... and/or permuting the order of the objects of type f¢ (but not changing 
their location); for example, 


(1,81 S21983S41,P|P2I4M) R (183821; S4S,13P|P2I4M). 


It is straightforward to verify that R is an equivalence relation on the set of all permuta- 
tions of X. 

The equivalence class containing the permutation p consists of all permutations 
of X that are identical if we consider the objects of type i identical for? = 1,...,¢. 
Thus each equivalence class has nj! n2!---n,! elements. Since an equivalence class is 
determined by an ordering of S, the number of orderings of S is equal to the number of 
equivalence classes. There are n! permutations of X, so by Theorem 3.4.16 the number 
of orderings of S is 


n!} 


ny tno! ny! 


Next, we turn to the problem of counting unordered selections where repetitions 
are allowed. 


Consider three books: a computer science book, a physics book, and a history book. 
Suppose that the library has at least six copies of each of these books. In how many 
ways can we select six books? 


SOLUTION The problem is to choose unordered, six-element selections from the set 
{computer science, physics, history}, repetitions allowed. A selection is uniquely deter- 
mined by the number of each type of book selected. Let us denote a particular 
selection as 
CS Physics History 
a |x x] x 

Here we have designated the selection consisting of three computer science books, two 
physics books, and one history book. Another example of a selection is 


CS Physics History 
|x x x x| x x 


which denotes the selection consisting of no computer science books, four physics books, 
and two history books. We see that each ordering of six x’s and two |’s denotes a se- 
lection. Thus our problem is to count the number of such orderings. But this is just the 
number of ways C(8, 2) = 28 of selecting two positions for the |’s from eight possible 
positions. Thus there are 28 ways to select six books. < 


The method used in Example 6.3.4 can be used to derive a general result. 


If X is a set containing t elements, the number of unordered, k-element selections 
from X, repetitions allowed, is 


C(kK+t-—1,t-1)=Ck+t—-1,k). 


Proof Let X = {a,..., a;}. Consider the k + t — 1 slots 
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Example 6.3.6 


Example 6.3.7 


Example 6.3.8 


and k+t— 1 symbols consisting of k x’s and t— | |’s. Each placement of these symbols 
into the slots determines a selection. The number 7; of x’s up to the first | represents the 
selection of n; a,’s; the number n2 of x’s between the first and second |’s represents 
the selection of m2 a2’s; and so on. Since there are C(k + t — 1, t — 1) ways to select 
the positions for the |’s, there are also C(k + t — 1, t — 1) selections. This is equal to 
C(k +t —1,k), the number of ways to select the positions for the x’s; hence there are 


C(fkK+t—1,t-1)=C(k4+t—-1,k) 
unordered, k-element selections from X, repetitions allowed. < 
Suppose that there are piles of red, blue, and green balls and that each pile contains at 
least eight balls. 


(a) In how many ways can we select eight balls? 


(b) In how many ways can we select eight balls if we must have at least one ball of 
each color? 


SOLUTION 
(a) By Theorem 6.3.5, the number of ways of selecting eight balls is 
c(8+3—1,3—1) =C(10, 2) = 45. 


(b) We can also use Theorem 6.3.5 to solve part (b) if we first select one ball of each 
color. To complete the selection, we must choose five additional balls. This can 
be done in 


cb4+3-1,3-1 =C(7, 2) =21 
ways. 4 
In how many ways can 12 identical mathematics books be distributed among the students 
Anna, Beth, Candy, and Dan? 


SOLUTION Wecan use Theorem 6.3.5 to solve this problem if we consider the problem 
to be that of labeling each book with the name of the student who receives it. This is the 
same as selecting 12 items (the names of the students) from the set {Anna, Beth, Candy, 
Dan}, repetitions allowed. By Theorem 6.3.5, the number of ways to do this is 


cd2+4-1,4-—1) = C5, 3) =455. 4 


(a) How many solutions in nonnegative integers are there to the equation 
xX, + x2 + x3 +24 = 29? (6.3.1) 
(b) How many solutions in integers are there to (6.3.1) satisfying x, > 0, x. > 1, 


x3 > 2,x4 > 0? 


SOLUTION 
(a) Each solution of (6.3.1) is equivalent to selecting 29 items, x; of type i, 
i= 1,2,3,4. According to Theorem 6.3.5, the number of selections is 


C29 + 4-—1,4-— 1) = C(32, 3) = 4960. 
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(b) Each solution of (6.3.1) satisfying the given conditions is equivalent to selecting 
29 items, x; of type i, i = 1,2, 3,4, where, in addition, we must have at least 
one item of type 1, at least two items of type 2, and at least three items of type 3. 
First, select one item of type 1, two items of type 2, and three items of type 3. 
Then, choose 23 additional items. By Theorem 6.3.5, this can be done in 


C(23 + 4 — 1,4— 1) = C(26, 3) = 2600 


ways. < 


Example 6.3.9 How many times is the print statement executed? 
fori; =1ton 


for in = 1 toi, 
for 3 = 1 to in 


for i, = 1 to %_ 
printin(i,, i2, ..., ix) 
SOLUTION Notice that each line of output consists of k integers 
iin +++ ig, (6.3.2) 
where 


n>y>nh>-s >> 1, (6.3.3) 


and that every sequence (6.3.2) satisfying (6.3.3) occurs. Thus the problem is to count 
the number of ways of choosing k integers, with repetitions allowed, from the set 
{1,2,...,}. [Any such selection can be ordered to produce (6.3.3).] By Theorem 
6.3.5, the total number of selections possible is C(k +n — 1,k). < 


6.3 Problem-Solving Tips 


The formulas in Section 6.3 generalize the formulas of Section 6.2 by allowing repeti- 
tions. A permutation is an ordering of s;,..., S,, where the s; are distinct. There are n! 
permutations. Now suppose that we have n items containing duplicates—specifically, n; 
identical objects of type 7, fori = 1, ..., f. Then the number of orderings is 
n! 
——————— (6.3.4) 
n!n2!---n,! 

To determine whether one of these formulas may be relevant to a particular problem, 
first be sure that the problem asks for orderings. If the items to be ordered are distinct, 
the permutation formula may be used. On the other hand, if there are duplicates among 
the items to be ordered, formula (6.3.4) may be used. 

An r-combination is an unordered selection of r elements, repetitions not allowed, 
from among n items. There are C(n, r) r-combinations. Now suppose that we want to 
count unordered selections of k elements, repetitions allowed, from among t items. The 
number of such selections is 


Cik+t—1,t-1). (6.3.5) 
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To determine whether one of these formulas may be relevant to a particular problem, 
first be sure that the problem asks for unordered selections. If the items are to be selected 
without repetition, the combination formula may be used. On the other hand, if the items 
are to be selected with repetition, formula (6.3.5) may be used. 

The following table summarizes the various formulas: 


No Repetitions Repetitions Allowed 
Ordered Selections n! n!/(ny!++-ny!) 
Unordered Selections C(n, r) C(A+t-—1,t-1) 
6.3 Review Exercises 
1. How many orderings are there of n items of t types with n; iden- 2. How many unordered, k-element selections are there from a 
tical objects of type 7? How is this formula derived? t-element set, repetitions allowed? How is this formula derived? 
6.3 Exercises 
In Exercises 1-6, determine the number of strings that can be 17. In how many ways can 10 distinct books be divided among 
formed by ordering the letters given. three students if the first student gets five books, the second 
i ? 
1. GUIDE 2. SCHOOL three books, and the third two books? 
3. SALESPERSONS 4. GOOGOO Exercises 18-24 refer to piles of identical red, blue, and green balls 
5. CLASSICS 6. SUGGESTS where each pile contains at least 10 balls. 
7. How many strings can be formed by ordering the letters 18. In how many ways can 10 balls be selected? 
‘ : oe 
Ee eae eons) 19. In how many ways can 10 balls be selected if at least one red 
8. How many strings can be formed by ordering the letters ball must be selected? 


SALESPERSONS if no two S’s are consecutive? 


9. How many strings can be formed by ordering the letters 
SCHOOL using some or all of the letters? 


20. In how many ways can 10 balls be selected if at least one red 
ball, at least two blue balls, and at least three green balls must 
be selected? 


Exercises 10-12 refer to selections among Action Comics, Super- 21. In how many ways can 10 balls be selected if exactly one red 
man, Captain Marvel, Archie, X-Man, and Nancy comics. ball must be selected? 


22. In how many ways can 10 balls be selected if exactly one red 


10. How many ways are there to select six comics? 
ball and at least one blue ball must be selected? 


11. How many ways are there to select 10 comics? 
DR. h ieee ics if h 23. In how many ways can 10 balls be selected if at most one red 
. How many ways are there to select 10 comics 1f we choose at ball is selected? 


least one of each book? 
24. In how many ways can 10 balls be selected if twice as many 


13. How many routes are there in the ordinary xyz-coordinate sys- red balls as green balls must be selected? 


tem from the origin to the point (i, j,k), where i, j, and k are 


positive integers, if we are limited to steps one unit in the posi- In Exercises 25-30, find the number of integer solutions of 
tive x-direction, one unit in the positive y-direction, or one unit 
in the positive z-direction? x) $29 +43 = 15 


14. An exam has 12 problems. How many ways can (integer) 
points be assigned to the problems if the total of the points subject to the conditions given. 


, f od 
is 100 and each problem is worth at least five points? 5 a SOS, k0 


15. A bicycle collector has 100 bikes. How many ways can the 26. x > 1o>lw>l 

bikes be stored in four warehouses if the bikes and the ware- 
houses are considered distinct? 21. #4 = 1,%2 20,4320 
28. x1) > 0,x2 > 0,x3 = 1 


16. A bicycle collector has 100 bikes. How many ways can the 
29. O< x; < 6,22 > 0,x3 > 0 


bikes be stored in four warehouses if the bikes are indistin- 
guishable, but the warehouses are considered distinct? *30. O <x) < 6,1 <x < 9,43 >0 


«31. 


32. 


33. 


34. 


«35, 


36. 


37. 


38. 
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Find the number of solutions in integers to 
x1 +x. 4+%34+%4 = 12 


satisfying 0 < x1 < 4,0 < » < 5,0 < x3 < 8, and 
OK<x4 <9. 


Prove that the number of solutions to the equation 


y+24+=n, n>, 


where x1, x2, and x3 are positive integers, is (n — 1)(n — 2)/2. 
Show that the number of solutions in nonnegative integers of 
the inequality 

X1 $42 +--+ +X <M, 


where M is a nonnegative integer, is C(M +n, n). 


How many integers between | and 1,000,000 have the sum of 
the digits equal to 15? 


How many integers between | and 1,000,000 have the sum of 
the digits equal to 20? 


How many bridge deals are there? (A deal consists of par- 
titioning a 52-card deck into four hands, each containing 13 
cards.) 


In how many ways can three teams containing four, two, and 
two persons be selected from a group of eight persons? 


A domino is a rectangle divided into two squares with each 
square numbered one of 0,1,...,6, repetitions allowed. 
How many distinct dominoes are there? 


Exercises 39-44 refer to a bag containing 20 balls—six red, six 
green, and eight purple. 


39. 


40. 


41. 


42. 


43. 


In how many ways can we select five balls if the balls are con- 
sidered distinct? 


In how many ways can we select five balls if balls of the same 
color are considered identical? 


In how many ways can we draw two red, three green, and two 
purple balls if the balls are considered distinct? 


We draw five balls, then replace the balls, and then draw five 
more balls. In how many ways can this be done if the balls are 
considered distinct? 


We draw five balls without replacing them. We then draw five 
more balls. In how many ways can this be done if the balls are 
considered distinct? 


6.4 


44. 


45. 


46. 


47. 


48. 


49. 
50. 


*51. 


52. 


53. 
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We draw five balls where at least one is red, and then replace 
them. We then draw five balls and at most one is green. In how 
many ways can this be done if the balls are considered distinct? 


In how many ways can 15 identical mathematics books be dis- 
tributed among six students? 


In how many ways can 15 identical computer science books 
and 10 identical psychology books be distributed among five 
students? 


In how many ways can we place 10 identical balls in 12 boxes 
if each box can hold one ball? 


In how many ways can we place 10 identical balls in 12 boxes 
if each box can hold 10 balls? 


Show that (kn)! is divisible by (nl)k. 
By considering 
for ij = lton 
for i2 = 1 toi, 
printin(iy, i2) 


and Example 6.3.9, deduce 


1+2+---4 pee” 
2 
Use Example 6.3.9 to prove the formula 
C(k—1,k-—1)+ Clk, k-— 1) +--+ Cn+k—2,k-1) 
= C(k+n-—1,k). 


Write an algorithm that lists all solutions in nonnegative inte- 
gers tox) + 2X2 +43 =n. 


What is wrong with the following argument, which suppos- 
edly counts the number of partitions of a 10-element set into 
eight (nonempty) subsets? 

List the elements of the set with blanks between them: 


X| —X2— X3— X4— X5 — X6 — X7 — X8 — X9 — X10. 


Every time we fill seven of the nine blanks with seven verti- 
cal bars, we obtain a partition of {x,,..., x19} into eight sub- 
sets. For example, the partition {x;}, {x2}, {13, x4} {x5}, {x6}, 
{x7, xg} {x9}, {x19} would be represented as 


Xy | X2 | x3X4 | x5 | X6 | x7 xg | X9 | X10. 


Thus the solution to the problem is C(9, 7). 


Algorithms for Generating Permutations 


and Combinations 


Go Online 

For more on generating 
permutations, see 
goo.gl/WRoK11 


Ts Bay a 


The rock group Unhinged Universe has recorded n videos whose running times are 
.,t, seconds. A DVD is to be released that can hold C seconds. Since this is 
the first DVD by Unhinged Universe, the group wants to include as much material as 
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Example 6.4.2 


possible. Thus the problem is to choose a subset {i),..., i} of {1,2,...,m} such that 
the sum 


k 
yf (6.4.1) 
j=l 


does not exceed C and is as large as possible. A straightforward approach is to examine 
all subsets of {1, 2, ...,} and choose a subset so that the sum (6.4.1) does not exceed 
C and is as large as possible. To implement this approach, we need an algorithm that 
generates all combinations of an n-element set. In this section we develop algorithms to 
generate combinations and permutations. 

Since there are 2” subsets of an n-element set, the running time of an algorithm that 
examines all subsets is 2(2"). As we saw in Section 4.3, such algorithms are impractical 
to run except for small values of n. Unfortunately, there are problems (an example of 
which is the DVD-filling problem described previously) for which no method much 
better than the “list all” approach is known. 

Our algorithms list permutations and combinations in lexicographic order. 
Lexicographic order generalizes ordinary dictionary order. 

Given two distinct words, to determine whether one precedes the other in the dic- 
tionary, we compare the letters in the words. There are two possibilities: 


1. The words have different lengths, and each letter in the shorter word 
is identical to the corresponding letter in the longer word. 


2. The words have the same or different lengths, and at some 
position, the letters in the words differ. (6.4.2) 


If 1 holds, the shorter word precedes the longer. (For example, “dog” precedes 
“doghouse” in the dictionary.) If 2 holds, we locate the leftmost position p at which the 
letters differ. The order of the words is determined by the order of the letters at position p. 
(For example, “gladiator” precedes “gladiolus” in the dictionary. At the leftmost position 
at which the letters differ, we find “‘a” in “gladiator” and “o” in “gladiolus”; “a” precedes 
“o” in the alphabet.) 

Lexicographic order generalizes ordinary dictionary order by replacing the alpha- 
bet by any set of symbols on which an order has been defined. We will be concerned 


with strings of integers. 


Definition 6.4.1 > Leta = s)5)-+-+s, and B = th--+ty be strings over 


{1,2,...,m}. We say that @ is lexicographically less than B and write a < f if either 
(a) p < qand 5; = fj fori=1,...,p, 

or 
(b) for some i, s; ~ t;, and for the smallest such i, we have s; < fj. 4 


In Definition 6.4.1, case (a) corresponds to possibility 1 of (6.4.2) and case (b) cor- 
responds to possibility 2 of (6.4.2). 


Leta = 132 and 6B = 1324 be strings over {1, 2, 3, 4}. In the notation of Definition 6.4.1, 
P=3,¢g=45 = 1s = 3,53 = 2,4 = 1,f = 3, = 2, and t = 4. Since 
p=3<4=qands; = ¢; fori = 1,2, 3, condition (a) of Definition 6.4.1 is satisfied. 
Therefore, a < B. < 


Example 6.4.3 
Example 6.4.4 


Example 6.4.5 


Example 6.4.6 


Example 6.4.7 


Example 6.4.8 
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Let a = 13246 and 6 = 1342 be strings over {1, 2, 3, 4, 5, 6}. In the notation of Defini- 
tion 6.4.1,p =5,qg= 4,51 = 1, 892 = 3,53 = 2,54 =4,55 = 6, =1b =3,h =4, 
and t4 = 2. The smallest i for which 5; 4 ¢; is i = 3. Since s3 < f3, by condition (b) of 
Definition 6.4.1, a < B. < 


Let w = 1324 and 6 = 1342 be strings over {1, 2, 3, 4}. In the notation of Definition 6.4.1, 
p=q=45 = 1,5. = 3,53 = 2,54 = 4,4 = 1,h = 3,6 = 4, and ty = 2. The 
smallest i for which s; 4 ¢; is i = 3. Since s3 < f;, by condition (b) of Definition 6.4.1, 
a<fB. < 


Let a = 13542 and 6 = 21354 be strings over {1, 2, 3, 4, 5}. In the notation of Defi- 
nition 6.4.1, 5s} = 1, 5) = 3,53 = 5,54 = 4,55 =2,t) =2,h =1,8 =3,t4 =5, 
and ts = 4. The smallest i for which 5; ¥ ¢; is i = 1. Since s; < t, by condition (b) of 
Definition 6.4.1, a < B. < 


For strings of the same length over {1, 2, ..., 9}, lexicographic order is the same 
as numerical order on the positive integers if we interpret the strings as decimal numbers 
(see Examples 6.4.4 and 6.4.5). For strings of unequal length, lexicographic order may 
be different than numerical order (see Example 6.4.3). Throughout the remainder of this 
section, order will refer to lexicographic order. 

First we consider the problem of listing all r-combinations of {1,2,..., }. In 
our algorithm, we will list the r-combination {x,,...,x,} as the string s; ---s,, where 
St < So < +++ < s, and {x,,...,x-} = {51,...,5,}. For example, the 3-combination 
{6, 2, 4} will be listed as 246. 

We will list the r-combinations of {1, 2, ..., } in lexicographic order. Thus the 
first listed string will be 12---r and the last listed string will be (n — r+ 1)---n. 


Consider the order in which the 5-combinations of {1, 2, 3, 4, 5, 6, 7} will be listed. The 
first string is 12345, which is followed by 12346 and 12347. The next string is 12356, 
followed by 12357. The last string will be 34567. < 


Find the string that follows 13467 when we list the 5-combinations of X = {1, 2, 3, 4, 
5, 6, 7}. 


SOLUTION No string that begins 134 and represents a 5-combination of X exceeds 
13467. Thus the string that follows 13467 must begin 135. Since 13567 is the smallest 
string that begins 135 and represents a 5-combination of X, the answer is 13567. < 


Find the string that follows 2367 when we list the 4-combinations of X = {1, 2, 3, 4, 
5; 6,7}: 


SOLUTION No string that begins 23 and represents a 4-combination of X exceeds 
2367. Thus the string that follows 2367 must begin 24. Since 2456 is the smallest string 


that begins 24 and represents a 4-combination of X, the answer is 2456. < 
A pattern is developing. Given a string a=s,---s,, which represents the 
r-combination {s;,...,5,-}, to find the next string 6 =f, ---t,, we find the rightmost 


element s,,, that is not at its maximum value. (s, may have the maximum value n, s,_| 
may have the maximum value n — 1, etc.) Then 


t; = 8; fori=1,...,m—1. 
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The element f,, is equal to s,, + 1. For the remainder of the string 6B we have 
tm+1 oh (Sm = 2)(Sm + 3) aaa 


The algorithm follows. 


This algorithm lists all r-combinations of {1,2,...,} in increasing lexicographic 
order. 
Input: r,n 
Output: All r-combinations of {1, 2, ..., m} in increasing lexicographic 
order 
1. combination(r, n) { 
De for? — 1ito7; 
Be ss = i 
4. println(s,,...,5,) // print the first y-combination 
3 Loniy— Dito (name 
6. aie 
iL max_val=n 
8. while (s,,, == max_val) { 
G), // find the rightmost element not at its maximum value 
10. m=m—1 
iit, max_val = max_val — | 
12: } 
113}, // the rightmost element is incremented 
14. Sm =Sm+1 
IS, // the rest of the elements are the successors of 5,» 
16. forj=m-+l1tor 
7, ss=s-itl 
18. println(s,,..., 8,) // print the ith combination 
I), } 


~) 
S 
eo 


We will show how Algorithm 6.4.9 generates the 5-combination of {1, 2, 3, 4,5, 6, 7} 
that follows 23467. We are supposing that 


s=2, so =3, s93=4, so =6, ss=7. 


At line 13, we find that 53 is the rightmost element not at its maximum value. At line 14, 
53 18 set to 5. At lines 16 and 17, sq is set to 6 and ss is set to 7. At this point 


Sic= 2, So = 3, 53 —5, 54 = 6, 55 = 7. 


We have generated the 5-combination 23567, which follows 23467. 


We next prove that Algorithm 6.4.9 is correct, that is, that it generates all 
r-combinations of {1, 2, ..., } in increasing lexicographic order. 


Theorem 6.4.11 


Example 6.4.12 


Example 6.4.13 
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Algorithm 6.4.9 generates all r-combinations of {1,2,...,m} in increasing lexico- 
graphic order. Furthermore, each r-combination output by Algorithm 6.4.9 lists the 
digits in increasing order. 


Proof In this proof, “lexicographic order” means “increasing lexicographic order.” 
We first show that Algorithm 6.4.9 generates r-combinations of {1, 2,...,m} in lexi- 
cographic order and that each r-combination lists the digits in increasing order. We use 
induction on k to show that the kth r-combination output by Algorithm 6.4.9 lists the dig- 
its in increasing order, and that the (k + 1)st r-combination output by Algorithm 6.4.9 
is lexicographically the next r-combination. 

For k = 1 (base case), the first r-combination output by Algorithm 6.4.9 is 12---r, 
which has the digits in increasing order. If r = n, the proof is complete. Otherwise, in the 
for loop at lines 5-19, Algorithm 6.4.9 generates the next (i.e., second lexicographically) 
r-combination 12---(r — 1)(r + 1) and outputs it. This r-combination has the digits in 
increasing order. 

If k = C(u,r), the proof is complete. Otherwise, for the inductive step, assume 
that the kth r-combination output by Algorithm 6.4.9 has the digits in increasing order. 
In the for loop at lines 5-19, Algorithm 6.4.9 generates the (k + 1)st (lexicographi- 
cally, the next) r-combination, and, by construction, the digits are in increasing order. 
The inductive step is complete. Therefore, Algorithm 6.4.9 generates r-combinations of 
{1,2,..., } in lexicographic order, and each r-combination has the digits in increasing 
order. 

Finally, we show that Algorithm 6.4.9 generates all r-combinations of {1, 2, ..., m}. 
Suppose, by way of contradiction, that some r-combination is not generated by Algo- 
rithm 6.4.9, and let s, with the digits listed in increasing order, be the least r-combination 
(lexicographically) that is not generated. Then s is not 12 - - -m because it is generated. Let 
s’ be the predecessor of s lexicographically. Since s is the least r-combination not gener- 
ated, s’ is generated. But by construction Algorithm 6.4.9 generates the next 
r-combination, which is s. This contradiction shows that Algorithm 6.4.9 generates all 
r-combinations of {1,2,..., m}. < 


The 4-combinations of {1, 2, 3, 4, 5, 6} as listed by Algorithm 6.4.9 are 


1234, 1235, 1236, 1245, 1246, 1256, 1345, 1346, 
1356, 1456, 2345, 2346, 2356, 2456, 3456. < 


Let X = {1, 2, 3, 4,5, 6, 7, 8, 9}. 


(a) What is the first 5-combination of X listed by Algorithm 6.4.9 that begins 24? 
(b) What is the last 5-combination of X listed by Algorithm 6.4.9 that begins 13? 


(c) How many 5-combinations of X begin 13? 


SOLUTION 

(a) The digit following 4 must be larger than 4 since each combination listed by Al- 
gorithm 6.4.9 has the digits in increasing order. Since 246__, 247__, 248__,... 
are lexicographically larger than 245__, and Algorithm 6.4.9 lists the 
5-combinations in lexicographic order, the digit following 4 must be 5. Similarly 
the digit following 5 must be 6, and the final digit must be 7 giving us 24567. 
Therefore the first 5-combination of X listed by Algorithm 6.4.9 that begins 24 
is 24567. 
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Example 6.4.14 


Example 6.4.15 


(b) The digit following 3 must be as large as possible but smaller than the last two 
digits since Algorithm 6.4.9 lists 5-combinations with digits in increasing order. 
Thus the digit following 3 must be 7; hence, the desired 5-combination begins 
137. Similarly the digit following 7 must be 8, and the digit following 8 must be 9. 
Therefore, the last 5-combination of X listed by Algorithm 6.4.9 that begins 13 
is 13789. 


(c) Since Algorithm 6.4.9 generates all 5-combinations of X with the digits in in- 
creasing order, the last three digits must be selected from 


{4, 5, 6, 7, 8, 9}, 


and there are C(6, 3) = 20 of them. Therefore, there are twenty 5-combinations 
of X that begin 13. < 


Like the algorithm for generating r-combinations, the algorithm to generate per- 
mutations will list the permutations of {1, 2, ..., 7} in lexicographic order. (Exercise 26 
asks for an algorithm that generates all r-permutations of an n-element set.) 


To construct the permutation of {1, 2, 3, 4, 5, 6} that follows 163542, we should keep as 
many digits as possible at the left the same. 

Can the permutation following the given permutation have the form 1635__? 
Since the only permutation of the form 1635__ distinct from the given permutation is 
163524, and 163524 is smaller than 163542, the permutation following the given per- 
mutation is not of the form 1635__. 

Can the permutation following the given permutation have the form 163___? The 
last three digits must be a permutation of {2, 4, 5}. Since 542 is the largest permutation 
of {2, 4, 5}, any permutation that begins 163 is smaller than the given permutation. Thus 
the permutation following the given permutation is not of the form 163___. 

The reason that the permutation following the given permutation cannot begin 
1635 or 163 is that in either case the remaining digits in the given permutation (42 and 
542, respectively) decrease. Therefore, working from the right, we must find the first 
digit d whose right neighbor r satisfies d<r. In our case, the third digit, 3, has this 
property. Thus the permutation following the given permutation will begin 16. 

The digit following 16 must exceed 3. Since we want the next smallest permuta- 
tion, the next digit is 4, the smallest available digit. Thus the desired permutation begins 
164. The remaining digits 235 must be in increasing order to achieve the minimum value. 
Therefore, the permutation following the given permutation is 164235. < 


We see that to generate all of the permutations of {1, 2,..., m}, we can begin with 
the permutation 12 ---n and then repeatedly use the method of Example 6.4.12 to gener- 
ate the next permutation. We will end when the permutation n(n — 1) - - - 21 is generated. 


Using the method of Example 6.4.14, we can list the permutations of {1, 2, 3, 4} in lex- 
icographic order as 


1234, 1243, 1324, 1342, 1423, 1432, 2134, 2143, 
2314, 2341, 2413, 2431, 3124, 3142, 3214, 3241, 
3412, 3421, 4123, 4132, 4213, 4231, 4312, 4321. 4 


The algorithm follows. 


Algorithm 6.4.16 


Go Online 

For a C program 
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Generating Permutations 


This algorithm lists all permutations of {1,2,...,m} in increasing lexicographic 
order. 
Input: n 
Output: All permutations of {1, 2, ..., m} in increasing lexicographic order 
1. permutation(n) { 
2} fori=1ton 
3 s= i 
4 println(s,,..., 5) // print the first permutation 
3) for i = 2 ton! { 
6. m=n-—1 
7 while (Sm > Sm+1) 
8 // find the first decrease working from the right 
9), m=m—1 
10. ke 
iil, while (Sm > sx) 
i122, // find the rightmost element s, with 5, < 5, 
13}, k=k—-1 
14. SWaP(Sins Sk) 
Isy, eal 
16. = ial 
17. while (p < q) { 
18. // swap Sm+1 and Sp, Swap Sm42 and s,—1, and so on 
19. swap (Sp, Sq) 
20. a=psr ll 
Pall q=q-1 
22 } 
22). printin(s,,..., 8») // print the ith permutation 
24. } 
2, Ij 


We leave the proof that Algorithm 6.4.16 is correct to the exercises (see Exercise 33). 


Example 6.4.17 Show how Algorithm 6.4.16 generates the permutation that follows 163542. 


SOLUTION Suppose that 


seal, se=]6,. s=3). Hs, set, 142 


and that we are at line 6. The largest index m satisfying sy, < S41 is 3. At lines 10-13, 
we find that the largest index k satisfying s; > Ss» is 5. At line 14, we swap sj, and sx. 
At this point, we have s = 164532. At lines 15—22, we reverse the order of the elements 
$48556 = 532. We obtain the desired permutation, 164235. < 


Example 6.4.18 Let X = {1, 2,3, 4,5, 6, 7, 8, 9}. 


(a) What is the first permutation of X listed by Algorithm 6.4.16 that begins 48? 
(b) What is the last permutation of X listed by Algorithm 6.4.16 that begins 48? 
(c) How many permutations of X begin 48? 
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SOLUTION 

(a) The digit following 8 must be | since 482--- ,483--- ,... are lexicographically 
larger than 481 ---, and Algorithm 6.4.16 lists the permutations in lexicographic 
order. Similarly the next digits must be 235679. Therefore the first permutation 
of X listed by Algorithm 6.4.16 that begins 48 is 481235679. 
The digit following 8 must be as large as possible since Algorithm 6.4.16 lists the 
permutations in lexicographic order. Thus the digit following 8 is 9. Similarly, 
the next digits must be 765321. Therefore, the last permutation of X listed by 
Algorithm 6.4.16 that begins 48 is 489765321. 


(c) The permutations of X that begin 48 are 48 followed by a permutation of 


(b 


YS 


{1, 2, 3,5, 6,7, 9}, 


and there are 7! = 5040 of them. Therefore, there are 5040 permutations of X 


that begin 48. < 
6.4 Review Exercises 
1. Define lexicographic order. 3. Describe the algorithm for generating permutations. 


2. Describe the algorithm for generating r-combinations. 


In Exercises 1-3, find the r-combination that will be generated by 18. How many 6-combinations of X listed by Algorithm 6.4.9 end 


Algorithm 6.4.9 with n = 7 after the r-combination given. 46? 
1. 1356 2. 12367 3, 14567 19. How many 6-combinations of X listed by Algorithm 6.4.9 start 
2 and end 79? 
In Exercises 4—6, find the permutation that will be generated by _ 2). What is the first permutation of X listed by Algorithm 6.4.16 
Algorithm 6.4.16 after the permutation given. that ends 97? 
4. 12354 5. 625431 6. 12876543 21. What is the last permutation of X listed by Algorithm 6.4.16 
Bc. ig . 4 . that ends 397? 
7. For each string in Exercises 1-3, explain (as in Exam- : . ’ 
ple 6.4.10) exactly how Algorithm 6.4.9 generates the next 22. How ai permutations of X listed by Algorithm 6.4.16 end 
r-combination. 1938? 
8. For each string in Exercises 4—6, explain (as in Exam- 23. ee permutations of X listed by Algorithm 6.4.16 end 


ple 6.4.17) exactly how Algorithm 6.4.16 generates the next 
permutation. 24. Modify Algorithm 6.4.9 so that line 5 
9. Show the output from Algorithm 6.4.9 when n = 6 andr = 3. 


10. Show the output from Algorithm 6.4.9 when n = 6 and r = 2. ee SEE 


11. Show the output from Algorithm 6.4.9 when n = 7 andr = 5. is eliminated. Base the terminating condition on the fact that 
12. Show the output from Algorithm 6.4.16 when n = 2. the last r-combination has every element s; equal to its maxi- 
13. Show the output from Algorithm 6.4.16 when n = 3. mum value. 
In Exercises 14-23, let X = {1, 2,3, 4,5, 6,7, 8, 9}. 25. Modify Algorithm 6.4.16 so that line 5 
14. What is the first 5S-combination of X listed by Algorithm 6.4.9 5. for i = 2 ton! { 
that ends 79? ee —— - 
15, What is the last 5-combination of X listed by Algorithm 6.4.9 is eliminated. Base the terminating condition on the fact that 
that ends 689 eS the last permutation has the elements s; in decreasing order. 
16. How many 5-combinations of X listed by Algorithm 6.4.9 end 26. Write an algorithm that generates all r-permutations of an 
89? n-element set. 


17. How many 5-combinations of X listed by Algorithm 6.4.9end 27. Write an algorithm whose input is an r-combination of 
539 {1,2,...,n}. The output is the next (in lexicographic 


28. 


29. 


30. 


order) r-combination. The first r-combination follows the last 
r-combination. 


Write an algorithm whose input is a permutation of 
{1,2,...,}. The output is the next (in lexicographic or- 
der) permutation. The first permutation follows the last 
permutation. 


Write an algorithm whose input is an r-combination of 
{1,2,...,n}. The output is the previous (in lexicographic 
order) r-combination. The last r-combination precedes the 
first r-combination. 


Write an algorithm whose input is a permutation of 
{1,2,...,}. The output is the previous (in lexicographic or- 
der) permutation. The last permutation precedes the first per- 
mutation. 
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x31. Write a recursive algorithm that generates all r-combinations 
of the set {51, 52, ...,5,}. Divide the problem into two sub- 
problems: 


@ List the r-combinations containing 51. 
@ List the r-combinations not containing s1. 


32. Write a recursive algorithm that generates all permuta- 
tions of the set {s1,s2,...,8,}. Divide the problem into n 
subproblems: 


© List the permutations that begin with s,. 
® List the permutations that begin with s2. 


@ List the permutations that begin with s,,. 
33. Show that Algorithm 6.4.16 is correct. 


6.5 Introduction to Discrete Probabilityt 


Probability was developed in the seventeenth century to analyze games and, in this 
earliest form, directly involved counting. For example, suppose that a six-sided fair die 
whose sides are labeled 1, 2,3, 4,5, 6 is rolled (see Figure 6.5.1). “Fair” means that 


Go Online 

For more on 
probability, see 
goo.gl/WRoK11 


each number is equally likely to appear when the die is rolled. To compute the chance 
or probability that an even number appears, we first count how many ways an even 
number can appear (three: 2, 4, 6) and how may ways an arbitrary number can appear 
(six: 1, 2, 3, 4,5, 6); then, the probability is the quotient: 3/6 = 1/2. After introducing 


some terminology, we will give several examples of computing probabilities. 


Figure 6.5.1 Rolling a fair die. 
“Fair” means that each number is 
equally likely to appear when the die 
is rolled. [Photo by the author. Hand 
courtesy of Ben Schneider. ] 


An experiment is a process that yields an outcome. An event is an outcome or 
combination of outcomes from an experiment. The sample space is the event consisting 


of all possible outcomes. 


Example 6.5.1 


Examples of experiments are 


@ Rolling a six-sided die. 


= Randomly selecting 5 microprocessors from a lot of 1000 microprocessors. 


®™ Selecting a newborn child at St. Rocco’s Hospital. 


‘This section can be omitted without loss of continuity. 
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Example 6.5.3 


Example 6.5.4 


Example 6.5.5 


Examples of events that might occur when the previous experiments are performed are 


= Obtaining a 4 when rolling a six-sided die. 
m Finding no defective microprocessors out of 5 randomly chosen from a lot of 1000. 


® Selecting a newborn female child at St. Rocco’s Hospital. 
The sample spaces for the previous experiments are 


= The numbers 1, 2, 3, 4,5, 6—all possible outcomes when a die is rolled. 


= All possible combinations of 5 microprocessors selected from a lot of 1000 micro- 
processors. 


m= All newborn children at St. Rocco’s Hospital. < 


If all outcomes in a finite sample space are equally likely, the probability of an 
event is defined as the number of outcomes in the event divided by the number of out- 
comes in the sample space. In the following section, we will relax the assumption that 
all outcomes are equally likely. 


Definition 6.5.2 » Let S be a finite sample space in which all outcomes are 
equally likely. The probability P(E) of an event E from S is 


P(E) = lel a 
\S| 


Two fair dice are rolled. What is the probability that the sum of the numbers on the dice 
is 10? 


SOLUTION Since the first die can show any one of six numbers and the second die 
can show any one of six numbers, by the Multiplication Principle there are 6-6 = 36 
possible sums; that is, the size of the sample space is 36. There are three possible ways 
to obtain the sum of 10—(4, 6), (5,5), (6, 4)—that is, the size of the event “obtaining a 
sum of 10” is 3. [The notation (x, y) means that we obtain x on the first die and y on the 
second die.] Therefore, the probability is 3/36 = 1/12. < 


Five microprocessors are randomly selected from a lot of 1000 microprocessors among 
which 20 are defective. Find the probability of obtaining no defective microprocessors. 


SOLUTION There are C(1000, 5) ways to select 5 microprocessors among 1000. There 
are C(980, 5) ways to select 5 good microprocessors since there are 1000 — 20 = 980 
good microprocessors. Therefore, the probability of obtaining no defective micropro- 
cessors is 


C(980, 5) 980 -979 -978 -977 -976 
= = 0.903735781. 
C(1000,5) 1000 +999 - 998 -997 -996 < 


In a state lottery game, to win the grand prize the contestant must match six distinct 
numbers, in any order, among the numbers 1 through 52 randomly drawn by a lottery 
representative. What is the probability of choosing the winning numbers? 


‘Recall that |X| is the number of elements in a finite set X. 


Example 6.5.6 
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SOLUTION Six numbers among 52 can be selected in C(52, 6) ways. Since there is 
one winning combination, the probability of choosing the winning numbers is 
1 _ 6! 
C(52,6) 52-51 -50-49-48 -47 


= 0.000000049. 4 


A bridge hand consists of 13 cards from an ordinary 52-card deck. Find the probability 
of obtaining a 4—4—4—1 distribution, that is, four cards in each of three different suits and 
one card of a fourth suit. 


SOLUTION There are C(52, 13) bridge hands. The one-card suit can be chosen in 
4 ways, and the card itself can be chosen in 13 ways. Having chosen this card, we must 
choose four cards from each of the three remaining suits, which can be done in C(13, 4)3 
ways. Thus there are 4-13-C(13, 4)3 hands with a 444-1 distribution. Therefore, the 
probability of obtaining a 4-4—4—1 distribution is 


4-13-C(13, 4)3 
= 0.03. 


C(52, 13) < 


6.5 Review Exercises 


1. What is an experiment? 


2. What is an event? 


3. What is a sample space? 


4. If all outcomes in a finite sample space are equally likely, how 


is the probability of an event defined? 


In Exercises 1-4, suppose that a coin is flipped and a die is rolled. 


1. 
2. 


List the members of the sample space. 


List the members of the event “the coin shows a head and the 
die shows an even number.” 


. List the members of the event “the die shows an odd number.” 


. List the members of the event “the coin shows a head and the 


die shows a number less than 4.” 


In Exercises 5—7, two dice are rolled. 


5. 


List the members of the event “the sum of the numbers on the 
dice is even.” 


. List the members of the event “doubles occur” (i.e., the num- 


bers are the same on both dice). 


. List the members of the event “4 appears on at least one die.” 


. Give an example of an experiment different from those in this 


section. 


. Give an example of an event when the experiment of Exercise 


8 is performed. 


. What is the sample space for the experiment of Exercise 8? 
. One fair die is rolled. What is the probability of getting a 5? 
. One fair die is rolled. What is the probability of getting an even 


number? 


. One fair die is rolled. What is the probability of not getting 


a5? 


. Acard is selected at random from an ordinary 52-card deck. 


What is the probability that it is the ace of spades? 


. A card is selected at random from an ordinary 52-card deck. 


What is the probability that it is a jack? 


. A card is selected at random from an ordinary 52-card deck. 


What is the probability that it is a heart? 


. Two fair dice are rolled. What is the probability that the sum 


of the numbers on the dice is 9? 


. Two fair dice are rolled. What is the probability that the sum 


of the numbers on the dice is odd? 


. Two fair dice are rolled. What is the probability of doubles? 


. Four microprocessors are randomly selected from a lot of 100 


microprocessors among which 10 are defective. Find the prob- 
ability of obtaining no defective microprocessors. 


. Four microprocessors are randomly selected from a lot of 


100 microprocessors among which 10 are defective. Find 
the probability of obtaining exactly one defective micropro- 
cessor. 


. Four microprocessors are randomly selected from a lot of 100 


microprocessors among which 10 are defective. Find the prob- 
ability of obtaining at most one defective microprocessor. 
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23. In the California Daily 3 game, a contestant must select 
three numbers among 0 to 9, repetitions allowed. A “straight 
play” win requires that the numbers be matched in the ex- 
act order in which they are randomly drawn by a lottery rep- 
resentative. What is the probability of choosing the winning 
numbers? 


24. In the California Daily 3 game, a contestant must select three 
numbers among 0 to 9. One type of “box play” win requires 
that three numbers match in any order those randomly drawn 
by a lottery representative, repetitions allowed. What is the 
probability of choosing the winning numbers, assuming that 
the contestant chooses three distinct numbers? 


25. In the Maryland Lotto game, to win the grand prize the con- 
testant must match six distinct numbers, in any order, among 
the numbers | through 49 randomly drawn by a lottery rep- 
resentative. What is the probability of choosing the winning 
numbers? 


26. In the multi-state Big Game, to win the grand prize the con- 
testant must match five distinct numbers, in any order, among 
the numbers | through 50, and one Big Money Ball num- 
ber between | and 36, all randomly drawn by a lottery rep- 
resentative. What is the probability of choosing the winning 
numbers? 


27. In the Maryland Cash In Hand game, to win the grand prize 
the contestant must match seven distinct numbers, in any or- 
der, among the numbers | through 31 randomly drawn by a 
lottery representative. What is the probability of choosing the 
winning numbers? 

28. Find the probability of obtaining a bridge hand with 5-4—2-2 
distribution, that is, five cards in one suit, four cards in another 
suit, and two cards in each of the other two suits. 


29. Find the probability of obtaining a bridge hand consisting only 
of red cards, that is, no spades and no clubs. 


Exercises 30-33 concern an unprepared student who takes a 
10-question true—false quiz and guesses at the answer to every 
question. 


30. What is the probability that the student answers every question 
correctly? 


31. What is the probability that the student answers every question 
incorrectly? 

32. What is the probability that the student answers exactly one 
question correctly? 


33. What is the probability that the student answers exactly five 
questions correctly? 


Exercises 34-36 refer to a small consumer survey in which 10 peo- 
ple were asked to choose a cola among Coke, Pepsi, and RC. 


34. If each person chose a cola randomly, what is the probability 
that no one chose Coke? 


35. If each person chose a cola randomly, what is the probability 
that at least one person did not choose Coke? 


36. If each person chose a cola randomly, what is the probability 
that everyone chose Coke? 
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37. If five student records are chosen randomly, what is the prob- 
ability that they are chosen so that the first record selected has 
the lowest grade point average (GPA), the second selected has 
the second-lowest GPA, and so on? Assume that the GPAs are 
distinct. 


Exercises 38-40 concern three persons who each randomly choose 
a locker among 12 consecutive lockers. 


38. What is the probability that the three lockers chosen are 
consecutive? 


39. What is the probability that no two lockers are consecutive? 


40. What is the probability that at least two of the lockers are 
consecutive? 


Exercises 41—44 deal with a roulette wheel that has 38 numbers: 
18 red, 18 black, a 0, and a 00 (0 and 00 are neither red nor 
black). When the wheel is spun, all numbers are equally likely to 
be selected. 


41. What is the probability that the wheel lands on a black num- 
ber? 


42. What is the probability that the wheel lands on a black number 
twice in a row? 


43. What is the probability that the wheel lands on 0? 
44. What is the probability that the wheel lands on 0 or 00? 


Exercises 45—48 concern the Monty Hall problem, in which a con- 
testant chooses one of three doors; behind one of the doors is a car, 
and behind the other two are goats. After the contestant chooses a 
door, the host opens one of the other two doors that hides a goat. 
(Because there are two goats, the host can open a door that hides 
a goat no matter which door the contestant first chooses.) The host 
then gives the contestant the option of abandoning the chosen door 
in favor of the still-closed, unchosen door. For each strategy, what 
is the probability of winning the car? 


45. Stay with the door initially chosen. 


46. Make a random decision about whether to stay with the door 
initially chosen or switch to the unchosen, unopened door. 


47. Switch to the unchosen, unopened door. 


48. Suppose that the host forgets which door hides the car and, 
after the contestant chooses a door, picks a door at ran- 
dom. If the door hides a car, the game is over. Assuming 
that the host chooses a door that hides a goat, what is the 
probability of winning the car for each strategy in Exercises 
45-47? 


Exercises 49-51 concern a variant of the Monty Hall problem, in 
which a contestant chooses one of four doors; behind one of the 
doors is a car, and behind the other three are goats. After the con- 
testant chooses a door, the host opens one of the other three doors 
that hides a goat. The host then gives the contestant the option of 
abandoning the chosen door in favor of one of the two still-closed, 
unchosen doors. For each strategy, what is the probability of win- 
ning the car? 


49. Stay with the door initially chosen. 


50. 


51. 


52. 


53. 


Make a random decision about whether to stay with the door 
initially chosen or switch to one of the unchosen, unopened 
doors. 


Switch to one of the unchosen, unopened doors. The choice 
between the two unchosen, unopened doors is made randomly. 


In a multiple-choice exam, one question has three choices: 
A, B, C. A student randomly chooses A. The teacher then 
states that choice C is incorrect. What is the probability of 
a correct answer if the student stays with choice A? What is 
the probability of a correct answer if the student switches to 
choice B? 


Is the following reasoning correct? A county health inspec- 
tor told a restaurant offering four-egg quiches that, because 
research by the FDA (Food and Drug Administration) shows 
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that one in four eggs is contaminated with salmonella bacteria, 
the restaurant should use only three eggs in each quiche. 


54. A two-person game is played in which a fair coin is tossed 
until either the sequence HT (heads, tails) or the sequence TT 
(tails, tails) appears. If HT appears, the first player wins; if TT 
appears, the second player wins. Would you rather be the first 
or second player? Explain. 


Exercises 55 and 56 refer to 10 identical compact discs that are 
randomly given to Mary, Ivan, and Juan. 


55. What is the probability that each person receives at least two 
compact discs? 


56. What is the probability that Ivan receives exactly three com- 
pact discs? 


6.6 _ Discrete Probability Theory* 


In Section 6.5, we assume that all outcomes are equally likely; that is, if there are n 
possible outcomes, the probability of each outcome is 1/n. In general, the outcomes are 
not equally likely. For example, a “loaded” die is weighted so that certain numbers are 
more likely to appear than others. To handle the case of outcomes that are not equally 
likely, we assign a probability P(x) to each outcome x. The values P(x) need not all be 
the same. We call P a probability function. Throughout this section, we assume that all 


sample spaces are finite. 


Definition 6.6.1 >» A probability function P assigns to each outcome x in a sam- 
ple space S a number P(x) so that 


and 


0 <P) <1 for allx € S, 


S Fy 1. 


xeS < 


The first condition guarantees that the probability of an outcome is nonnegative 
and at most 1, and the second condition guarantees that the sum of all the probabilities 
is 1—that is, that some outcome will occur when the experiment is performed. 


Example 6.6.2 Suppose that a die is loaded so that the numbers 2 through 6 are equally likely to appear, 
—e———_ but that | is three times as likely as any other number to appear. To model this situation, 


we should have 


and 


P(2) = P(3) = P(4) = PS) = P(6) 


P(1) = 3P(2). 


iThis section can be omitted without loss of continuity. 
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Example 6.6.4 


Theorem 6.6.5 


Since 


1 = P(1) + P(2) + P(3) + P(4) + P(S) + P(6) 
= 3P(2) + P(2) + P(2) + P(2) + P(2) + P(2) = 8P(2), 


we must have P(2) = 1/8. Therefore, 


P(2) = P(3) = P(4) = PS) = P(6) = ; 
and 


P(1) = 3P(2) = 4 


The probability of an event E is defined as the sum of the probabilities of the 
outcomes in E. 


Definition 6.6.3 > Let E be anevent. The probability of E, P(E), is 


PRY = Y_ Pe): 
xeE 
< 
Given the assumptions of Example 6.6.2, the probability of an odd number is 
P(1) + P(3) + P(S) = : + : =h i= 
8 8 8 8 
Of course, for a fair die (with equally likely probabilities), the probability of an odd 
number is 1/2. < 


Formulas 


We next develop some formulas that are useful in computing probabilities. 


Let E be an event. The probability of E, the complement of E, satisfies 


P(E) + P(E) =1. 


Proof Suppose that E = {x,,...,x,} and E = {xy14,..., x,}. Then 


n 


k 
P(E) = So PC) and P(E) = x P(x;). 
i=1 


i=k+1 


Now 


k n 
P(E) + PE) => Pa) + D> Pa 
i=1 


ixk+1 
= * P(x;) = 1. 
i=l 


The last equality follows from Definition 6.6.1, which states that the sum of the proba- 
bilities of all outcomes equals 1. < 


Example 6.6.6 


Example 6.6.7 


Go Online 
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Example 6.6.8 
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Theorem 6.6.5 is often useful when it is easier to compute P(E) than P(E). After 
computing P(E), we may obtain P(E) by subtracting P(E) from 1. 


Five microprocessors are randomly selected from a lot of 1000 microprocessors among 
which 20 are defective. In Example 6.5.4, we found that the probability of obtaining no 
defective microprocessors is 0.903735781. By Theorem 6.6.5, the probability of obtain- 
ing at least one defective microprocessor is 


1 — 0.903735781 = 0.096264219. 


Notice how much more complex a direct approach would be to calculate the prob- 
ability of obtaining at least one deficient microprocessor. We would have to calculate 
the probability of obtaining exactly one deficient microprocessor, then exactly two de- 
ficient microprocessors, then exactly three deficient microprocessors, then exactly four 
deficient microprocessors, and then exactly five deficient microprocessors; and then sum 
the resulting probabilities: 


C(20, 1)C(980, 4) + C(20, 2)C(980, 3) + C20, 3)C(980, 2) 
+ C(20, 4)C(980, 1) + C(20, 5)C(980, 0) 
C(1000, 5) < 


Birthday Problem Find the probability that among n persons, at least two people have 
birthdays on the same month and date (but not necessarily in the same year). Assume 
that all months and dates are equally likely, and ignore February 29 birthdays. 


SOLUTION We let E denote the event “at least two persons have the same birthday.” 
Then E is the event “no two persons have the same birthday.” As we shall see, it is easier 
to compute P(E) than P(E). We can use Theorem 6.6.5 to obtain the desired probability. 

Since all months and dates are equally likely and we are ignoring February 29 birth- 
days, the size of the sample space is 365”. 

The first person’s birthday can occur on any one of 365 days. If no two persons 
have the same birthday, the second person’s birthday can occur on any day except the day 
of the first person’s birthday. Therefore, the second person’s birthday can occur on any 
one of 364 days. Similarly, the third person’s birthday can occur on any one of 363 days. 
It follows that the size of the event “no two persons have the same birthday” is 


365 -364--- (365 —n+ 1). 


By Theorem 6.6.5, the probability that at least two persons have birthdays on the 
same month and date is 


365 -364--- (365 —n+ 1) 
365” , 


For n = 22, the probability is 0.475695, and for n = 23, the probability is 0.507297. 
Thus if n > 23, the probability is greater than 1/2 that at least two persons have birth- 
days on the same month and date. Many persons would guess that n would have to be 
considerably larger than 23 for the probability to be greater than 1/2. < 


If £; and E> are events, the event E; U E2 represents the event E) or E> (or both), 
and the event £; M E> represents the event E, and Ep. 


Among a group of students, some take art and some take computer science. A student 
is selected at random. Let A be the event “the student takes art,’ and let C be the event 
“the student takes computer science.” Then A U C is the event “the student takes art or 
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Theorem 6.6.9 


Example 6.6.10 


computer science or both,” and A M C is the event “the student takes art and computer 
science.” < 


The next theorem gives a formula for the probability of the union of two events. 
In the case of equally likely outcomes, it is an application of the Inclusion-Exclusion 
Principle for two sets (see Theorem 6.1.13). 


Let E; and E> be events. Then 


P(E, U Ep) = P(E\) + P(Ep) — P(E, 9 Ep). 


Proof Let 
Ey = {x1,...,xi} 
Ey = {y1,-.-5 yy} 
EF, NE = {21,---5 Zh 


and assume that each set element is listed exactly one time per set (see Figure 6.6.1). 
Then in the list 


Hise os Xi Vip cond Nis 


Zj,...,Z,% occurs twice. It follows that 


i i k 
P(E U Fo) = > PQ) + 9) PO) — D> Pe) 


t=1 t=1 t=1 


= P(E\) + P(E) — P(E, 9 Ed). 


Ey Ey 


Figure 6.6.1 Events E; and E>. The x’s denote the 
elements in E), the y’s denote the elements in E>, and the z’s 
denote the elements in E, M E>. The z’s are thus found 
twice: once among the x’s and again among the y’s. 


Two fair dice are rolled. What is the probability of getting doubles (two dice showing 
the same number) or a sum of 6? 


Corollary 6.6.11 


Example 6.6.12 
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SOLUTION We let E; denote the event “get doubles” and E> denote the event “get a 
sum of 6.” Since doubles can be obtained in six ways, 


6 1 


Since the sum of 6 can be obtained in five ways [(1, 5), (2, 4), (3, 3), (4, 2), 6, DJ], 
P(Ey) = 2 
1 36° 


The event £) M E> is “get doubles and get a sum of 6.” Since this last event can occur 
only one way (by getting a pair of 3s), 


1 
P(E; N Ey) = —. 
(E, 0 £5) 36 
By Theorem 6.6.9, the probability of getting doubles or a sum of 6 is 


P(E, U Ey) = P(E\) + P(Ea) — P(E, 9 Ep) 
LL. 3 1 > 


6" 36 56° Is" < 


Events E; and E> are mutually exclusive if E,; E> = ©. It follows from Theo- 
rem 6.6.9 that if £; and £2 are mutually exclusive, 


P(E| U Ex) = P(E\) + P(E2). 


If E; and E> are mutually exclusive events, 


P(E, U Ep) = P(E)) + P(Ed). 


Proof Since £, and E) are mutually exclusive events, E; 0 Ex = ©. Therefore, 
P(E; M Ex) = 0. Theorem 6.6.9 now gives 


P(E) U Ex) = P(E1) + P(E2) — P(E) 9 E2) = P(E1) + P(E). < 


Two fair dice are rolled. Find the probability of getting doubles or the sum of 5. 


SOLUTION We let E; denote the event “get doubles” and £2 denote the event “get the 
sum of 5.” Notice that E; and E> are mutually exclusive: You cannot get doubles and the 
sum of 5 simultaneously. Since doubles can be obtained in six ways, 


6 1 


Since the sum of 5 can be obtained in four ways [(1, 4), (2, 3), (3, 2), (4, DJ], 


4 1 


By Corollary 6.6.11, 


P(E| U Ex) = P(E}) + P(E2) = = + 


306 Chapter 6 ® Counting Methods and the Pigeonhole Principle 


Example 6.6.14 


1,1 | 2,1 | 3,1 | 4,1 | 5,1 | 6,1 
1,2 | 2,2 | 3,2 | 4,2 | 5,2 | 6,2 
1,3 | 2,3 | 3,3 | 4,3 [Ea 6,3 
1,4 | 2,4 | 3,4 | 4,4 | 5,4 | 6,4 
is || 25) || She) || 4he) || 243) || Ge) 
1,6 | 2,6 | 3,6 | 4,6 | 5,6 | 6,6 


Figure 6.6.2 Rolling two fair dice. 
Since each outcome is assigned the 
value 1/36, the probability of getting a 
sum of 10 is 1/12. If at least one die 
shows a 5, one of the shaded outcomes 
occurs. The shaded outcomes become 
the new sample space, and each shaded 
outcome is reassigned the value 1/11. 
The probability of getting a sum of 10 
given that at least one 5 occurs is 1/11. 


Conditional Probability 


Suppose that we roll two fair dice. The sample space consists of all 36 possible outcomes 
with each outcome assigned the value 1/36 (see Figure 6.6.2). The probability of getting 
a sum of 10 is 1/12, the sum of the values of the outcomes that add up to 10. 

Let us modify the example slightly. Suppose that we roll two dice and we are told 
that at least one die is 5. Now the probability of getting a sum of 10 is no longer 1/12 
because we know that one of the outcomes shown shaded in Figure 6.6.2 has occurred. 
Since the 11 outcomes shaded are equally likely, the probability of getting a sum of 10 
given that at least one die is 5 is 1/11. A probability given that some event occurred is 
called a conditional probability. 

We now discuss conditional probabilities in general. We let P(E | F) denote the 
probability of E given F. In this situation, F becomes the new sample space. Since the 
values of the outcomes in F originally summed to P(F), we change the value of each 
outcome in F by dividing it by P(F) so that the reassigned values sum to 1. The outcomes 
that satisfy E given that F occurred are precisely the outcomes in EM F. Summing the 
reassigned values of the outcomes in EM F, we obtain the value of P(E | F): 


P(ENF) 
P(F) ~ 


This discussion motivates the following definition. 


Definition 6.6.13 > Let E and F be events, and assume that P(F) > 0. The 
conditional probability of E given F is 


P(ENF) 


P(E|F) = a 


Use Definition 6.6.13 to compute the probability of getting a sum of 10, given that at 
least one die shows 5, when two fair dice are rolled. 


Example 6.6.15 


Example 6.6.17 
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SOLUTION Let E denote the event “getting a sum of 10,” and let F denote the event 
“at least one die shows 5.” The event EM F is “getting a sum of 10 and at least one die 
shows 5.” Since only one outcome belongs to EN F, 


1 
PEENF) = 3. 


Since 11 outcomes belong to F (see Figure 6.6.2), 


PCF) = 11 
~ 36° 
Therefore, 
1 
P(ENF) 36 1 
P(E| F) = = = 
lf) P(F) Mit 
36 < 


Weather records show that the probability of high barometric pressure is 0.80, and the 
probability of rain and high barometric pressure is 0.10. Using Definition 6.6.13, the 
probability of rain given high barometric pressure is 

P(RNH) _ 0.10 


PR h=——_ = —_ 0135, 
ae P(A) ie 


where R denotes the event “rain,” and H denotes the event “high barometric pressure.” 
< 


Independent Events 


If the probability of event E does not depend on event F in the sense that P(E | F) = P(E), 
we say that E and F are independent events. By Definition 6.6.13, 


P(E F) 
P(E | F) = —— 
P(F) 
Thus if E and F are independent events, 
P(E) = PE|F) = 20) 
. PCF) 


or 
P(ENF) = P(E)P(F). 


We take this last equation as the formal definition of independent events. 


Definition 6.6.16 > Events E and F are independent if 


P(EN F) = P(E)P(F). < 


Intuitively, if we flip a fair coin twice, the outcome of the second toss does not depend 
on the outcome of the first toss (after all, coins have no memory). For example, if H is 
the event “head on first toss,’ and T is the event “tail on second toss,” we expect that 
events H and T are independent. Use Definition 6.6.16 to verify that H and T are indeed 
independent. 
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Example 6.6.18 


Example 6.6.19 


SOLUTION The event H 1 T is the event “head on first toss and tail on second toss.” 
Thus P(N T) = 1/4. Since P(H#) = 1/2 = P(T), we have 


1 1 1 
P(H NT) = i (5) (5) = P(H)P(T). 


Therefore, events H and T are independent. < 


Joe and Alicia take a final examination in discrete mathematics. The probability that Joe 
passes is 0.70, and the probability that Alicia passes is 0.95. Assuming that the events 
“Joe passes” and “Alicia passes” are independent, find the probability that Joe or Alicia, 
or both, passes the final exam. 


SOLUTION We let J denote the event “Joe passes the final exam” and A denote the 
event “Alicia passes the final exam.” We are asked to compute P(J U A). 
Theorem 6.6.9 says that 


P(J UA) = P(VJ) + P(A) — PU NA). 


Since we are given P(/) and P(A), we need only compute P(J 1 A). Because the events 
J and A are independent, Definition 6.6.16 says that 


P(J MA) = P(J)P(A) = (0.70)(0.95) = 0.665. 
Therefore, 


P(JUA) = PV) + P(A) — P(J NA) = 0.70 + 0.95 — 0.665 = 0.985. < 


Pattern Recognition and Bayes’ Theorem 


Pattern recognition places items into various classes based on features of the items. 
For example, wine might be placed into the classes premium, table wine, and swill based 
on features such as acidity and bouquet. One way to perform such a classification uses 
probability theory. Given a set of features F’, one computes the probability of a class 
given F for each class and places the item into the most probable class; that is, the class C 
chosen is the one for which P(C | F) is greatest. 


Let R denote class premium, T denote class table wine, and S denote class swill. Suppose 
that a particular wine has feature set F and 


P(R|F)=0.2, P(T|F)=0.5,  P(S|F) =0.3. 


Since class table wine has the greatest probability, this wine would be classified as table 
wine. < 


Bayes’ Theorem is useful in computing the probability of a class given a set of 
features. 


Theorem 6.6.20 


Example 6.6.21 
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Bayes’ Theorem 
Suppose that the possible classes are C;,..., Cy. Suppose further that each pair of 
classes is mutually exclusive and each item to be classified belongs to one of the 
classes. For a feature set F, we have 

PF | C)P(G) 
We AG || Cac 


P(G||F) = 


Proof By Definition 6.6.13, 


P(C; NF) 
P(C; | F) = ———_., 
(Gil F) P® 
and again by Definition 6.6.13, 
P(F OCG) 
P(F | C;) = ——— 
i P(C) 


Combining these equations, we obtain 


P(GNF) _ P(F|C)P(G) 
P(F) P(F) 


P(Ci|F) = 
To complete the proof of Bayes’ Theorem, we need to show that 


PCF) = 5° PF | C)P(C)). 
i=! 
Because each item to be classified belongs to one of the classes, we have 


F=(FNC))U(FNG)U---U(FNG,). 


Since the C; are pairwise mutually exclusive, the FM C; are also pairwise mutually ex- 
clusive. By Corollary 6.6.11, 


P(F) = P(FOC)) + PUFA C2) ++-+- + PF OAC,). 
Again by Definition 6.6.13, 
P(F OC) = P(F | C)P(G). 
Therefore, 


P(F) = 0 PFC) P(C), 


i=1 
and the proof is complete. < 
Telemarketing At the telemarketing firm Sel/Phone, Dale, Rusty, and Lee make calls. 


The following table shows the percentage of calls each caller makes and the percentage 
of persons who are annoyed and hang up on each caller: 


Caller 
Dale Rusty Lee 
Percent of calls 40 25 35 


Percent of hang-ups 20 55 30 
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Example 6.6.22 


Let D denote the event “Dale made the call,” let R denote the event “Rusty made 
the call,’ let L denote the event “Lee made the call,’ and let H denote the event “the 
caller hung up.” Find P(D), P(R), P(L), P(H | D), P(H| R), P(H | L), P(D| H), P(R| A), 
P(L|A), and P(A). 


SOLUTION Since Dale made 40 percent of the calls, P(D) = 0.4. Similarly, from the 
table we obtain P(R) = 0.25 and P(L) = 0.35. 

Given that Dale made the call, the table shows that 20 percent of the persons hung 
up; therefore, P(H | D) = 0.2. Similarly, 


P(H|R) = 0.55 and P(H | L) = 0.3. 


To compute P(D | H), we use Bayes’ Theorem: 
P(H|D)P(D) 
P(H | D)P(D) + P| R)P(R) + P(A | L)P(L) 


7 (0.2)(0.4) 
~~ (0.2)(0.4) + (0.55)(0.25) + (0.3) (0.35) 


P(D|H) = 


= 0.248. 


A similar computation using Bayes’ Theorem gives P(R|H) = 0.426. Again using 
Bayes’ Theorem or noting that 


P(D|H)+P(R|H)+ P(L|A) = 1, 


we obtain P(L| H) = 0.326. 
Finally, the proof of Bayes’ Theorem shows that 


P(H) = P(H | D)P(D) + P| R)P(R) + P(A | LL) P(L) 


= (0.2)(0.4) + (0.55)(0.25) + (0.3)(0.35) = 0.3225. 4 


Detecting the HIV Virus The enzyme-linked immunosorbent assay (ELISA) test is 
used to detect antibodies in blood and can indicate the presence of the HIV virus. Ap- 
proximately 15 percent of the patients at one clinic have the HIV virus. Furthermore, 
among those that have the HIV virus, approximately 95 percent test positive on the 
ELISA test. Among those that do not have the HIV virus, approximately 2 percent test 
positive on the ELISA test. Find the probability that a patient has the HIV virus if the 
ELISA test is positive. 


SOLUTION The classes are “has the HIV virus,” which we denote H, and “does not 
have the HIV virus” (1). The feature is “tests positive,’ which we denote Pos. Using this 
notation, the given information may be written 


P(H) =0.15, P(H) =0.85, P(Pos|H) =0.95, P(Pos|H) = 0.02. 


Bayes’ Theorem gives the desired probability: 
P(Pos | H)P(H) 
P(Pos | H)P(H) + P(Pos | H)P(H) 


= (0.95) (0.15) — 0.893. 
(0.95) (0.15) + (0.02) (0.85) < 


P(H| Pos) = 
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6.6 Review Exercises 


1. What is a probability function? 


2. If P is a probability function and all outcomes are equally 
likely, what is the value of P(x), where x is an outcome? 


3. How is the probability of an event defined? 


4. If E is an event, how are P(E) and P(E) related? Explain how 
the formula is derived. 


5. If E; and E> are events, what does the event E; U E2 represent? 
6. If E; and E> are events, what does the event E; M E2 represent? 


7. If E; and E2 are events, how are P(E; UE2), P(E, NE2), P(E)), 
and P(E2) related? Explain how the formula is derived. 


8. Explain what it means for two events to be “mutually 
exclusive.” 


9. Give an example of mutually exclusive events. 


10. If E; and £2 are mutually exclusive events, how are P(E; U 
E), P(E\), and P(E2) related? Explain how the formula is de- 
rived. 


11. Give an informal, intuitive description of the meaning of the 
event E given F. 


12. How is the event E given F denoted? 

13. Give a formula for the probability of E given F. 

14. Explain what it means for two events to be “independent.” 
15. Give an example of independent events. 

16. What is pattern recognition? 


17. State Bayes’ Theorem. Explain how the formula is derived. 


Exercises 1-3 refer to Example 6.6.2 in which a die is loaded so 
that the numbers 2 through 6 are equally likely to appear, but 1 is 
three times as likely as any other number to appear. 

1. One die is rolled. What is the probability of getting a 5? 


2. One die is rolled. What is the probability of getting an even 
number? 


3. One die is rolled. What is the probability of not getting a 5? 
Exercises 4-9 refer to a die that is loaded so that the numbers 1 


and 3 are equally likely to appear and 2, 4, 5, and 6 are equally 
likely to appear. However, 1 is three times as likely to appear as 2. 


4. One die is rolled. Assign probabilities to the outcomes that 
accurately model the likelihood of the various numbers to ap- 
pear. 


. One die is rolled. What is the probability of getting a 3? 

. One die is rolled. What is the probability of getting a 4? 

. One die is rolled. What is the probability of not getting a 3? 

. One die is rolled. What is the probability of getting a | ora 4? 


Coe nMrrainn 


. One die is rolled. What is the probability of getting a 1 ora 4 
or a6? 


Exercises 10-19 refer to dice that are loaded so that the numbers 
2, 4, and 6 are equally likely to appear. 1, 3, and 5 are also equally 
likely to appear, but I is three times as likely as 2 is to appear. 


10. One die is rolled. Assign probabilities to the outcomes that ac- 
curately model the likelihood of the various numbers to appear. 
11. One die is rolled. What is the probability of getting a 5? 


12. One die is rolled. What is the probability of getting an even 
number? 


13. One die is rolled. What is the probability of not getting a 5? 


14. Two dice are rolled. What is the probability of getting doubles? 

15. Two dice are rolled. What is the probability of getting a sum 
of 7? 

16. Two dice are rolled. What is the probability of getting doubles 
or a sum of 6? 

17. Two dice are rolled. What is the probability of getting a sum 
of 6 given that at least one die shows 2? 

18. Two dice are rolled. What is the probability of getting a sum 
of 6 or doubles given that at least one die shows 2? 

19. Two dice are rolled. What is the probability of getting a sum 
of 6 or a sum of 8 given that at least one die shows 2? 


In Exercises 20-24, suppose that a coin is flipped and a die is 
rolled. Let E, denote the event “the coin shows a tail,” let E> 
denote the event “the die shows a 3,” and let E3 denote the event 
“the coin shows heads and the die shows an odd number.” 


20. List the elements of the event E; or E. 
21. List the elements of the event E2 and E3. 
22. Are E; and £2 mutually exclusive? 

23. Are E; and £3 mutually exclusive? 

24. Are E> and £3 mutually exclusive? 


25. Six microprocessors are randomly selected from a lot of 
100 microprocessors among which 10 are defective. Find 
the probability of obtaining no defective microprocessors. 


26. Six microprocessors are randomly selected from a lot of 
100 microprocessors among which 10 are defective. Find the 
probability of obtaining at least one defective microprocessor. 


27. Six microprocessors are randomly selected from a lot of 
100 microprocessors among which 10 are defective. Find 
the probability of obtaining at least three defective micropro- 
cessors. 
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Exercises 28-35 refer to a family with four children. Assume that 
it is equally probable for a boy or a girl to be born. 


28. What is the probability of all girls? 

29. What is the probability of exactly two girls? 

30. What is the probability of at least one boy and at least one 
girl? 

31. What is the probability of all girls given that there is at least 
one girl? 

32. What is the probability of exactly two girls given that there is 
at least one girl? 


33. What is the probability of at least one boy and at least one girl 
given that there is at least one girl? 


34. Are the events “there are children of both sexes” and “there is 
at most one boy” independent? 


35. Are the events “there is at most one boy” and “there is at most 
one girl” independent? 

36. A family has n children. Assume that it is equally probable 
for a boy or a girl to be born. For which values of n are the 
events “there are children of both sexes” and “there is at most 
one girl” independent? 


Exercises 37-45 refer to a fair coin that is repeatedly flipped. 


37. If the coin is flipped 10 times, what is the probability of no 
heads? 


38. Ifthe coin is flipped 10 times, what is the probability of exactly 
five heads? 


39. Ifthe coin is flipped 10 times, what is the probability of exactly 
four or five or six heads? 


40. If the coin is flipped 10 times, what is the probability of at 
least one head? 


41. If the coin is flipped 10 times, what is the probability of at 
most five heads? 


42. Ifthe coin is flipped 10 times, what is the probability of exactly 
five heads given at least one head? 


43. Ifthe coin is flipped 10 times, what is the probability of exactly 
four or five or six heads given at least one head? 


44, If the coin is flipped 10 times, what is the probability of at 
least one head given at least one tail? 


45. If the coin is flipped 10 times, what is the probability of at 
most five heads given at least one head? 


46. Suppose that you buy a lottery ticket containing k distinct 
numbers from among {1,2,...,}, 1 < k <n. To determine 
the winning ticket, k balls are randomly drawn without re- 
placement from a bin containing n balls numbered 1, 2, ...,n. 
What is the probability that at least one of the numbers on your 
lottery ticket is among those drawn from the bin? 


Exercises 47-50 ask about the following situation. In a small 
charity fundraiser, 70 tickets are sold numbered 1 through 70. 
Each person buys one ticket. Later in the evening, 20 numbers 
are randomly drawn from among I through 70, and those holding 
these numbers win modest prizes. Among those buying the tickets 
are Maya and Chloe. 


47 


48. 
49. 


50. 
51. 


52. 


$3, 


54. 


55. 


56. 


§7. 


58. 


. What is the probability that either Maya or Chloe (or both) 
wins a prize? 
What is the probability that both Maya and Chloe win prizes? 


What is the probability that either Maya or Chloe, but not 
both, wins a prize? 


What is the probability that Maya, but not Chloe, wins a prize? 


Prove that Algorithm 4.2.4, which generates random permu- 
tations, can potentially output any permutation of the input 
and that all outcomes are equally likely. 


Mr. Wizard suggests changing the third line of Algorithm 
4.2.4, which generates random permutations, to 


swap (di, rand(1 wn)) 


Show that in Mr. Wizard’s version, the algorithm can po- 
tentially output any permutation of the input, but that the 
outcomes are not equally likely. 


Find the probability that among n persons, at least two people 
have birthdays on April 1 (but not necessarily in the same 
year). Assume that all months and dates are equally likely, 
and ignore February 29 birthdays. 


Find the least n such that among n persons, the probability 
that at least two persons have birthdays on April 1 (but not 
necessarily in the same year) is greater than 1/2. Assume that 
all months and dates are equally likely, and ignore February 
29 birthdays. 


Find the probability that among n > 3 persons, at least three 
people have birthdays on the same month and date (but not 
necessarily in the same year). Assume that all months and 
dates are equally likely, and ignore February 29 birthdays. 


Under the conditions of Exercise 55, find the minimum value 
of n for which the probability of at least three people having 
birthdays on the same month and date is greater than or equal 
to 1/2. 


Suppose that a professional wrestler is selected at random 
among 90 wrestlers, where 35 are over 350 pounds, 20 are 
bad guys, and 15 are over 350 pounds and bad guys. What is 
the probability that the wrestler selected is over 350 pounds 
or a bad guy? 

Suppose that the probability of a person having a headache 
is 0.01, that the probability of a person having a fever given 
that the person has a headache is 0.4, and that the probability 
of a person having a fever is 0.02. Find the probability that a 
person has a headache given that the person has a fever. 


Exercises 59-62 refer to a company that buys computers from 
three vendors and tracks the number of defective machines. The 
following table shows the results. 


Vendor 
Acme DotCom Nuclear 
Percent purchased 55 10 35 
Percent defective 1 3 3 
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Let A denote the event “the computer was purchased from Acme,” 


let 


D denote the event “the computer was purchased from Dot- 


Com,” let N denote the event “the computer was purchased 
from Nuclear,” and let B denote the event “the computer was 
defective.” 


59. 
60. 
61. 
62. 
63. 


64. 


65. 


66. 


Find P(A), P(D), and P(N). 

Find P(B|A), P(B|D), and P(B|N). 
Find P(A |B), P(D|B), and P(N |B). 
Find P(B). 


In Example 6.6.22, how small would P(H) have to be so that 
the conclusion would be “no HIV” even if the result of the test 
is positive? 


Show that for any events FE; and E>, 
P(E) 1 Ex) = P(E) + P(E2) — 1. 


Use mathematical induction to show that if FE, Eo, ... 
events, then 


, E, are 


n 
P(E| UE) U-+-U En) < 5 P(E). 
i=l 


If E and F are independent events, are E and F independent? 


6.7 


67. 
68. 


69. 


313 


If E and F are independent events, are E and F independent? 


Is the following reasoning correct? Explain. 

A person, concerned about the possibility of a bomb on 
a plane, estimates the probability of a bomb on a plane to be 
0.000001. Not satisfied with the chances, the person computes 
the probability of two bombs on a plane to be 


0.0000017 = 0.000000000001. 


Satisfied with the chances now, the person always carries a 
bomb on a plane so that the probability of someone else car- 
rying a bomb, and thus there being two bombs on the plane, 
is 0.000000000001—small enough to be safe. 


A track enthusiast decides to try to complete the East-South- 
East Marathon. The runner will stop if the marathon is com- 
pleted or after three attempts. The probability of completing 
the marathon in one attempt is 1/3. Analyze the following 
argument that, assuming independence, purportedly shows 
that the runner is almost, but not quite, certain to complete the 
marathon. 

Since the probability of each attempt is 1/3 = 0.3333, 
after three attempts the probability of completing the marathon 
is 0.9999, which means that the runner is almost, but not quite, 
certain to complete the marathon. 


Binomial Coefficients and 


Combinatorial Identities 


At first glance the expression (a+b)” does not have much to do with combinations; but as 
we will see in this section, we can obtain the formula for the expansion of 
(a+b)" by using the formula for the number of r-combinations of n objects. Frequently, 
we can relate an algebraic expression to some counting process. Several advanced count- 
ing techniques use such methods (see [Riordan; and Tucker]). 


Go Online 

For more on the 
binomial theorem, see 
goo.gl/WRoK11 


(a +b)". Since 


(a+b)" = (a+b)a+b)---(a+b), 
SS 


The Binomial Theorem gives a formula for the coefficients in the expansion of 


(6.7.1) 


n factors 


the expansion results from selecting either a or b from each of the n factors, multiply- 
ing the selections together, and then summing all such products obtained. For example, 
in the expansion of (a + b)3, we select either a or b from the first factor (a + b); either 
aor b from the second factor (a + b); and either a or b from the third factor (a + b); 
multiply the selections together; and then sum the products obtained. If we select a from 
all factors and multiply, we obtain the term aaa. If we select a from the first factor, 
b from the second factor, and a from the third factor and multiply, we obtain the term 
aba. Table 6.7.1 shows all the possibilities. If we sum the products of all the selections, 


we obtain 


(a+b)? = (a4 


= daa 


+ b)(a + b)(a + b) 


+ aab + aba + abb + baa + bab + bba + bbb 
=@4+@0b4+a0b+al’?+ab+abl?+ar’?+b> 
=a 4+3a°b + 3ab> +b’. 
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Theorem 6.7.1 


Example 6.7.2 


Example 6.7.3 


TABLE 6.7.1 m Computing (a + b)? 


Selection from Selection from Selection from 
First Factor Second Factor Third Factor Product of 
(a+b) (a+b) (a+b) Selections 
a a a aaa = a? 
a a b aab = a*b 
a b a aba = a*b 
a b b abb = ab” 
b a a baa = a*b 
b a b bab = ab* 
b b a bba = ab* 
b b b bbb = b° 


In (6.7.1), a term of the form a’*b* arises from choosing b from k factors and a 
from the other n — k factors. But this can be done in C(n, k) ways, since C(n, k) counts 
the number of ways of selecting k things from n items. Thus a’~*b* appears C(n, k) 
times. It follows that 


(a +b)" = C(n, O)a"b® + C(n, Iya"! b! + C(n, 2)a"?b* 
+--+ C(n,n— Valo"! +c, nya°b". (6.7.2) 


This result is known as the Binomial Theorem. 


Binomial Theorem 


If a and b are real numbers and n is a positive integer, then 


(a+b)" = we C(n, Ka" *b*. 
k=0 


Proof The proof precedes the statement of the theorem. < 


The Binomial Theorem can also be proved using induction on n (see Exercise 17). 
The numbers C(n, r) are known as binomial coefficients because they appear in 
the expansion (6.7.2) of the binomial a + b raised to a power. 


Taking n = 3 in Theorem 6.7.1, we obtain 


(a+b) = CG, 0)a*b® + CB, Yarb! + CB, 2)a'b? + CG, 3)a°b? 
=a +3a°b + 3ab? +b’. 4 


Expand (3x — 2y)* using the Binomial Theorem. 
SOLUTION If we take a = 3x, b = —2y, andn = 4 in Theorem 6.7.1, we obtain 


(3x — 2y)* = (a+b)! 
= C(4, O)atb® + C(4, Lath! + C4, 2)a*b? 
+C(4, 3)a'b? + C(4, 4)a°b* 
= C(4, 0)(3x)*(—2y)? + C(4, Ix)? (—2y)! 
+ C(4, 2)(3x)°(—2y)? + C(4, 3)(3x)!(—2y)? 


Example 6.7.4 


Example 6.7.5 


Figure 6.7.1 Pascal’s triangle. 


Go Online 

A biography of 
Pascal is at 
goo.gl/WRoK11 


Theorem 6.7.6 
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+ C(4, 4)(3x)°(—2y)* 

= 34y4 4.4.33x3(—2y) + 6 -32x7(-2)°y" 

+4(3x)(—2)?y? + (-2)*y* 

= 81x4 — 216x3y + 216x7y? — 96xy? + L6y*. < 


Find the coefficient of a>b* in the expansion of (a + b)?. 


SOLUTION The term involving a°b* arises in the Binomial Theorem by taking n = 9 
and k = 4: 


C(n, kya" *bk = C9, 4)a°b* = 126a°b*. 


Thus the coefficient of a°b* is 126. < 


Find the coefficient of x?y*z* in the expansion of (x + y +z)’. 


SOLUTION Since 
(xtyt+z) =@+y+Dx+y+2)---(@+y+z) (nine terms), 


we obtain x?y>z* each time we multiply together x chosen from two of the nine terms, 
y chosen from three of the nine terms, and z chosen from four of the nine terms. We can 
choose two terms for the x’s in C(9, 2) ways. Having made this selection, we can choose 
three terms for the y’s in C(7, 3) ways. This leaves the remaining four terms for the z’s. 
Thus the coefficient of x*y*z* in the expansion of (x + y + z)? is 


9! 7! 9! 


CO. 2901, 3) = dra 31a — 231A 


= 1260. < 


We can write the binomial coefficients in a triangular form known as Pascal’s 
triangle (see Figure 6.7.1). The border consists of 1’s, and any interior value is the sum 
of the two numbers above it. This relationship is stated formally in the next theorem. The 
proof is a combinatorial argument. An identity that results from some counting process 
is called a combinatorial identity and the argument that leads to its formulation is called 
a combinatorial argument. 


Can+1,k4 =Cn,k-1)+C(n,k) 


forl <k<n. 


Proof Let X be aset with n elements. Choose a ¢ X. Then C(n+ 1, k) is the number 
of k-element subsets of Y = X U {a}. Now the k-element subsets of Y can be divided 
into two disjoint classes: 


1. Subsets of Y not containing a. 


2. Subsets of Y containing a. 


The subsets of class | are just k-element subsets of X and there are C(n, k) of these. Each 
subset of class 2 consists of a (k — 1)-element subset of X together with a and there are 
C(n, k — 1) of these. Therefore, 


Cin+1,k) = C(n,k— 1) + C(n,k). < 
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Example 6.7.7 


Example 6.7.8 


Example 6.7.9 


Theorem 6.7.6 can also be proved using Theorem 6.2.17 (Exercise 18 of this 
section). 

We conclude by showing how the Binomial Theorem (Theorem 6.7.1) and 
Theorem 6.7.6 can be used to derive other combinatorial identities. 


Use the Binomial Theorem to derive the equation 


y cae. (6.7.3) 


k=0 


SOLUTION The sum is the same as the sum in the Binomial Theorem, 


a C(n, Ka" *b*, 


k=0 


except that the expression a”~*b* is missing. One way to “eliminate” this expression is 
to take a = b = 1, in which case the Binomial Theorem becomes 


Fase =¥ Cr => Coe. 


k=0 k=0 < 


There are often many different ways to prove a result. In Example 6.7.7, we used 
an algebraic technique (the Binomial Theorem) to prove equation (6.7.3). In the next 
two examples, we first give a combinatorial proof and then a proof using mathematical 
induction of equation (6.7.3). 


Prove equation (6.7.3) using a combinatorial argument. 


SOLUTION Given an n-element set X, C(n, k) counts the number of k-element subsets 
of X. Thus the left side of equation (6.7.3) counts the total number of subsets of X. 
But the number of subsets of X is 2” (see Theorem 2.4.6), and equation (6.7.3) follows 
immediately. < 


Prove equation (6.7.3) using mathematical induction. 


SOLUTION The Basis Step (n = 0) is readily verified. 
For the Inductive Step, we first assume that equation (6.7.3) is true for n. Using 
Theorem 6.7.6, we have 


n+l n 
So cn+ 1=Cin+ 1,0)+ 5° Ca+ 1kb+Cnm4+1,n41) 
k=0 k=1 


— [4 > ich, k-1)+C(n,k] +1 
k=1 


=2+5>Ca,k-1) +5) Cn, k) 
k=1 k=1 


=2+ + 


* C(n, k) — C(n, n) > C(n, k) — C(n, 0| 


k=0 k=0 
=9R eH S2-2S2". P 
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Example 6.7.10 Use Theorem 6.7.6 to show that 


SoC.) = Ct 1k+I). (6.7.4) 
i=k 


SOLUTION We use Theorem 6.7.6 in the form 
CG,k) = CG+1,k+1)-CU,k+1) 
to obtain 


C(k,k) + C(R+ 1,k) + C(k+2,k) +--+ + C(n, k) 
=14+CK+2,k+1)—-CK+1,k +1 +C(K+3,k 4+ 1) 
—C(K+2,k4+1)+---+C4+1,k+1)—- Ct, k+ 1) 


=C(n+1,k+1). < 
Exercise 51, Section 6.3, shows another way to prove equation (6.7.4). 
Example 6.7.11 Use equation (6.7.4) to find the sum 1+2+---+n. 
SOLUTION We may write 
142+---+n=C(1,1)+CQ@,1)+---+CQ@, 1) 
= C(n+ 1,2) by equation (6.7.4) 
(n+ 1)n 
= —— < 
6.7 Review Exercises 
1. State the Binomial Theorem. 3. What is Pascal’s triangle? 
2. Explain how the Binomial Theorem is derived. 4. State the formulas that can be used to generate Pascal’s triangle. 
6.7 Exercises 
1. Expand (x + y)* using the Binomial Theorem. 10. @+y+z" 
2. Expand (2c — 3d)° using the Binomial Theorem. i. w+xtyto? 
In Exercises 3-9, find the coefficient of the term when the expres. *12. (@+Y + DM w+atytz? 
sion is expanded. 13. Find the next row of Pascal’s triangle given the row 
AV. i 
seas tae. i 7 ot Se Seon 7 a 
4. 5°; (25 — 1)? 
5. reyes (aatytol? 14. (a) Show that C(n, k) < C(n, k+1) ifand only ifk < (n—1)/2. 
6. wx y?25; (Qw+x+3y+2)? (b) Use part (a) to deduce that the maximum of C(n, k) for 
7. ax: (a+x+toP(atx+a3 k=0,1,...,nis C(n, [n/2]). 
8. x3: (a tax +x(at+x4 15. Prove that 
9. x4; (at Jax +x? (a+x)> m 
DEC ica, b= (- "Cn — 1, m) 
In Exercises 10-12, find the number of terms in the expansion of k=0 


each expression. 
for allm,0 <m<n-1. 
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16. 


17. 
18. 
19. 


x20. 
21. 


«22. 


23. 


24. 


25. 


26. 
27. 


28. 
«29. 


x30. 


«31. 


Use the Binomial Theorem to show that 


0= > ED‘C(, b). 


k=0 
Use induction on n to prove the Binomial Theorem. 
Prove Theorem 6.7.6 by using Theorem 6.2.16. 


Give a combinatorial argument to show that 
C(n, k) = C(n,n — k). 


Prove equation (6.7.4) by giving a combinatorial argument. 


Find the sum 
1-24+2-3+4---+(n—1)n. 

Use equation (6.7.4) to derive a formula for 

P42? 4..-47°. 
Use the Binomial Theorem to show that 

n 

Soca, i = 3". 

k=0 
Suppose that n is even. Prove that 


n/2 n/2 


35 Cq, 2) = 2"! = $5 C(n, 2k - 1). 
k=1 


k=0 
Prove 


(a+b4 abc, 


c= y : n! 


ij!(n—i-j)! 
O<i+j<n HNC oD) 


Use Exercise 25 to write the expansion of (x + y + z)*. 


Prove 
n! 
v= ¥ 
Nin—-i-p! 
O<i+j<n ul(a—i-p! 
Prove that 2C(2n — 1, n) = C(2n, n) for alln > 1. 


Prove that 
n—-1 
S> C= 1,)C(n, n= i) = CQn—1,n) 
i=0 

for alln > 1. 


Give a combinatorial argument to prove that 
n 
.y C(n, k)? = C(2n, n). 
k=0 
Prove that for alln, g, andk,0 <q <nand0 <k <n, 


q 
C(n, q) = S C(k, i)C(n—k, gi). 
i=0 
If j > i, define C(i,j) to be 0. Notice that Exercise 30 is a 
special case of this result with n replaced by 2n, and k and q 
replaced by n. Hint: Use a combinatorial argument. Consider 
k red balls and n—k blue balls to be distributed into containers 
of size g andn — q. 


32. 


33. 


x34, 
35; 


36. 


37. 


38. 


«39, 


40. 
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Prove 
n 
nl +x)" =S° Ca, bet". 
k=l 
Use the result of Exercise 32 to show that 


n 
n2"| = So kCMn, Wd. (6.7.5) 
k=1 
Prove equation (6.7.5) by induction. 


A smoothing sequence bo, ..., bg—1 is a (finite) sequence sat- 
isfying bj>0 fori = 0,...,k — 1, and SB; = 1. A 
smoothing of the (infinite) sequence a,, a2, ... by the smooth- 
ing sequence bo, ..., bg—1 is the sequence {aj} defined by 


k-1 

y 

a= ) i+ jDi- 
i=0 


The idea is that averaging smooths noisy data. 
The binomial smoother of size k is the sequence 


Bo By-1 
Qn? seey Qn , 
where Bo, ..., Be—1 is row n of Pascal’s triangle (row 0 being 


the top row). 

Let co,c; be the smoothing sequence defined by 
co = c; = 1/2. Show that if a sequence a is smoothed by c, the 
resulting sequence is smoothed by c, and so on k times; then, 
the sequence that results can be obtained by one smoothing of 
a by the binomial smoother of size k + 1. 


In Example 6.1.6 we showed that there are 3” ordered 
pairs (A, B) satisfying AC BCX, where X is an n-element 
set. Derive this result by considering the cases |A|=0, 
|A|=1,..., |A] = 27, and then using the Binomial Theorem. 


Show that 


Rn 1 
So Clk, mH = Cn + 1,m + 1) (Ho - at) 
m+1 


k=m 


for all n > m, where Hx, the kth harmonic number, is defined 
as 


Prove that 


n—1 


3 1 n+1 2! 
=—CHi) 2 +17 
for alln € Zt. 


Prove that 


i 
i=0 


[n/2 
So Cn -kW) =n 


k=0 
for all n > 1, where f,, is the nth Fibonacci number. 


Prove that if p is prime, C(p, i) is divisible by p for all i, 
l<i<p-l. 


41. 


42. 


Prove Fermat’s little theorem, which states that if p is a prime 
and a is a positive integer, p divides a? — a. Hint: (j + 1)? — 
P—1= 327) CQ, dj forallj, 1<j<a-1. 

Prove that 


m m n n 
( ) ( ) Com 4 
m+n m+n 


for all m,n € Z+. Hint: Consider the term for k = m in the 
binomial theorem expansion of (x + y)""*” for appropriate x 
and y. 


n,m) <1 


6.8 


x43, 
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Prove that for each k € Z"""°S, there exist constants 
Co, C1,..-, Cx, depending on k, such that 


StH _ nttl 
~ k+1 
i=l 


for all n e€ Zt. Hint: Use Strong Induction on k; Exer- 
cise 70, Section 2.4, with a = +1. and the Binomial 
Theorem. 


+ Cynk + Cyn! +--+ + Cin t+ Co, 


The Pigeonhole Principle 


Go Online 

For more on the 
Pigeonhole Principle, 
see 


goo.gl/WRoK11 how many there are. 


The Pigeonhole Principle (also known as the Dirichlet Drawer Principle or the Shoe 
Box Principle) is sometimes useful in answering the question: Is there an item having 
a given property? When the Pigeonhole Principle is successfully applied, the principle 
tells us only that the object exists; the principle will not tell us how to find the object or 


The first version of the Pigeonhole Principle that we will discuss asserts that if 


Go Online 

A biography of 
Dirichlet is at 
goo.gl/WRoK11 


n pigeons fly into k pigeonholes and k <n, some pigeonhole contains at least two pi- 
geons (see Figure 6.8.1). The reason this statement is true can be seen by arguing by 
contradiction. If the conclusion is false, each pigeonhole contains at most one pigeon 
and, in this case, we can account for at most k pigeons. Since there are n pigeons and 
n > k, we have a contradiction. 


Figure 6.8.1 n = 6 pigeons in k = 4 pigeonholes. Some pigeonhole contains at least 
two pigeons. 


Pigeonhole Principle 


(First Form) _ pigeons. 


If n pigeons fly into k pigeonholes and k < n, some pigeonhole contains at least two 


We note that the Pigeonhole Principle tells us nothing about how to locate the 
pigeonhole that contains two or more pigeons. It only asserts the existence of a pigeon- 
hole containing two or more pigeons. 

To apply the Pigeonhole Principle, we must decide which objects will play the 
roles of the pigeons and which objects will play the roles of the pigeonholes. Our first 
example illustrates one possibility. 


Example 6.8.1 


Ten persons have first names Alice, Bernard, and Charles and last names Lee, McDuff, 
and Ng. Show that at least two persons have the same first and last names. 
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Example 6.8.2 


Pigeonhole Principle 
(Second Form) 


Example 6.8.3 


Example 6.8.4 


SOLUTION There are nine possible names for the 10 persons. If we think of the persons 
as pigeons and the names as pigeonholes, we can consider the assignment of names to 
people to be that of assigning pigeonholes to the pigeons. By the Pigeonhole Principle, 
some name (pigeonhole) is assigned to at least two persons (pigeons). < 


A proof using the Pigeonhole Principle can often be recast as a proof by contra- 
diction. (After all, the validity of the Pigeonhole Principle was verified using proof by 
contradiction!) Although the proofs are essentially the same, one may be easier than the 
other for a particular individual to construct. 


Give another proof of the result in Example 6.8.1 using proof by contradiction. 


SOLUTION Suppose, by way of contradiction, that no two of the 10 people in Exam- 
ple 6.8.1 have the same first and last names. Since there are three first names and three 
last names, there are at most nine people. This contradiction shows that there are at least 
two people having the same first and last names. < 


We next restate the Pigeonhole Principle in an alternative form. 


If f is a function from a finite set X to a finite set Y and |X| > |Y|, then f(x,) = f(@2) 
for some x), x2 € X, x1 # Xo. 


The second form of the Pigeonhole Principle can be reduced to the first form by 
letting X be the set of pigeons and Y be the set of pigeonholes. We assign pigeon x 
to pigeonhole f(x). By the first form of the Pigeonhole Principle, at least two pigeons, 
X1,X2 € X, are assigned to the same pigeonhole; that is, f(x,) =f(x2) for some x), x2 € 
xX >X1 # XO. 

Our next examples illustrate the use of the second form of the Pigeonhole Principle. 


If 20 processors are interconnected, show that at least 2 processors are directly connected 
to the same number of processors. 


SOLUTION Designate the processors 1, 2,..., 20. Let a; be the number of processors 
to which processor i is directly connected. We are to show that a; = aj, for some i 4j. 
The domain of the function a is X = {1,2,..., 20} and the range Y is some subset of 
{0, 1,..., 19}. Unfortunately, |X| = |{0, 1, ..., 19}| and we cannot immediately use the 
second form of the Pigeonhole Principle. 

Let us examine the situation more closely. Notice that we cannot have a; = 0, for 
some i, and a; = 19, for some j, for then we would have one processor (the ith processor) 
not connected to any other processor while, at the same time, some other processor 
(the jth processor) is connected to all the other processors (including the ith processor). 
Thus the range Y is a subset of either {0, 1,..., 18} or {1,2,..., 19}. In either case, 
|¥| < 20 = |X|. By the second form of the Pigeonhole Principle, a; = aj, for some 
i #j, as desired. < 


Show that if we select 151 distinct computer science courses numbered between | and 
300 inclusive, at least two are consecutively numbered. 


SOLUTION Let the selected course numbers be 


(1, C2, wes C151- (6.8.1) 


Example 6.8.5 


Pigeonhole Principle 
(Third Form) 


Example 6.8.6 
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The 302 numbers consisting of (6.8.1) together with 
ctl, ctl, ..., cis, +1 (6.8.2) 


range in value between | and 301. By the second form of the Pigeonhole Principle, at 
least two of these values coincide. The numbers (6.8.1) are all distinct and hence the 
numbers (6.8.2) are also distinct. It must then be that one of (6.8.1) and one of (6.8.2) 
are equal. Thus we have c; = c; + 1 and course c; follows course cj. < 


An inventory consists of a list of 89 items, each marked “available” or “unavailable.” 
There are 50 available items. Show that there are at least two available items in the list 
exactly nine items apart. (For example, available items at positions 13 and 22 or positions 
69 and 78 satisfy the condition.) 


SOLUTION Let a; denote the position of the ith available item. We must show that 
a; — a; = 9 for some i and j. Consider the numbers 


a, do, «..-., a50 (6.8.3) 
and 
ay + 9, a2 + 9, wees aso + 9. (6.8.4) 


The 100 numbers in (6.8.3) and (6.8.4) have possible values from 1| to 98. By the second 
form of the Pigeonhole Principle, two of the numbers must coincide. We cannot have 
two of (6.8.3) or two of (6.8.4) identical; thus some number in (6.8.3) is equal to some 
number in (6.8.4). Therefore, a; — a; = 9 for some i and j, as desired. < 


We next state yet another form of the Pigeonhole Principle. 


Let f be a function from a finite set X into a finite set Y. Suppose that |X| = n and 
|Y| = m. Let k = [n/m]. Then there are at least k distinct values a), ..., a, € X such 
that 


fa) =f(@) =--- = f(a). 


To prove the third form of the Pigeonhole Principle, we argue by contradiction. 


Let Y = {)1, ..., Ym}. Suppose that the conclusion is false. Then there are at most k — 1 
distinct values x € X with f(x) = y,; there are at most k — 1 distinct values x € X with 
f(x) = y2;...; there are at most k — 1 distinct values x € X with f(x) = y». Thus there 


are at most m(k — 1) members in the domain of f. But 
n 

mk — 1) <m— =n, 
m 


which is a contradiction. Therefore, there are at least k distinct values, a,,..., a, € X, 
such that 


f(ai) = faz) = +++ =f (ax). 


Our last example illustrates the use of the third form of the Pigeonhole Principle. 


A useful feature of black-and-white pictures is the average brightness of the picture. Let 
us say that two pictures are similar if their average brightness differs by no more than 
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some fixed value. Show that among six pictures, there are either three that are mutually 
similar or three that are mutually dissimilar. 


SOLUTION Denote the pictures P;, P2,..., Ps. Each of the five pairs 
(Pi,P2), (Pi, P3), (Pi, P4), (Pi, Ps), (Pi, Po), 

has the value “similar” or “dissimilar.” By the third form of the Pigeonhole Principle, 
there are at least [5/2] = 3 pairs with the same value; that is, there are three pairs 

(Pi, Pi), (P,P), (Pi, Pe) 
all similar or all dissimilar. Suppose that each pair is similar. (The case that each pair is 
dissimilar is Exercise 14.) If any pair 

(Pi, Pj), (Pi, Px), (Pj, Pr) (6.8.5) 
is similar, then these two pictures together with P; are mutually similar and we have 


found three mutually similar pictures. Otherwise, each of the pairs (6.8.5) is dissimilar 
and we have found three mutually dissimilar pictures. < 


6.8 Review Exercises 


1. State three forms of the Pigeonhole Principle. 


2. 


Give an example of the use of each form of the Pigeonhole Prin- 
ciple. 


1. 


Prove that if five cards are chosen from an ordinary 52-card 
deck, at least two cards are of the same suit. 


. Prove that among a group of six students, at least two received 


the same grade on the final exam. (The grades assigned were 
chosen from A, B, C, D, F.) 


. Suppose that each person in a group of 32 people receives a 


check in January. Prove that at least two people receive checks 
on the same day. 


. Prove that among 35 students in a class, at least two have first 


names that start with the same letter. 


. Prove that if f is a function from the finite set X to the finite 


set Y and |X| > |Y|, then f is not one-to-one. 


. Suppose that six distinct integers are selected from the set 


{1, 2,3, 4, 5, 6, 7, 8, 9, 10}. Prove that at least two of the six 
have a sum equal to 11. Hint: Consider the partition {1, 10}, 


{2, 9}, {3, 8}, {4, 7}, {5, 6}. 


. Thirteen persons have first names Dennis, Evita, and 


Ferdinand and last names Oh, Pietro, Quine, and 
Rostenkowski. Show that at least two persons have the same 
first and last names. 


. Eighteen persons have first names Alfie, Ben, and Cissi and 


last names Dumont and Elm. Show that at least three persons 
have the same first and last names. 


. Professor Euclid is paid every other week on Friday. Show that 


in some month she is paid three times. 


10. 


+13. 


Is it possible to interconnect five processors so that exactly 
two processors are directly connected to an identical number 
of processors? Explain. 


. An inventory consists of a list of 115 items, each marked 


“available” or “unavailable.” There are 60 available items. 
Show that there are at least two available items in the list ex- 
actly four items apart. 


. An inventory consists of a list of 100 items, each marked 


“available” or “unavailable.” There are 55 available items. 
Show that there are at least two available items in the list ex- 
actly nine items apart. 


An inventory consists of a list of 80 items, each marked “avail- 
able” or “unavailable.” There are 50 available items. Show that 
there are at least two unavailable items in the list either three 
or six items apart. 


. Complete Example 6.8.6 by showing that if the pairs (P;, Pi), 


(P\, Pj), (P1, Px) are dissimilar, there are three pictures that 
are mutually similar or mutually dissimilar. 


. Does the conclusion to Example 6.8.6 necessarily follow if 


there are fewer than six pictures? Explain. 


. Does the conclusion to Example 6.8.6 necessarily follow if 


there are more than six pictures? Explain. 


. Prove that for any positive integer n, there exists a positive in- 


teger which, when expressed in decimal, consists of at most n 
Os and Is and is a multiple of n. Hint: Consider the set of the 


n integers, {1, 11, 111, ...}, using only 1s, and the remainders 
of these numbers when divided by n. 


Answer Exercises 18-21 to give an argument that shows that if X is 

any (n+2)-element subset of {1,2,..., 2n-+1} and m is the great- 

est element in X, there exist distinct i and j in X withm = i+. 
For each element k € X — {m}, let 


k wen 
ifk< = 

= 2 

c= m 
—k ifk>-—. 

m if a5 


18. How many elements are in the domain of a? 

19. Show that the range of a is contained in {1,2,..., n}. 

20. Explain why Exercises 18 and 19 imply that a; = a; for some 
i xj. 

21. Explain why Exercise 20 implies that there exist distinct i and 
jin X with m =i+/). 

22. Give an example of an (n + 1)-element subset X of 
{1,2,..., 22+ 1} having the property: For no distinct i,j € X 
do we havei+j € X. 


Answer Exercises 23-26 to give an argument that proves the 
following result. 


A sequence aj, a2, ..., Ay241 of n2 + 1 distinct numbers 
contains either an increasing subsequence of length 
n+ 1 ora decreasing subsequence of length n + 1. 


Suppose by way of contradiction that every increasing or de- 
creasing subsequence has length n or less. Let bj be the length of 
a longest increasing subsequence starting at aj, and let c; be the 
length of a longest decreasing subsequence starting at aj. 


23. Show that the ordered pairs (b;, c;), i = 1,...,n? + 1, are 
distinct. 

24. How many ordered pairs (b;, c;) are there? 

25. Explain why | < bj <nand1 <c <n. 

26. What is the contradiction? 

Answer Exercises 27-30 to give an argument that shows that in a 

group of 10 persons there are at least two such that either the dif- 


ference or sum of their ages is divisible by 16. Assume that the ages 
are given as whole numbers. 


Let a,,..., 4,9 denote the ages. Let r; = aj mod 16 and let 
ri ifr, <8 
So= 
; 16-7, ifr >8. 


27. Show that s;,.. 
28. Explain why s; = s, for some j 4 k. 


., 519 range in value from 0 to 8. 


29. Suppose that s; = s; for some j 4 k. Explain why if s; = 1; 
and sx = rg or sj = 16 — 1; and sx = 16 — rx, then 16 divides 
aj — Ak. 

30. Show that if the conditions in Exercise 29 fail, then 16 divides 
aj + ak. 


31. 


«32, 


«33. 


«34, 


«35, 


«36. 


«37. 


38. 


39. 


40. 


41. 
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Show that in the decimal expansion of the quotient of two in- 
tegers, eventually some block of digits repeats. Examples: 


217 
——~ = 0.32878787.... 
660 


1 

— =0.1666..., 

6 se) 
Twelve basketball players, whose uniforms are numbered 1 
through 12, stand around the center ring on the court in an ar- 
bitrary arrangement. Show that some three consecutive players 
have the sum of their numbers at least 20. 


For the situation of Exercise 32, find and prove an estimate for 
how large the sum of some four consecutive players’ numbers 
must be. 


Let f be a one-to-one function from X = {1, 2,..., n} onto X. 
Let f« = fof o---of denote the k-fold composition of f with 
itself. Show that there are distinct positive integers i andj such 
that f'(x) = f/(x) for all x € X. Show that for some positive 
integer k, f*(x) = x for all x € X. 

A 3 x 7 rectangle is divided into 21 squares each of which is 
colored red or black. Prove that the board contains a nontrivial 
rectangle (not | x k or k x 1) whose four corner squares are 
all black or all red. 


Prove that if p ones and g zeros are placed around a circle in 
an arbitrary manner, where p, q, and k are positive integers 
satisfying p > kq, the arrangement must contain at least k 
consecutive ones. 

Write an algorithm that, given a sequence a, finds the length 
of a longest increasing subsequence of a. 

A 2k x 2k grid is divided into 4k? squares and four k x k 
subgrids. The following figure shows the grid for k = 4: 


Show that it is impossible to mark k squares in the upper-left, 
k x k subgrid and k squares in the lower-right, k x k subgrid 
so that no two marked squares are in the same row, column, or 
diagonal of the 2k x 2k grid. 

This is a variant of the n-queens problem, which we dis- 
cuss in detail in Section 9.3. 


Prove that every (n + 1)-element subset of {1,2,...,2n} 
contains two distinct integers p and q_ such _ that 
gcd(p, q) = 1. Hint: Let {ay, ..., dn41} be an (n+ 1)-element 
subset of {1,2,..., 2m}. Consider the list a), ..., dn41,a1 + 
1,...,4@n41 41. 

Show that if “(nm + 1)-element subset” is changed to 
“n-element subset” in Exercise 39, the statement is false. 


Prove that every (n+ 1)-element subset of {1, 2,..., 2m} con- 
tains two distinct integers p and q such that p divides q. Hint: 
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If X is an (n + 1)-element subset of {1, 2, ..., 2m}, write the 
numbers in X in the form 2'"k, where k is odd. 


. Prove Exercise 41 using mathematical induction. 


. Show that if “(nm + 1)-element subset” is changed to 


“n-element subset” in Exercise 41, the statement is false. 


. Prove that a planar polygon with n sides, n > 3, has at least 


three interior angles each less than 180 degrees. Assume no 
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0-degree interior angles. As an example, in the following fig- 
ure angles A, C, and E are each less than 180 degrees. 


Chapter 6 Notes 


An elementary book concerning counting methods is [Niven, 1965]. References on com- 
binatorics are [Brualdi; Even, 1973; Liu, 1968; Riordan; and Roberts]. [Vilenkin] contains 
many worked-out combinatorial examples. [Benjamin] contains an outstanding collection of 
combinatorial proofs. The general discrete mathematical references [Liu, 1985; and Tucker] 
devote several sections to the topics of Chapter 6. [Even, 1973; Hu; and Reingold] treat 
combinatorial algorithms. References on probability are [Billingsley; Ghahramani; 
Kelly; Ross; and Rozanov]. [Fukunaga; Gose; and Nadler] are texts on pattern 


recognition. 


Several proofs of Fermat’s little theorem can be found at http://en.wikipedia.org/wiki/ 
Proofs_of_Fermat’s_little_theorem. 
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Section 6.1 


1. 
2. 
3. 


Multiplication Principle 
Addition Principle 
Inclusion-Exclusion Principle 


Section 6.2 


4. 
5. 
6. 


. r-combination of {x;,. 


Permutation of x), ...,x,: ordering of xj,..., Xn 
n! = number of permutations of an n-element set 
r-permutation of x),.. 
Generar ara © 


.,X,: ordering of r elements of 


. P(n,r): number of r-permutations of an n-element set; 


P(n,r) =n(n— 1)---(n—r+1) 


..,Xn}: (unordered) subset of 


{x,,...,X,} containing r elements 


. C(n,r): number of r-combinations of an n-element set; 


C(a,r) = P(n,r)/r! =n! /[(a—r)!r!] 


Section 6.3 


10. 


11. 


Number of orderings of n items of ¢ types with n; identical 
objects of type i = n!/[ny!---n;!] 

Number of unordered, k-element selections from a 
t-element set, repetitions allowed = C(k + t — 1, k) 


Section 6.4 


12. 
13. 


Lexicographic order 
Algorithm for generating r-combinations: Algorithm 6.4.9 


14. 


Algorithm — for 
6.4.16 


generating permutations: Algorithm 


Section 6.5 


. Experiment 

. Event 

. Sample space 

. Probability of an event when all outcomes are equally 


likely 


Section 6.6 


. Probability function 

. Probability of an event 

. If Eis an event, P(E) + P(E) = 1. 

. If E, and E) are events, P(E, U E,) = P(E,) + P(E.) — 


P(E, MEd). 


. Events £; and E are mutually exclusive if FE; N Ey = ©. 
. If events E; and Ey are mutually exclusive, P(E, U Ex) = 


P(E,) + P(E). 


. If E and F are events and P(F) > 0, the conditional proba- 


bility of E given F is P(E | F) = P(EN F)/P(F). 


. Events E and F are independent if P(E M F) = P(E)P(F). 
. Bayes’ Theorem: If the possible classes are Cj,... 


Ch, 
each pair of these classes is mutually exclusive, and each 
item to be classified belongs to one of these classes, for a 
feature set F we have 


PF |G) 
Die PF | C)P(C)) 


P(G)|F) = 
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Section 6.7 Section 6.8 
28. Binomial Theorem: (a + b)" = Syy_9 C(n, kKa"* bk 30. Pigeonhole Principle (three forms) 
29. Pascal’s triangle: C(n + 1, k) = C(n,k — 1) + C(n, k) 


Chapter 6 Self-Test 


1. How many eight-bit strings begin with 0 and end with 101? and Smith. Show that at least two persons have the same 


2. How many 3-combinations are there of six objects? first, middle, and last names. 


3. How many strings can be formed by ordering the letters 18. Rotate Pascal’s triangle counterclockwise so that the top 


ABCDEF if A appears before C and E appears before C? row consists of 1’s. Explain why the second row lists the 
itive int in order 1, 2,.... 

4. How many strings can be formed by ordering the letters aie galas sai ee : 
ILLINOIS? 19. An inventory consists of a list of 200 items, each marked 


“available” or “unavailable.” There are 110 available items. 
Show that there are at least two available items in the list 
exactly 19 items apart. 


5. How many strings can be formed by ordering the letters 
ILLINOIS if some I appears before some L? 


6. How many ways can we select three books each from a dif- 20 
ferent subject from a set of six distinct history books, nine 
distinct classics books, seven distinct law books, and four 
distinct education books? 


. Let P = {p1, p2, p3, pa, ps} be a set of five (distinct) points 
in the ordinary Euclidean plane each of which has integer 
coordinates. Show that some pair has a midpoint that has 


integer coordinates. 
7. How many functions are there from an n-element set onto 4 


{0, 1}? 

8. How many six-card hands chosen from an ordinary 52-card 22 
deck contain three cards of one suit and three cards of an- 
other suit? 


. Find the 5-combination that will be generated by 
Algorithm 6.4.9 after 12467 if n = 7. 


. Find the 6-combination that will be generated by Algo- 
rithm 6.4.9 after 145678 if n = 8. 


: : : ? 23. Find the permutation that will be generated by 
_p)4 
9. Expand the expression (s—r)* using the Binomial Theorem. Igorithm 6.4.16 after 6427135. 


sa al Rishi hatin ani EN amone pare ef 24. Find the permutation that will be generated by Algo- 
. rithm 6.4.16 after 625431. 

11. A seven-person committee composed of Greg, Hwang, 25 
Isaac, Jasmine, Kirk, Lynn, and Manuel is to select a chair- 
person, vice-chairperson, social events chairperson, secre- 
tary, and treasurer. How many ways can the officers be 
chosen if either Greg is secretary or he is not an officer? 


. Acard is selected at random from an ordinary 52-card deck. 
What is the probability that it is a heart? 


26. A coin is loaded so that a head is five times as likely to 
occur as a tail. Assign probabilities to the outcomes that ac- 


tel del the likelihood of the out t : 
12. A shipment of 100 compact discs contains five defective Danette ery eae ee 


discs. In how many ways can we select a set of four com- 
pact discs that contains more defective than nondefective 
discs? 


27. A family has three children. Assume that it is equally prob- 
able for a boy or a girl to be born. Are the events “there 
are children of both sexes” and “there is at most one girl” 
independent? Explain. 


28. Two fair dice are rolled. What is the probability that the sum 
of the numbers on the dice is 8? 


13. In how many ways can 12 distinct books be divided among 
four students if each student gets three books? 
14. Find the coefficient of x*yz+ in the expansion of (2x + 
yto8. 29. In the Maryland Cash In Hand game, the contestant chooses 
seven distinct numbers among the numbers | through 31. 
The contestant wins a modest amount ($40) if exactly five 
a numbers, in any order, match those among the seven dis- 
» Pad ee r*c(n, =. tinct numbers randomly drawn by a lottery representative. 
k=0 What is the probability of winning $40? 


15. Use the Binomial Theorem to prove that 


30. Joe and Alicia take a final examination in C++. The proba- 
bility that Joe passes is 0.75, and the probability that Alicia 
passes is 0.80. Assume that the events “Joe passes the fi- 
nal examination” and “Alicia passes the final examination” 
are independent. Find the probability that Joe does not pass. 


; , Find the probability that both pass. Find the probability that 
17. Nineteen persons have first names Zeke, Wally, and Linda; both fail. Find the probability that at least one passes. 


middle names Lee and David; and last names Yu, Zamora, 


16. How many integer solutions of 
xy +X2 +243 +24 = 17 


satisfy x) > 0, x2 > 1, x3 > 2, x4 > 3? 
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31. 


Trisha, Roosevelt, and José write programs that sched- 
ule tasks for manufacturing dog toys. The following table 
shows the percentage of code written by each person and 
the percentage of buggy code for each person. 


Coder 
Trisha Roosevelt José 
Percent of code 30 45 25 
Percent of bugs 3 2 5 


32. 
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Given that a bug was found, find the probability that it was 
in the code written by José. 


Find the probability of obtaining a bridge hand with 6—-5— 
2-0 distribution, that is, six cards in one suit, five cards in 
another suit, two cards in another suit, and no cards in the 
fourth suit. 


Chapter 6 Computer Exercises 


1. 


Write a program that generates all r-combinations of the 
elements {1,..., m}. 


. Write a program that generates all permutations of the ele- 


ments {1,..., n}. 
. Write a program that generates all r-permutations of the el- 
ements {1,..., n}. 


. [Project] Report on algorithms different from those 


presented in this chapter for generating combinations and 
permutations. Implement some of these algorithms as pro- 
grams. 


. Write a program that lists all permutations of ABCDEF in 


which A appears before D. 


6. 


10. 


Write a program that lists all permutations of ABCDEF in 
which C and E are side by side in either order. 


. Write a program that lists all the ways that m distinct Mar- 


tians and n distinct Jovians can wait in line if no two Jovians 
stand together. 


. Write a program to compute the Catalan numbers. 


. Write a program that generates Pascal’s triangle to level n, 


for arbitrary n. 


Write a program that finds an increasing or decreasing sub- 
sequence of length n + 1 of a sequence of n* + 1 distinct 
numbers. 
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7.2 Solving Recurrence 


Relations 
7.3 Applications to the 
Analysis of Algorithms 
17.4 The Closest-Pair 
Problem 
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Chapter 7 


RECURRENCE 
RELATIONS 


This chapter offers an introduction to recurrence relations. Recurrence relations are use- 
ful in certain counting problems. A recurrence relation relates the nth element of a se- 
quence to its predecessors. Because recurrence relations are closely related to recursive 
algorithms, recurrence relations arise naturally in the analysis of recursive algorithms. 


Introduction 


Consider the following instructions for generating a sequence: 


1. Start with 5. 
2. Given any term, add 3 to get the next term. 


If we list the terms of the sequence, we obtain 
5,8, 11, 14, 17,.... (7.1.1) 


The first term is 5 because of instruction 1. The second term is 8 because instruction 2 
says to add 3 to 5 to get the next term, 8. The third term is 11 because instruction 2 says 
to add 3 to 8 to get the next term, 11. By following instructions | and 2, we can compute 
any term in the sequence. Instructions | and 2 do not give an explicit formula for the 
nth term of the sequence in the sense of providing a formula that we can “plug n into” 
to obtain the value of the nth term, but by computing term by term we can eventually 
compute any term of the sequence. 

If we denote the sequence (7.1.1) as a), a2, ..., We may rephrase instruction 1 as 


a =5 (7.1.2) 
and we may rephrase instruction 2 as 


Gn = An-1 + 3, n> 2. (7.1.3) 


iThis section can be omitted without loss of continuity. 
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Taking n = 2 in (7.1.3), we obtain ay = a; + 3. By (7.1.2), a; = 5; thus 


a2 =a, +3=543=8. 


Taking n = 3 in (7.1.3), we obtain a3 = a2 + 3. Since az = 8, 


a3 =a +3=84+3=11. 


By using (7.1.2) and (7.1.3), we can compute any term in the sequence just as we did 
using instructions | and 2. We see that (7.1.2) and (7.1.3) are equivalent to instructions 
1 and 2. 

Equation (7.1.3) furnishes an example of a recurrence relation. A recurrence 
relation defines a sequence by giving the nth value in terms of certain of its predecessors. 
In (7.1.3) the nth value is given in terms of the immediately preceding value. In order for 
a recurrence relation such as (7.1.3) to define a sequence, a “start-up” value or values, 
such as (7.1.2), must be given. These start-up values are called initial conditions. The 
formal definitions follow. 


Definition 7.1.1 » A recurrence relation for the sequence ag, a1, ... is an equa- 
tion that relates a, to certain of its predecessors ao, a1, ..., Un—1- 

Initial conditions for the sequence do, a), ... are explicitly given values for a finite 
number of the terms of the sequence. < 


We have seen that it is possible to define a sequence by a recurrence relation to- 
gether with certain initial conditions. Next we give several examples of recurrence rela- 
tions. 


The Fibonacci sequence (see the discussion following Algorithm 4.4.6) is defined by the 
recurrence relation f, = f,-1 +fn—2, n = 3, and initial conditions fj = 1, fp = 1. < 


A person invests $1000 at 12 percent interest compounded annually. If A, represents the 
amount at the end of n years, find a recurrence relation and initial conditions that define 
the sequence {A,}. 


SOLUTION At the end of n — 1 years, the amount is A,_,. After one more year, we 
will have the amount A,,_; plus the interest. Thus 


A,= 4, (40 DAH hIDA, 21 (7.1.4) 


To apply this recurrence relation for n = 1, we need to know the value of Ag. Since 
Ao is the beginning amount, we have the initial condition 


Ao = 1000. (7.1.5) 
< 


The initial condition (7.1.5) and the recurrence relation (7.1.4) allow us to compute 
the value of A, for any n. For example, 


Ap=T1A2As = ADA ADA; 
= (1.12)(1.12)(1.12)Ap = (1.12)3(1000) = 1404.93. (7.1.6) 


Thus, at the end of the third year, the amount is $1404.93. 


Algorithm 7.1.4 


Example 7.1.5 
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The computation (7.1.6) can be carried out for an arbitrary value of n to obtain 


An = (1.12)An-1 


= (1.12)"(1000). 


We see that sometimes an explicit formula can be derived from a recurrence relation 
and initial conditions. Finding explicit formulas from recurrence relations is the topic of 
Section 7.2. 

Although it is easy to obtain an explicit formula from the recurrence relation and 
initial condition for the sequence of Example 7.1.3, it is not immediately apparent how 
to obtain an explicit formula for the Fibonacci sequence. In Section 7.2 we give a method 
that yields an explicit formula for the Fibonacci sequence. 

Recurrence relations, recursive algorithms, and mathematical induction are closely 
related. In all three, prior instances of the current case are assumed known. A recurrence 
relation uses prior values in a sequence to compute the current value. A recursive al- 
gorithm uses smaller instances of the current input to process the current input. The 
inductive step in a proof by mathematical induction assumes the truth of prior instances 
of the statement to prove the truth of the current statement. 

A recurrence relation that defines a sequence can be directly converted to an algo- 
rithm to compute the sequence. For example, Algorithm 7.1.4, derived from recurrence 
relation (7.1.4) and initial condition (7.1.5), computes the sequence of Example 7.1.3. 


Computing Compound Interest 


This recursive algorithm computes the amount of money at the end of n years as- 
suming an initial amount of $1000 and an interest rate of 12 percent compounded 
annually. 


Input: n, the number of years 


Output: The amount of money at the end of 7 years 


compound _interest(n) { 
it ——10) 
return 1000 
return 1.12 * compound_interest(n — 1) 


CaS 


Algorithm 7.1.4 is a direct translation of equations (7.1.4) and (7.1.5), which define 
the sequence Ag, A;,.... Lines 2 and 3 correspond to initial condition (7.1.5), and line 4 
corresponds to recurrence relation (7.1.4). 


Let S, denote the number of subsets of an n-element set. Since going from an (n — 1)- 
element set to an n-element set doubles the number of subsets (see Theorem 2.4.6), we 
obtain the recurrence relation S, = 2S,_,. The initial condition is Sp = 1. < 


One of the main reasons for using recurrence relations is that sometimes it is easier 
to determine the nth term of a sequence in terms of its predecessors than it is to find 
an explicit formula for the nth term in terms of n. The next examples are intended to 
illustrate this thesis. 
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Example 7.1.6 


Example 7.1.7 


Let S,, denote the number of n-bit strings that do not contain the pattern 111. Develop a 
recurrence relation for $;, S2,... and initial conditions that define the sequence S. 


SOLUTION We will count the number of n-bit strings that do not contain the pat- 
tern 111 


(a) that begin with 0; 
(b) that begin with 10; 
(c) that begin with 11. 


Since the sets of strings of types (a), (b), and (c) are disjoint, by the Addition Principle 
S, will equal the sum of the numbers of strings of types (a), (b), and (c). Suppose that 
an n-bit string begins with 0 and does not contain the pattern 111. Then the (n — 1)-bit 
string following the initial 0 does not contain the pattern 111. Since any (n— 1)-bit string 
not containing 111 can follow the initial 0, there are S,,_; strings of type (a). If an n-bit 
string begins with 10 and does not contain the pattern 111, then the (nm — 2)-bit string 
following the initial 10 cannot contain the pattern 111; therefore, there are S,,_» strings 
of type (b). If an n-bit string begins with 11 and does not contain the pattern 111, then 
the third bit must be 0. The (n — 3)-bit string following the initial 110 cannot contain 
the pattern 111; therefore, there are S,,_3 strings of type (c). Thus 


Sn = Sn—1 + Sn—2 + Sp—3 n> 4. 


By inspection, we find the initial conditions $; = 2, $2 = 4, $3 = 7. < 


Recall (see Example 6.2.22) that the Catalan number C,, is equal to the number of routes 
from the lower-left corner of an n x n square grid to the upper-right corner if we are 
restricted to traveling only to the right or upward and if we are allowed to touch but not 
go above a diagonal line from the lower-left corner to the upper-right corner. We call 
such a route a good route. We give a recurrence relation for the Catalan numbers. 

We divide the good routes into classes based on when they first meet the diago- 
nal after leaving the lower-left corner. For example, the route in Figure 7.1.1 first meets 
the diagonal at point (3, 3). We regard the routes that first meet the diagonal at (k, k) as 


Figure 7.1.1 Decomposition of a good route. 


Example 7.1.8 
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constructed by a two-step process: First, construct the part from (0, 0) to (k, k). Second, 
construct the part from (k, k) to (n,n). A good route always leaves (0, 0) by moving 
right to (1, 0) and it always arrives at (k, k) by moving up from (k, k — 1). The moves 
from (1,0) to (k, k — 1) give a good route in the (k — 1) x (k — 1) grid with corners 
at (1,0), 1, k — 1), (k, k — 1), and (k, 0). [In Figure 7.1.1, we marked the points (1, 0) 
and (k,k — 1), k = 3, with diamonds, and we isolated the (k — 1) x (k — 1) subgrid.] 
Thus there are Cy_; routes from (0, 0) to (k, k) that first meet the diagonal at (k, k). The 
part from (k, k) to (n,n) is a good route in the (n — k) x (n — k) grid with corners at 
(k, k), (k,n), (n,n), and (n, k) (see Figure 7.1.1). There are C,_, such routes. By the 
Multiplication Principle, there are Cy,_;C,_, good routes in ann x n grid that first meet 
the diagonal at (k, k). The good routes that first meet the diagonal at (k, k) are distinct 
from those that first meet the diagonal at (k’, k’), k # k’. Thus we may use the Addi- 
tion Principle to obtain a recurrence relation for the total number of good routes in an 
n X n grid: 


C, = > Cy-1Cy_x. 
k=l < 


Tower of Hanoi The Tower of Hanoi is a puzzle consisting of three pegs mounted on 
a board and n disks of various sizes with holes in their centers (see Figure 7.1.2). It is 
assumed that if a disk is on a peg, only a disk of smaller diameter can be placed on top 
of the first disk. Given all the disks stacked on one peg as in Figure 7.1.2, the problem 
is to transfer the disks to another peg by moving one disk at a time. 

We will provide a solution and then find a recurrence relation and an initial con- 
dition for the sequence c, c2,..., where c, denotes the number of moves our solution 
takes to solve the n-disk puzzle. We will then show that our solution is optimal; that is, 
we will show that no other solution uses fewer moves. 

We give a recursive algorithm. If there is only one disk, we simply move it to the 
desired peg. If we have n > 1 disks on peg | as in Figure 7.1.2, we begin by recur- 
sively invoking our algorithm to move the top n — | disks to peg 2 (see Figure 7.1.3). 
During these moves, the bottom disk on peg | stays fixed. Next, we move the remaining 
disk on peg | to peg 3. Finally, we again recursively invoke our algorithm to move the 
n — 1 disks on peg 2 to peg 3. We have succeeded in moving n disks from peg 1 to 
peg 3. 

Ifn > 1, we solve the (n — 1)-disk problem twice and we explicitly move one 
disk. Therefore, c, = 2cy_; +1, n > 1. The initial condition is c; = 1. In Section 7.2, 
we will show that c, = 2” — 1. 

We next show that our solution is optimal. We let d, be the number of moves an 
optimal solution requires. We use mathematical induction to show that 


Ch = dy, n= 1. (7.1.7) 


J § 
i 


Figure 7.1.2 Tower of Hanoi. 
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Example 7.1.9 


; 2 
Figure 7.1.3 After recursively moving the top n — | disks from peg | to peg 2 
in the Tower of Hanoi. 


Basis Step (n= 1) 
By inspection, c; = | = d; thus (7.1.7) is true when n = 1. 


Inductive Step 


Assume that (7.1.7) is true for n — 1. Consider the point in an optimal solution to the 
n-disk problem when the largest disk is moved for the first time. The largest disk must 
be on a peg by itself (so that it can be removed from the peg) and another peg must be 
empty (so that this peg can receive the largest disk). Thus the n— 1 smaller disks must be 
stacked on a third peg (see Figure 7.1.3). In other words, the (n — 1) disk problem must 
have been solved, which required at least d,_; moves. The largest disk was then moved, 
which required one additional move. Finally, at some point the n — 1 disks were moved 
on top of the largest disk, which required at least d,_, additional moves. It follows that 


d, > 2d) + 1. 
By the inductive assumption, cy»_1 = dy—1. Thus 
dy > 2dy—1 +1 = 2ep-1 +1 = ep. (7.1.8) 
The last equality follows from the recurrence relation for the sequence cj, C2,.... By 


definition, no solution can take fewer moves than an optimal solution, so 
Cn = dn. (7.1.9) 


Inequalities (7.1.8) and (7.1.9) combine to give c,, = d,. The inductive step is complete. 
Therefore, our solution is optimal. < 


The Tower of Hanoi puzzle was invented by the French mathematician Edouard 
Lucas in the late nineteenth century. [Lucas was the first person to call the sequence 
1, 1,2,3,5,... the Fibonacci sequence. The Lucas sequence (see Exercise 68) is also 
named after him.] The following myth was also created to accompany the puzzle (and, 
one assumes, to help market it). The puzzle was said to be derived from a mythical gold 
tower that consisted of 64 disks. The 64 disks were to be transferred by monks according 
to the rules set forth previously. It was said that before the monks finished moving the 
tower, the tower would collapse and the world would end in a clap of thunder. Since at 
least 2 — 1 = 18,446,744,073,709,551,615 moves are required to solve the 64-disk 
Tower of Hanoi puzzle, we can be fairly certain that something would happen to the 
tower before it was completely moved. 


The Cobweb in Economics We assume an economics model in which the supply and 
demand are given by linear equations (see Figure 7.1.4). Specifically, the demand is 
given by the equation p = a — bq, where p is the price, q is the quantity, and a and b 


Price 


Supply 


Demand 


Quantity 
> 


Figure 7.1.4 An economics 
model. 
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are positive parameters. The idea is that as the price increases, the consumers demand 
less of the product. The supply is given by the equation p = kg, where p is the price, 
q is the quantity, and k is a positive parameter. The idea is that as the price increases, the 
manufacturer is willing to supply greater quantities. 

We assume further that there is a time lag as the supply reacts to changes. (For 
example, it takes time to manufacture goods and it takes time to grow crops.) We denote 
the discrete time intervals asn = 0, 1,.... We assume that the demand is given by the 
equation 


Pn = 4— ban; (7.1.10) 


that is, at time n, the quantity q, of the product will be sold at price p,. We assume that 
the supply is given by the equation 


Pra >= kan413 (7.1.11) 


that is, one unit of time is required for the manufacturer to adjust the quantity g,41, at 
time n + 1, to the price p,, at the prior time n. 

If we solve equation (7.1.11) for g,,; and substitute into the demand equation 
(7.1.10) for time n + 1, Pn41 = a — bgn41, we obtain the recurrence relation 


b 
Pati =a— yen 
for the price. We will solve this recurrence relation in Section 7.2. 

The price changes through time may be viewed graphically. If the initial price is 
po, the manufacturer will be willing to supply the quantity q;, at time n = 1. We locate 
this quantity by moving horizontally to the supply curve (see Figure 7.1.5). However, 
the market forces drive the price down to p;, as we can see by moving vertically to the 
demand curve. At price p;, the manufacturer will be willing to supply the quantity gz at 
time n = 2, as we can see by moving horizontally to the supply curve. Now the market 
forces drive the price up to p2, as we can see by moving vertically to the demand curve. 
By continuing this process, we obtain the “cobweb” shown in Figure 7.1.5. 


Price 
Supply 


Demand 


Quantity 


92 944 46 9795 93 71 


Figure 7.1.5 A cobweb with a stabilizing price. 


For the supply and demand functions of Figure 7.1.5, the price approaches that 
given by the intersection of the supply and demand curves. This is not always the case, 
however. For example, in Figure 7.1.6, the price fluctuates between po and p;, whereas 
in Figure 7.1.7, the price swings become more and more pronounced. The behavior is 
determined by the slopes of the supply and demand lines. To produce the fluctuating 
behavior of Figure 7.1.6, the angles aw and 6 must add to 180°. The slopes of the supply 
and demand curves are tan a and tan £, respectively; thus in Figure 7.1.6, we have 


k=tana= —tanB=b. 
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P2 
PO 
PA 
P3 
q4 Q a «643 «~Quantity 
Figure 7.1.6 A cobweb with a fluctuating price. Figure 7.1.7 A cobweb with increasing price 


Example 7.1.10 


Go Online 

A biography of 
Ackermann is at 
goo.gl/P8Y2SF 


swings. 


We have shown that the price fluctuates between two values when k = b. A similar 
analysis shows that the price tends to that given by the intersection of the supply and de- 
mand curves (Figure 7.1.5) when b < k; the increasing price-swings case (Figure 7.1.7) 
occurs when b > k (see Exercises 38 and 39). In Section 7.2 we discuss the behavior of 
the price through time by analyzing an explicit formula for the price p,. < 


It is possible to extend the definition of recurrence relation to include functions 
indexed over n-tuples of positive integers. Our last example is of this form. 


Ackermann's Function Ackermann’s function can be defined by the recurrence 
relations 


A(m, 0) = A(m — 1, 1) MS Tg Ds caves (7.1.12) 
A(m, n) = A(m — 1, A(m,n — 1)) MT Ds osved 
W= 1y.2, oni (7.1.13) 


and the initial conditions 
A(O,n) =n+1 | aa 0 al (7.1.14) 


Ackermann’s function is of theoretical importance because of its rapid rate of growth. 

Functions related to Ackermann’s function appear in the time complexity of certain 

algorithms such as the time to execute union/find algorithms (see [Tarjan, pp. 22—29]). 
The computation 


A(1, 1) =A(0,A(1,0)) by (7.1.13) 
=A(0,A(0,1)) by (7.1.12) 
= A(0, 2) by (7.1.14) 
=3 by (7.1.14) 


illustrates the use of equations (7.1.12)—(7.1.14). 

What we and others have called Ackermann’s function is actually derived from 
Ackermann’s original function. Ackermann’s original function defines increasing orders 
of arithmetic: Addition is defined in terms of the “add-one” function, multiplication is 
defined in terms of addition, exponentiation is defined in terms of multiplication, and so 
on. (See Exercises 47-52.) < 
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7.1 Problem-Solving Tips 


To set up a recurrence relation, first define, say A, to be the desired quantity. For ex- 
ample, let A, denote the amount of money at the end of n years if $1000 is invested 
at 12 percent interest compounded annually. Next, just as in proof by induction, look 
for smaller instances within instance n. Continuing the compound interest example, in- 
stance n — | is “within” instance n in the sense that the amount of money at the end of 
n years is the amount at the end of n — | years plus interest. We obtain the recurrence 


relation 


An = An—1 + (0.12)An-1. 


7.1 Review Exercises 


1. What is a recurrence relation? 


nN 


. What is an initial condition? 


vs) 


. What is compound interest and how can it be described by a 
recurrence relation? 


4. What is the Tower of Hanoi puzzle? 
5. Give a solution of the Tower of Hanoi puzzle. 
6. Describe the cobweb in economics. 


7. Define Ackermann’s function. 


In Exercises 1-3, find a recurrence relation and initial conditions 
that generate a sequence that begins with the given terms. 

Mh, 35.75.11, 155.026 

2. 3;:6;-9) 15,24, 39) 303 

3. 1,1, 2,4, 16, 128, 4096,... 
In Exercises 4—8, assume that a person invests $2000 at 14 percent 
interest compounded annually. Let Ay represent the amount at the 
end of n years. 

4. Find a recurrence relation for the sequence Ao, Aj, .... 

5. Find an initial condition for the sequence Ao, Aj,.... 

6. Find A,, Ao, and A3. 

7. Find an explicit formula for A,. 

8. How long will it take for a person to double the initial 

investment? 


If a person invests in a tax-sheltered annuity, the money invested, 
as well as the interest earned, is not subject to taxation until with- 
drawn from the account. In Exercises 9-12, assume that a person 
invests $2000 each year in a tax-sheltered annuity at 10 percent 
interest compounded annually. Let Ay represent the amount at the 
end of n years. 


9. Find a recurrence relation for the sequence Ao, Aj, .... 
10. Find an initial condition for the sequence Ag, Aj,.... 
11. Find Aj, Ao, and A3. 

12. Find an explicit formula for A,,. 


In Exercises 13-17, assume that a person invests $3000 at 12 per- 
cent annual interest compounded quarterly. Let Ay represent the 
amount at the end of n years. 


13. Find a recurrence relation for the sequence Ag, Aj,.... 
14. Find an initial condition for the sequence Ao, Aj,.... 
15. Find Aj, Az, and A3. 

16. Find an explicit formula for Ap. 


17. How long will it take for a person to double the initial 
investment? 

18. Let S,, denote the number of n-bit strings that do not contain 
the pattern 000. Find a recurrence relation and initial condi- 
tions for the sequence {S,,}. 


Exercises 19-21 refer to the sequence S where S;, denotes the num- 
ber of n-bit strings that do not contain the pattern 00. 


19. Find a recurrence relation and initial conditions for the se- 
quence {S,,}. 

20. Show that S,=fn42,n=1,2,..., where f denotes the 
Fibonacci sequence. 


21. By considering the number of n-bit strings with exactly 7 0’s 
and Exercise 20, show that 


Lan+1)/2] 
frz= So Cat1-ii 


i=0 


oe eee 


where f denotes the Fibonacci sequence. 
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Exercises 22-24 refer to the sequence S,, S2, ..., where S, denotes 
the number of n-bit strings that do not contain the pattern 010. 


22. Compute $1, $2, $3, and S4. 


23. By considering the number of n-bit strings that do not contain 
the pattern 010 that have no leading 0’s (i.e., that begin with 
1); that have one leading 0 (i.e., that begin 01); that have two 
leading 0’s; and so on, derive the recurrence relation 


Sn = Sn-1 + Sn—3 + Sn—4 + Sn—5 + +++ +S] +3. (7.1.15) 


24. By replacing n by n — 1 in (7.1.15), write a formula for S,,_1. 
Subtract the formula for S,—1 from the formula for S,, and use 
the result to derive the recurrence relation 


Sn = 28n—1 = Sn—2 + Sn—3- 


In Exercises 25-33, Co, Cj, C2,... 
Catalan numbers. 


denotes the sequence of 


25. Given that Co = C; = | and Cz = 2, compute C3, Cy, and 
Cs by using the recurrence relation of Example 7.1.7. 


26. Show that the Catalan numbers are given by the recurrence 
relation 


(a+ 2)Cn41 = (4n + 2)Cn n= 0, 


and initial condition Co = 1. 
27. Prove thatn +2 < C, foralln > 4. 


*28. Prove that C,, is prime if and only ifn = 2 orn = 3. Hint: 
First, use proof by contradiction to prove that ifn > 5, C, is 
not prime. Use Exercises 26 and 27, this section, and Exercise 
27, Section 5.3, to derive a contradiction. Now examine the 
cases = 0, 1, 2,3, 4. 

29. Prove that 


4n- 1 
Ch = 5) 
n~ 


for alln > 1. 


30. Derive a recurrence relation and an initial condition for the 
number of ways to parenthesize the product 


, * 2 * +++ * Ay n> 2. 


Examples: There is one way to parenthesize a; * a2, namely, 
(a, * a2). There are two ways to parenthesize a, * az * a3, 
namely, ((a, * a2) * a3) and (ay * (a2 * a3)). Deduce that the 
number of ways to parenthesize the product of n elements is 
Cpa. 

Derive a recurrence relation and an initial condition for the 
number of ways to divide a convex (n + 2)-sided polygon, 
n > 1, into triangles by drawing n — | lines through the 
corners that do not intersect in the interior of the polygon. (A 
polygon is convex if any line joining two points in the poly- 
gon lies wholly in the polygon.) For example, there are five 
ways to divide a convex pentagon into triangles by drawing 
two nonintersecting lines through the corners: 


VNauveg 


*31. 


Deduce that the number of ways to divide a convex (n + 2)- 
sided polygon into triangles by drawing n — | nonintersecting 
lines through the corners is C,,n > 1. 


32. How many parenthesized expressions are there containing n 
distinct binary operators, n + 1 distinct variables, and n — 1 
pairs of parentheses? For example, if n = 2, and we choose « 
and + as the operators and x, y, and z as the variables, some 
of the expressions are 


(x*y) +z, x*(V+z), xe(z+y), x+(z¥y), Ze(V +X). 


33. Consider routes from the lower-left corner to the upper-right 
corner in an (n+ 1) x (n + 1) grid in which we are restricted 
to traveling only to the right or upward. By dividing the routes 
into classes based on when, after leaving the lower-left cor- 
ner, the route first meets the diagonal line from the lower-left 
corner to the upper-right corner, derive the recurrence relation 

I n-1 
Cn = 5CQ@+1),n+ 1 - S25 CQ(n — k),n- W). 
k=0 

In Exercises 34 and 35, let S, denote the number of routes from 
the lower-left corner of ann x n grid to the upper-right corner in 
which we are restricted to traveling to the right, upward, or diago- 
nally northeast [i.e., from (i, j) to (i+1, j+1)] and in which we are 
allowed to touch but not go above a diagonal line from the lower- 
left corner to the upper-right corner. The numbers So, S,... are 
called the Schroder numbers. 


34. Show that So = 1, 5; = 2, $2 = 6, and S3 = 22. 


35. Derive a recurrence relation for the sequence of Schréder 
numbers. 


36. Write explicit solutions for the Tower of Hanoi puzzle for 
n= 3,4. 


37. To what values do the price and quantity tend in Example 7.1.9 
when b < k? 


38. Show that when b < kin Example 7.1.9, the price tends to that 
given by the intersection of the supply and demand curves. 


39. Show that when b > k in Example 7.1.9, the differences be- 
tween successive prices increase. 


Exercises 40-46 refer to Ackermann’s function A(m, n). 


40. Compute A(2, 2) and A(2, 3). 
41. Use induction to show that 


A(l,n) =n+2 W=0> Uh ctese 
42. Use induction to show that 
A(2,n) =3+2n i — he Pee 


43. Guess a formula for A(3, 7) and prove it by using induction. 


*44, Prove that A(m,n) > n for all m > 0,n > 0 by induction on 


m. The inductive step will use induction on n. 


45. By using Exercise 44 or otherwise, prove that A(m, n) > 1 for 
allm>1,n>0. 


a 


6. By using Exercise 44 or otherwise, prove that A(m,n) < 
A(m,n-+ 1) for allm > 0,n > 0. 


In Exercises 47-52, Ackermann’s original function AO is defined 
by 


AO, y,z2)=z+1 y,220, 

AO(1,y,zZ)=y+z y,z>0, 

AO(2, y, Z) =yz y,z20, 
AO(x + 3, y, 0) =1 x>0,y>1, 


AO(x+3,y,z+)D= 
AO(x + 2, y, AO(x + 3, y, z)) 


Exercises 47-52 refer to the function AO and to Ackermann’s 
function A. 


47. Prove that AO(3, y, z) = y* fory > 1,z> 0. 
48. Find a formula for AO(4, y, z) for y > 1, z > 0 and prove it. 


49. Show that A(x, y)=AO(x,2,y + 3) — 3 for y>O and 
x=, 1,2. 


50. Show that AO(x, 2, 1) = 2 for x > 2. 
51. Show that AO(x, 2,2) = 4 for x > 2. 
*52. Show that A(x, y) = AO(x, 2, y+ 3) — 3 forx, y > 0. 


53. A network consists of n nodes. Each node has communica- 
tions facilities and local storage. Periodically, all files must 
be shared. A link consists of two nodes sharing files. Specif- 
ically, when nodes A and B are linked, A transmits all its 
files to B and B transmits all its files to A. Only one link ex- 
ists at a time, and after a link is established and the files are 
shared, the link is deleted. Let a, be the minimum number 
of links required by n nodes so that all files are known to all 
nodes. 


x>0,y>1,z>0. 


(a) Show that az = 1, a3 < 3, a4 < 4. 
(b) Show that a, < dnp—; +2,n > 3. 


54. If P,, denotes the number of permutations of n distinct objects, 
find a recurrence relation and an initial condition for the se- 
quence P|, P2,.... 


55. Suppose that we have n dollars and that each day we buy either 
orange juice ($1), milk ($2), or beer ($2). If R, is the number 
of ways of spending all the money, show that 

Ry = Rn-1 + 2Rn-2. 
Order is taken into account. For example, there are 11 ways 
to spend $4: MB, BM, OOM, OOB, OMO, OBO, MOO, BOO, 
OOOO, MM, BB. 


56. Suppose that we have n dollars and that each day we buy either 
tape ($1), paper ($1), pens ($2), pencils ($2), or binders ($3). 
If Ry is the number of ways of spending all the money, derive 
a recurrence relation for the sequence Rj, R2,.... 


57. Let R, denote the number of regions into which the plane is 
divided by n lines. Assume that each pair of lines meets in a 
point, but that no three lines meet in a point. Derive a recur- 
rence relation for the sequence Rj, R2,.... 


Exercises 58 and 59 refer to the sequence S,, defined by 


Sn—1 + Sp 
Sy = 7 n—2 


Sy=0, 8 =1, 


58. Compute $3 and $4. 


x59. Guess a formula for S, and use induction to show that it is 
correct. 


x60. 


61. 


62. 


63. 


64. 


65. 


66. 


67. 
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Let F;, denote the number of functions f from X = {1,..., n} 
into X having the property that if 7 is in the range of f, then 


1,2,...,i— 1 are also in the range of f. (Set Fo = 1.) Show 
that the sequence Fo, F1,... satisfies the recurrence relation 


If a is a bit string, let C(@) be the maximum number of consec- 
utive 0’s in a. [Examples: C(10010) = 2, C(00110001) = 3.] 
Let S, be the number of n-bit strings a with C(a) < 2. 
Develop a recurrence relation for S;, S2,.... 


The sequence g1, g2,... is defined by the recurrence relation 


8n = 8n-1 + 8n24+1 n> 3, 


and initial conditions gj = 1, g2 = 3. By using mathematical 


induction or otherwise, show that 


8n = nti — 1 n=l, 


where f, fo, ... is the Fibonacci sequence. 


Consider the formula 


U3 
Un = { 3n+1 
Un/2 


if n is odd and greater than | 
if n is even and greater than 1 


and initial condition uw; = 1. Explain why the formula is not a 
recurrence relation. A longstanding, open conjecture is that for 
every positive integer n, u, is defined and equal to 1. Compute 
Uy FOF HS Le sccay Vs 
Define the sequence ft, f2, ... by the recurrence relation 

tn = tn—1tn-2 n= 3; 


and initial conditions t; = 1, f2 = 2. What is wrong with the 
following “proof” that ¢, = 1 for alln > 1? 


Basis Step Forn = 1, we have t; = 1; thus, the Basis Step 
is verified. 


Inductive Step Assume that % = 1 for k < n. We must 
prove that t, = 1. Now 


tn = tn—1tn-2 
1-1 
=. 


by the inductive assumption 


The Inductive Step is complete. 


Derive a recurrence relation for C(n,k), the number of 
k-element subsets of an n-element set. Specifically, write 
C(n + 1, k) in terms of C(n, i) for appropriate 7. 


Derive a recurrence relation for S(k, m), the number of ways of 
choosing k items, allowing repetitions, from n available types. 
Specifically, write S(k, n) in terms of S(k — 1, i) for appropri- 
ate i. 


Let S(n, k) denote the number of functions from {1,..., ”} 
onto {1,...,k}. Show that S(n,k) satisfies the recurrence 
relation 


k-1 


Sin,k =k x C(k, i)S(n, i). 


i=1 
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The Lucas sequence L,, L2,... (named after Edouard Lucas, 
the inventor of the Tower of Hanoi puzzle) is defined by the 
recurrence relation 


Lyn = Ln-1 + Ln-2 n> 3; 


and the initial conditions L; = 1, Ly = 3. 
(a) Find the values of L3, L4, and Ls. 
(b) Show that 


n>1, 


Ln42 = fnt1 + fn43 
where f|, /2,... denotes the Fibonacci sequence. 


Establish the recurrence relation 


Sntl,k = Sn,k-1 + 18p,k 
for Stirling numbers of the first kind (see Exercise 90, 
Section 6.2). 
Establish the recurrence relation 

Sntik => Snjk-1 + KSnk 
for Stirling numbers of the second kind (see Exercise 91, 
Section 6.2). 
Show that 


k 
1 ; : : 
Suk = (DIE I"CK |, 
i=0 
where S,,; denotes a Stirling number of the second kind (see 
Exercise 91, Section 6.2). 


Assume that a person invests a sum of money at r percent com- 
pounded annually. Explain the rule of thumb: To estimate the 
time to double the investment, divide 70 by r. 


Derive a recurrence relation for the number of multiplica- 
tions needed to evaluate an n x n determinant by the cofactor 
method. 


A rise/fall permutation is a permutation p of 1,2, ...,n satisfying 
pi) < pit+1) for i= 1,3, 5,05: 

and 
p(t) > pit+1) for i = 2, 4,-6) 65.6 


For example, there are five rise/fall permutations of 1, 2, 3, 4: 


1,3, 2,45 1, 4, 2, 3; 2,3,1,4; 
2,4, 1, 3; 3,4, 1,2. 
Let E,, denote the number of rise/fall permutations of 1,2, ...,n. 
(Define Ep = 1.) The numbers Eo, E, Er, ... are called the Euler 
numbers. 
74. List all rise/fall permutations of 1, 2, 3. What is the value of 
E3? 
75. List all rise/fall permutations of 1, 2, 3, 4,5. What is the value 
of E5? 
76. Show that in a rise/fall permutation of 1, 2,...,,m must oc- 


cur in position 2i, for some i. 


«77. Use Exercise 76 to derive the recurrence relation 
[n/2] 
E, = >> C@—1,2j — 1)Ey-1Fn-3- 
j=l 
x78. By considering where | must occur in a rise/fall permutation, 
derive the recurrence relation 
L(n—1)/2)] 
Ex= > C= 1, 2) ExjEn—2j-1. 
j=0 
*79. Prove that 


n-1 


1 
En = 2 » C(n _ 1, ))Ej}En-j-1- 
j= 


7.2 Solving Recurrence Relations 


To solve a recurrence relation involving the sequence do, dj, ... 


is to find an explicit 


formula for the general term a,,. In this section we discuss two methods of solving re- 


Go Online 

For more on 

solving recurrence relations, 
see g00.g81/P8Y2SF 


SOrS Qy-1,- 
An-15-- 


currence relations: iteration and a special method that applies to linear homogeneous 
recurrence relations with constant coefficients. For more powerful methods, such as 
methods that make use of generating functions, consult [Brualdi]. 

To solve a recurrence relation involving the sequence ao, a1, .. 
use the recurrence relation to write the nth term a, in terms of certain of its predeces- 
.., do. We then successively use the recurrence relation to replace each of 
. by certain of their predecessors. We continue until an explicit formula is ob- 


. by iteration, we 


tained. The iterative method was used to solve the recurrence relation of Example 7.1.3. 


Example 7.2.1 


Solve the recurrence relation 


An = An-1 + 35 (7.2.1) 


subject to the initial condition, a; = 2, by iteration. 


Example 7.2.2 


Example 7.2.3 
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SOLUTION Replacing n by n — 1 in (7.2.1), we obtain 
An—1 = An—-2 + 3: 


If we substitute this expression for a,_; into (7.2.1), we obtain 


Qn = |4n-1 +3 
1 
=/d,2+3 +3 
= An-2 + 2-3. (7.2.2) 


Replacing n by n — 2 in (7.2.1), we obtain 
An—2 = An—3 + 3. 


If we substitute this expression for a,_2 into (7.2.2), we obtain 


Qn = | An—2 +23 
v 
= | An-3 + 3 + 2+3 
= a34+3-3. 
In general, we have 
An = np +k +3. 


If we set k = n — | in this last expression, we have 

ad, =a, + (n—1)-3. 
Since a, = 2, we obtain the explicit formula 

a, = 2+ 3(n — 1) 

for the sequence a. < 
Solve the recurrence relation S, = 2S;,_; of Example 7.1.5, subject to the initial condi- 
tion, So = 1, by iteration. 
SOLUTION S,, = 2S,-1 = 2(2S,-2) = +--+ = 2"So = 2”. < 
Population Growth Assume that the deer population of Rustic County is 1000 at time 
n = 0 and that the increase from time n — | to time n is 10 percent of the size at time 


n— 1. Write a recurrence relation and an initial condition that define the deer population 
at time n and then solve the recurrence relation. 


SOLUTION Let d,, denote the deer population at time n. We have the initial condition 
dy = 1000. The increase from time n — | to time n is d, — d,_,. Since this increase 
is 10 percent of the size at time n — 1, we obtain the recurrence relation d,, — d,_; = 
0.1d,—1, which may be rewritten 


d= 11d,.4, 
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Example 7.2.4 


Example 7.2.5 


The recurrence relation may be solved by iteration: 
dy = 1.Ady_y = 1.101 1dy—2) = (1.1)? Gn2) 
= +++ = (1.1)"dp = (1.1)"1000. 

The assumptions imply exponential population growth. < 
Find an explicit formula for c,,, the minimum number of moves in which the n-disk Tower 
of Hanoi puzzle can be solved (see Example 7.1.8). 
SOLUTION In Example 7.1.8 we obtained the recurrence relation 

Cn = 2Cn-1 +1 (7.2.3) 


and initial condition c; = 1. Applying the iterative method to (7.2.3), we obtain 


Ch = 2€n-1 +1 
= 2(2cn-2 +) +1 
=2c,.+2+1 


= 2 (2c-3+1) +241 
= 2c, 3+27+2+1 


= 2" 1c, gn-2 gn-3 Posie ah 241 


Par) ma Op cers eg cae ee ae | 
= 2" —]. 


The last step results from the formula for the geometric sum (see Example 2.4.4). 4 


Solve the recurrence relation 


b 
Pr=a— yer} 


for the price p, in the economics model of Example 7.1.9 by iteration. 
SOLUTION To simplify the notation, we set s = —b/k. 


Pn = 47 SPn-1 
rT s(a + SPn—2) 


2 
=a+as+ 8° Pn—2 


ll 
Q 
i 


=a+ast+s?(a+t spa) 


=at+ast+ as* + SPp—3 


ll 
Q 
i 
T 
Q 
a 
+ 
g 
a 
i) 
+ 
g 
iy 
2 
+ 
oy 
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v 
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Example 7.2.7 


Example 7.2.8 


Example 7.2.9 


Example 7.2.10 
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b\" ( —ak 4: 
bE} \pap 


becomes small as n gets large so that the price tends to stabilize at approximately 
ak/(k + b). If b/k = 1, (7.2.4) shows that p, oscillates between po and p,. If b/k > 1, 
(7.2.4) shows that the differences between successive prices increase. Previously, we 
observed these properties graphically (see Example 7.1.9). < 


We see that if b/k < 1, the term 


We turn next to a special class of recurrence relations. 


Definition 7.2.6 » A linear homogeneous recurrence relation of order k with 
constant coefficients is a recurrence relation of the form 


An = C1Gn—1 + C2Gn—2 + +++ + Cenk, Ck FO. (7.2.5) 
< 


Notice that a linear homogeneous recurrence relation of order k with constant co- 
efficients (7.2.5), together with the k initial conditions 


ao=Co, a =Ci,..., Aki = Ce, 


uniquely defines a sequence do, a1, .... 


The recurrence relations 
Sn = 28n—1 (7.2.6) 
of Example 7.2.2 and 


Sn =Sn-1 thn-2, (7.2.7) 


which defines the Fibonacci sequence, are both linear homogeneous recurrence relations 
with constant coefficients. The recurrence relation (7.2.6) is of order 1 and (7.2.7) is of 
order 2. < 


The recurrence relation 
an = 3Gn—1An—2 (7.2.8) 


is not a linear homogeneous recurrence relation with constant coefficients. In a linear 
homogeneous recurrence relation with constant coefficients, each term is of the form 
cay. Terms such as d,_1;@,_2 are not permitted. Recurrence relations such as (7.2.8) are 
said to be nonlinear. < 


The recurrence relation a, — d)—; = 2n is not a linear homogeneous recurrence relation 
with constant coefficients because the expression on the right side of the equation is not 
zero. (Such an equation is said to be inhomogeneous. Linear inhomogeneous recurrence 
relations with constant coefficients are discussed in Exercises 42—48.) < 


The recurrence relation a, = 3na,_; is not a linear homogeneous recurrence relation 
with constant coefficients because the coefficient 3n is not constant. It is a linear homo- 
geneous recurrence relation with nonconstant coefficients. < 
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We will illustrate the general method of solving linear homogeneous recurrence 
relations with constant coefficients by finding an explicit formula for the sequence de- 
fined by the recurrence relation 


an = San—1 = 6an—2 (7.2.9) 
and initial conditions 


a) = 7, a, = 16. (7.2.10) 


Often in mathematics, when trying to solve a more difficult instance of some problem, 
we begin with an expression that solved a simpler version. For the first-order recurrence 
relation (7.2.6), we found in Example 7.2.2 that the solution was of the form S,, = ¢”; 
thus for our first attempt at finding a solution of the second-order recurrence relation 
(7.2.9), we will search for a solution of the form V,, = 7”. 

If V,, = t” is to solve (7.2.9), we must have 


Vin = SVn-1 — OVn-2 
or 
f= 51"! — of? 
or 
f — 5"! + 6"? =0. 
Dividing by ¢’~”, we obtain the equivalent equation 
P—St+6=0. (7.2.11) 


Solving (7.2.11), we find the solutions 


At this point, we have two solutions S and T of (7.2.9), given by 
S, = 2", T, = 3". (7.2.12) 


We can verify (see Theorem 7.2.11) that if S and 7 are solutions of (7.2.9), then bS+ dT, 
where b and d are any numbers whatever, is also a solution of (7.2.9). In our case, if we 
define the sequence U by the equation 


Un = bSy + dT, = 62” + d3", 


U is a solution of (7.2.9). 
To satisfy the initial conditions (7.2.10), we must have 


7 = Uy = b2° + d3° =b +d, 16 = U, = b2'! +.d3' = 2b4 3d. 


Solving these equations for b and d, we obtain b = 5 and d = 2. Therefore, the sequence 
U defined by 


U, = 5-2" 42-3" 


satisfies the recurrence relation (7.2.9) and the initial conditions (7.2.10). We conclude 
that 


Gy = Uy = 5-2" 42-3" forn=0,1,.... 
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At this point we will summarize and justify the techniques used to solve the pre- 
ceding recurrence relation. 


Theorem 7.2.11 


Proof Since S and 7 are solutions of (7.2.13), 
Sn = CSp—1 + €2Sp-2, Ty = €1Tn—1 + CoTp-2. 
If we multiply the first equation by b and the second by d and add, we obtain 


Un, = DS, + dT, = C1 (BSn—1 + dTn—1) + C2(DSy—2 + dTy_2) 
= C,Un—1 + €2Un-2. 


Therefore, U is a solution of (7.2.13). 
Since r is a root of (7.2.14), 


r= Cir + Co. 
Now 
ar} tor? =P 7% arta) =r? =P; 
thus the sequence r”,n = 0, 1,..., is a solution of (7.2.13). 
If we set U, = brij + dr3, then U is a solution of (7.2.13). To meet the initial 
conditions (7.2.15), we must have 
Up =b+d=C, U, = bri +: dr2 = C). 


If we multiply the first equation by r; and subtract, we obtain 


d(r; — 12) =11Cyo —C. 
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Example 7.2.12 


Example 7.2.13 


Since r; — r, # 0, we can solve for d. Similarly, we can solve for b. With these choices 
for b and d, we have 


Up = Co, U,=C\. 


Let a be the sequence defined by (7.2.13) and (7.2.15). Since U also satisfies (7.2.13) 
and (7.2.15), it follows that U, = da,,n=0,1,.... < 


More Population Growth Assume that the deer population of Rustic County is 200 
at time n = O and 220 at time n = | and that the increase from time n — | to time n 
is twice the increase from time n — 2 to time n — |. Write a recurrence relation and an 
initial condition that define the deer population at time n and then solve the recurrence 
relation. 


SOLUTION Let d, denote the deer population at time n. We have the initial conditions 
dy = 200, d, = 220. 


The increase from time n — | to time n is d,, — d,_,, and the increase from time n — 2 to 
time n — | is d,_1 — d,_2. Thus we obtain the recurrence relation 


dn — dn—1 = 2 (dn-1 — dn-2), 
which may be rewritten 
dn = 3dy—1 a 2dn—2. 


To solve this recurrence relation, we first solve the quadratic equation 
? — 3t+2 =0 to obtain roots 1 and 2. The sequence d is of the form 


d,=b-\"+c-2"=b+c2". 
To meet the initial conditions, we must have 
200 =dp) =b+c, 220 = d, = b+ 2c. 


Solving for b and c, we find that b = 180 and c = 20. Thus d, is given by 
d, = 180 + 20-2”. As in Example 7.2.3, the growth is exponential. < 


Find an explicit formula for the Fibonacci sequence. 


SOLUTION The Fibonacci sequence is defined by the linear homogeneous, 
second-order recurrence relation 


tn —Sn-1 —fr—-2 = 9 n >= 3, 


and initial conditions f; = 1 and f2 = 1. We begin by using the quadratic formula to 
solve 2 — tf — 1 = 0. The solutions are 


14/5 


= 


Thus the solution is of the form 


wo 35) oo’) 


Theorem 7.2.14 


7.2 @ Solving Recurrence Relations 345 


To satisfy the initial conditions, we must have 
1 5 1-5 
(CA uS9- 


o(* +s a(* | =1, 


Solving these equations for b and d, we obtain 


a an 
Therefore, an explicit formula for the Fibonacci sequence is 
ae. es) 1 4) 
so J5 2 J5 2 * 


Surprisingly, even though f,, is an integer, the preceding formula involves the irra- 
tional number /5. < 


In Example 7.2.13, the number (1 + /5)/2 that appears in the formula for the 
nth Fibonacci number is known as the golden ratio. Calculus fans may enjoy showing 
that lim, 00 fr4i/fr = (1 + V5)/2. Historically, a ratio a:b, where a/b = (1 + /5)/2, 
was considered to be esthetically pleasing. For example, pictures and paintings whose 
dimensions were in the golden ratio were admired. The resolution 1920 x 1200 of some 
computer monitors is close to the golden ratio. The ratio 5:3, which is also close to the 
golden ratio, is a film standard in several European countries. 

Theorem 7.2.11 states that any solution of (7.2.13) may be given in terms of two 
basic solutions 77 and 73. However, in case (7.2.14) has two equal roots r, we obtain only 
one basic solution r”. The next theorem shows that in this case, nr” furnishes the other 
basic solution. 


Let 
An = C{Gn—1 + C2Gn-2 (7.2.16) 


be a second-order linear homogeneous recurrence relation with constant coefficients. 
Let a be the sequence satisfying (7.2.16) and 


a= Co, a= G ‘lis 
If both roots of 
? —cit—c, =0 (7.2.17) 
are equal to r, then there exist constants b and d such that 


Qn = br’ + dnr” =O, Igsoce 


Proof The proof of Theorem 7.2.11 shows that the sequence r’,n = 0,1,..., isa 
solution of (7.2.16). We show that the sequence nr", n = 0, 1, ..., is also a solution of 
(7.2.16). 


346 


Chapter 7 ® Recurrence Relations 


Example 7.2.15 


Since r is the only solution of (7.2.17), we must have 
P—ct-oO= (t—r. 
It follows that c) = 2r and c) = —r?. Now 
a [(a— Dr] +e [@—2)"] = Ar — Yr"! — P (n= 2)" 
=r"[2(n— 1) — (n—2)] =nr". 
Therefore, the sequence nr", n = 0, 1, ..., is a solution of (7.2.16). 
By Theorem 7.2.11, the sequence U defined by U,, = br" + dnr" is a solution of 
(7.2.16). 
The proof that there are constants b and d such that Up = Cy and U; = C, is 


similar to the argument given in Theorem 7.2.11 and is left as an exercise (Exercise 50). 
It follows that U, = d,,n =0,1,.... < 


Solve the recurrence relation 


dy = 4(dn-1 = dn—2) (7.2.18) 
subject to the initial conditions dp = 1 = dj. 


SOLUTION According to Theorem 7.2.11, S, =r" is a solution of (7.2.18), where r is 
a solution of 


rP—4t+4=0. (7.2.19) 


Thus we obtain the solution S,, = 2” of (7.2.18). Since 2 is the only solution of (7.2.19), 
by Theorem 7.2.14, 7,, = n2” is also a solution of (7.2.18). Thus the general solution of 
(7.2.18) is of the form U = aS + bT. We must have Up = 1 = U;. These last equations 
become 


aSo+bT =a+0b=1, aS; +67, =2a4+2b=1. 


Solving for a and b, we obtain 


Therefore, the solution of (7.2.18) is 
dy = 2" — n2"", < 


For the general linear homogeneous recurrence relation of order k with constant 
coefficients (7.2.5), if r is a root of 


cy = 0 


of multiplicity m, it can be shown that r’, nr", ..., n"—!/” are solutions of (7.2.5). This 
fact can be used, just as in the previous examples for recurrence relations of order 2, to 
solve a linear homogeneous recurrence relation of order k with constant coefficients. For 
a precise statement and a proof of the general result, see [Brualdi]. 


7.2 Problem-Solving Tips 


To solve a recurrence relation in which a,, is defined in terms of its immediate predeces- 
SOF Ay—| 


On = 66 Ag Aaves (7.2.20) 
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use iteration. Start with the equation (7.2.20). Substitute for a,_;, which will yield an 
expression involving d,—2, to obtain d, = ...d,—2.... Continue until you obtain a, 
(e.g., do), whose value is given explicitly as the initial condition a, = ...a,.... Plug in 
the value for az and you have solved the recurrence relation. 

To solve the recurrence relation 


An = C1An—-1 + C2Qn-2, 
first solve the equation 
P —cyt—c,=0. (7.2.21) 
If the roots are r; and r, 7; # rp, the solution is 


Gy = br? + dri (7.2.22) 


for some constants b and d. To determine these constants, use the initial conditions, say 
ay = Co, a, = C). Set n = 0 and then n = | in (7.2.22) to obtain 


Co = a9 =b+d, Cy =a, = br, + dro. 
Now solve 
Co =b+d, C, = br; + drp 


for b and d to obtain the solution of the recurrence relation. 
If the roots of (7.2.21) are r; and r2, ry) = rz, the solution is 


Qn = br" + dnr” (7.2.23) 


for some constants b and d, where r = r, = ry. To determine these constants, use the 
initial conditions, say dj = Co, a, = C). Setn = 0 and then n = | in (7.2.23) to obtain 


Co =a) = b, Cy =a, = br+dr. 


Now solve 
Co = b, C; = br+adr 


for b and d to obtain the solution of the recurrence relation. 


7.2 Review Exercises 


1. Explain how to solve a recurrence relation by iteration. 3. Give an example of a second-order, linear homogeneous recur- 


rence relation with constant coefficients. 


2. What is an nth-order, linear homogeneous recurrence relation 


with constant coefficients? 


4. Explain how to solve a second-order, linear homogeneous 
recurrence relation with constant coefficients. 


Tell whether or not each recurrence relation in Exercises 1-10 is a 1. dy = —3ap_-1 


linear homogeneous recurrence relation with constant coefficients. 
Give the order of each linear homogeneous recurrence relation 


with constant coefficients. 


2. dy = 2nan—2 — An—-1 


3. dy = 2nay_1 
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An =4n-1 +n 
Qn = Gn) #1 4+ 2"7! 

An = Tdn—2 — 64-3 

= (Ig 2n)ay—1 — [g(r — 1) ]an—2 


an = —4n—-1 — Gn—-2 


Yea A ws 
= 


an = 64n—1 = 9an—2 


10. An = —4y—-1 + San—2 _ 3an—3 


In Exercises 11-26, solve the given recurrence relation for the ini- 
tial conditions given. 

11. Exercise 1; ag = 2 

12. Exercise 3; aj = 1 

13. Exercise 4; aj = 0 


14, @.=2"0,-1:a =1 


15. Gy = 64,—1 — 8an_2; ag=1, ar=O0 
16. dy = 2an—| + 8an-2; a9 =4, ay = 10 
17. ayn = Tan—| — 10dn-2; ag =5, a, = 16 
18. 2a, = Tayn_1 — 34n-2; ag =a, = 1 
19. Exercise 5; aj = 0 
20. Exercise 9; ag = a, = 1 
21. ay = —8an—1 — 16an-2; ag =2, a, = —20 
22. The Lucas sequence 
Tn = Lyn-1 + Ln-2, n= 3; y=); g=3 
23. 9an = 64n—1 — An-2; a9 =6, ay =5 


24. Exercise 55, Section 7.1 
25. Exercise 57, Section 7.1 


26. The recurrence relation preceding Exercise 58, Section 7.1 


27. The population of Utopia increases 5 percent per year. In 2000 
the population was 10,000. What was the population in 1970? 


28. Assume that the deer population of Rustic County is 0 at 
time n = 0. Suppose that at time n, 100n deer are intro- 
duced into Rustic County and that the population increases 
20 percent each year. Write a recurrence relation and an initial 
condition that define the deer population at time n and then 
solve the recurrence relation. The following formula may be 
of use: 


Soa! (n — 1)x" — nx'-144 


i=l (@— 1? 


Exercises 29-35 concern Toots and Sly, who flip fair pennies. If 


the pennies are both heads or both tails, Toots wins one of Sly’s 
pennies. If one penny is a head and the other is a tail, Sly wins one 
of Toots’ pennies. The game ends when one person has all of the 
pennies. There are C total pennies. 

Let E;, denote the event “Toots starts with n coins and gets 


all of Sly’s coins”; let W denote the event “Toots wins the first 
toss”; and let py = P(Ey). 

29. Find P(W). 

30. Find P(W). 

31. Show that P(E,) = P(En | W) + P(E; | W). 

32. Find po. 

33. Find pc. 

34. Find a recurrence relation for py. 


35. If Toots starts with n pennies, what is the probability that 
Toots wins all of Sly’s pennies? 


Sometimes a recurrence relation that is not a linear homogeneous 
equation with constant coefficients can be transformed into a linear 
homogeneous equation with constant coefficients. In Exercises 36 
and 37, make the given substitution and solve the resulting recur- 
rence relation; then find the solution to the original recurrence 
relation. 


36. Solve the recurrence relation 


Jan = Jan-1 + 2,/an—2 


with initial conditions ag = a; = | by making the substitution 
bn = Jan. 


37. Solve the recurrence relation 


Gn—-2 
Gn-1 


an = 


with initial conditions ag = 8,a, = 1/(2/2) by taking 
the logarithm of both sides and making the substitution 
bn = Ig an. 


In Exercises 38-40, solve the recurrence relation for the initial con- 
ditions given. 


38. dy = —2nay_1 +3n(n— Van-2; ag=1, ay =2 
«39, c, = 24+ ye cj, n>2; cy=l 
x40. A(n, m) = 14+A(n—1,m—1)+A(n—1,m),n-1>m>1, 


n> 2; A(n,0) = A(n,n) = 1,n>0 
41. Show that 


-1 
; tes \" 
fri = 5 n>1, 


where f denotes the Fibonacci sequence. 


42. The equation 
An = C1An—1 + €2Gn—2 + f (n) (7.2.24) 


is called a second-order, linear inhomogeneous recurrence 
relation with constant coefficients. 


Let g(n) be a solution of (7.2.24). Show that any solu- 
tion U of (7.2.24) is of the form 


Un = Vn + g(n), (7.2.25) 


where V is a solution of the homogeneous equation (7.2.13). 


If f(n) = C in (7.2.24), it can be shown that g(n) = C’ in (7.2.25) 
if 1 is not a root of 


P —cit—c. =0, (7.2.26) 
g(n)=C'n if 1 is a root of (7.2.26) of multiplicity one, and 
g(n)=C'n’ if 1 is a root of (7.2.26) of multiplicity two. Similarly, 
if f(n) = Cn, it can be shown that g(n) = C\n+ Co if 1 is not a root 
of (7.2.26), g(n) = Cin? +Chn if 1 is a root of multiplicity one, and 
g(n) = Cyn + Con? if 1 is a root of multiplicity two. If f(n) = Cn’, 
it can be shown that g(n) = Cyn? + Cin + Co if 1 is not a root of 
(7.2.26), g(n) = Cyn? + Cin? + Ch if 1 is a root of multiplicity one, 
and g(n) =Cyn* + Cyn? + Con? if 1 is a root of multiplicity two. 
If f(n) =C", it can be shown that g(n)=C'C" if C is not a root 
of (7.2.26), g(n) = C’'nC" if C is a root of multiplicity one, and 
g(n) = C'n*C" if C is a root of multiplicity two. The constants can 
be determined by substituting g(n) into the recurrence relation and 
equating coefficients on the two sides of the resulting equation. As 
examples, the constant terms on the two sides of the equation must 
be equal, and the coefficient of n on the left side of the equation 
must equal the coefficient of n on the right side of the equation. 
Use these facts together with Exercise 42 to find the general solu- 
tions of the recurrence relations of Exercises 43-48. 


43. an =6an_1 — 8an-2 +3 
44, ay, =Tan—1 — 10a,—-2 + 16n 


45. dn = 2dn—1 + 8dn—2 + 81n? 
46. 2ay = Tan—1| — 3an—2 + 2” 
47. a, = 16ay—2 + 3n 
48. 9a = 6ay_—1 — An—2 + 5n2 


8an—1 


49. The equation 
an =f(n)an-1 + B(N)ay—2 (7.2.27) 


is called a second-order, linear homogeneous recurrence 
relation. The coefficients f(m) and g(m) are not necessarily 
constant. Show that if § and T are solutions of (7.2.27), then 
bS + dT is also a solution of (7.2.27). 


50. Suppose that both roots of 
P-ct—c =0 
are equal to r, and suppose that a, satisfies 
An =C1An-1+02dn-2, a= Co, a =C. 
Show that there exist constants b and d such that 
Qn = br" + dnr" n= 0, 1p, 


thus completing the proof of Theorem 7.2.14. 
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51. Let a, be the minimum number of links required to solve the 
n-node communication problem (see Exercise 53, Section 
7.1). Use iteration to show that a, < 2n —4,n > 4. 


The n-disk, four-peg Tower of Hanoi puzzle has the same rules as 
the three-peg puzzle; the only difference is that there is an extra 
peg. Exercises 52-55 refer to the following algorithm to solve the 
n-disk, four-peg Tower of Hanoi puzzle. 

Assume that the pegs are numbered 1, 2, 3, 4 and that the 
problem is to move the disks, which are initially stacked on peg 1, 
to peg 4. Ifn = 1, move the disk to peg 4 and stop. Ifn > 1, let kn 
be the largest integer satisfying 


kn 
> i<n. 
i=l 


Fix ky, disks at the bottom of peg 1. Recursively invoke this algo- 
rithm to move the n — ky, disks at the top of peg 1 to peg 2. During 
this part of the algorithm, the k, bottom disks on peg I remain fixed. 
Next, move the ky, disks on peg I to peg 4 by invoking the optimal 
three-peg algorithm (see Example 7.1.8) and using only pegs 1, 3, 
and 4. Finally, again recursively invoke this algorithm to move the 
n — k, disks on peg 2 to peg 4. During this part of the algorithm, 
the k, disks on peg 4 remain fixed. Let T(n) denote the number of 
moves required by this algorithm. 

This algorithm, although not known to be optimal, uses as 
few moves as any other algorithm that has been proposed for the 
four-peg problem. 


52. Derive the recurrence relation 
T(n) = 2T(n— ky) +2" — 1. 


53. Compute 7(n) forn = 1,..., 10. Compare these values with 
the optimal number of moves to solve the three-peg problem. 


x54, Let 


ee kn (kn + 1) 
a ‘aa 


nh >= 
Using induction or otherwise, prove that 
T(n) = kn + tn — 12" +1. 


Show that T(n) = O(4¥"), 


Give an optimal algorithm to solve a variant of the Tower of 
Hanoi puzzle in which a stack of disks, except for the begin- 
ning and ending stacks, is allowed as long as the largest disk 
in the stack is on the bottom (the disks above the largest disk in 
the stack can be ordered in any way whatsoever). The problem 
is to transfer the disks to another peg by moving one disk at a 
time beginning with all the disks stacked on one peg in order 
from largest (bottom) to smallest as in the original puzzle. The 
end position will be the same as in the original puzzle—the 
disks will be stacked in order from largest (bottom) to small- 
est. Prove that your algorithm is optimal. This problem is due 
to John McCarthy. 


x55, 
x56. 
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Problem-Solving Corner Recurrence Relations 
Problem 
(a) Several persons order compact discs. Their names, together with their order codes, are entered into a spread- 
sheet: 
=) File Edit View Insert Format Tools Data Window Help Adobe PDF =18) x) 
DSB AaG6RAY| 4 BOAT H-+- Bez wm 4) F)| iy ior ~ 2). 
[52 [anal = 7-0 -|\Bru|\=z= [8% , @s8|-G-A- 


A10 =) Ivanovic 


faa eel i eee E ae (| 
1 Name Order Code ian 
2 |Torre 9387215 
3 |Chen 1228111 
4 |\Johnsonbaugh 3749007 
5 |Kalin 4429711 
6 |Laser 8522673 
7 |Cabot 1236510 | 
8 |Johnson 8880031 
9 3661937 | 
10 i 4617813 | 
11 |Bacheles 4728103 = 
12 | Berthia 5820517 


I4/4/)[bi\ Sheets _{ sheet? {Sheets / I] 
Draw [3 G | Autoshapes~ \. &% [] O BE] 4 @]| O-~ w-~A-~ 


Ready \] 


Microsoft Excel, Microsoft Corporation, 


The records are to be sorted in alphabetical order, but an error occurs when the names are sorted but the order 
codes are unchanged: 


£9 Microsoft Excel - order2 BEE: 
(=) Eile Edit View Insert Format Tools Data Window Help Adobe PDF =18| x) 


DSH Ea\S6RY st BASo-o- Bz rn At Mio -B. 


|= 2 | arial $% , 53|/H-O-A- 


1 |Name Order Code 
2 |Bacheles 9387215 
3 Berthia 1228111 
4 |Cabot 3749007 
5 |Chen 4429711 
| 6 Ivanovic 8522673 
7 |Johnson 1236510 
8 \Johnsonbaugh 8880031 
9 Kalin 3661937 
40 Laser 4617813 

4728103 

5820517 


lid) 4 [> i\Sheets_(Sheet2 Sheets / 
Draw~ [$ G) | AutoShapesy \ % [] © Al 2) Po 


“Ready |] 


Microsoft Excel, Microsoft Corporation. 


As aresult, no one receives the correct compact discs. Let D,, denote the number of ways that n persons can all 
receive the wrong orders. Show that the sequence D;, Do, ... satisfies the recurrence relation 


1D}, = (n = 1)\(Dy-1 ar 1D}n 9) 
(b) Solve the recurrence relation of part (a) by making the substitution C, = D, — nDy-1. 


Attacking the Problem 


Before attacking the problem, consider what is 
required for part (a). To prove the recurrence relation, 
we must reduce the n-person problem to the (n — 1)- 
and (n—2)-person problems (since in the formula D,, is 
given in terms of D,,_; and D,,_2). Thus as we look sys- 
tematically at some examples, we should try to see how 
the case for n persons relates to the cases for n — 1 and 
n — 2 persons. The situation is similar to that of math- 
ematical induction and recursive algorithms in which 
a given instance of a problem is related to smaller in- 
stances of the same problem. 

Now for some examples. The smallest case is 
n=1.A single person must get the correct order, so 
D, = 0. For n = 2, there is one way for everyone to 
get the wrong orders: Person | gets order 2, and per- 
son 2 gets order 1. Thus D2 = 1. Before continuing, 
let’s develop some notation for the distribution of the 
orders. Carefully chosen notation can help in solving 
a problem. 

We’ ll write 


C1, C2, +++5Cn 


to mean that person 1 got order c;, person 2 got order 
C2, and so on. The one way for two persons to get the 
wrong orders is denoted 2,1. 

Ifn = 3, person 1 gets either order 2 or 3, so the 
possibilities are 2,?,? and 3,?,?. Let’s fill in the missing 
numbers. Suppose that person | gets order 2. Person 
2 can’t get order 1 (for then person 3 would get the 
correct order); thus, person 2 gets order 3. This leaves 
order 1 for person 3. Thus if person 1 gets order 2, the 
only possibility is 

Pell 


Suppose that person | gets order 3. Person 3 can’t 
get order | (for then person 2 would get the correct or- 
der); thus, person 3 gets order 2. This leaves order 1 
for person 2. Thus if person | gets order 3, the only 
possibility is 

Sek 28 


Thus D3 = 2. 
Let’s check that the recurrence relation holds for 
roo 
D3 =2=2(11+0) = B— 1). + Di). 


If n = 4, person | gets either order 2, 3, or 4, 
so the possibilities are 2,?,?,?; 3,?,?,?; and 4,?,?,?. Uf 
there are n persons, person 1 gets either order 2 or 3 or 
... orn. These n — 1 possibilities account for the lead- 
ing n — | factor in the recurrence relation.) Let’s fill in 
the missing numbers. Suppose that person | gets order 
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2. If person 2 gets order 1, then person 3 gets order 4 
and person 4 gets order 3, which gives 2,1,4,3. If per- 
son 2 does not get order 1, the possibilities are 2,3,4,1 
and 2,4,1,3. Thus if person | gets order 2, there are 
three possibilities. Similarly, if person 1 gets order 3, 
there are three possibilities, and if person 1 gets order 
4, there are three possibilities. You should list the pos- 
sibilities to confirm this last statement. Thus D4 = 9. 

Let’s check that the recurrence relation also holds 
forn = 4: 


Di= 9 30ers 


Before reading on, work through the case n = 5. 
List only the possibilities when person 1 gets order 2. 
(There are too many possibilities to list them all.) Also 
verify the recurrence relation for n = 5. 

Notice that if person 1 gets order 2 and person 2 
gets order 1, the number of ways for the other per- 
sons to get the wrong orders is D,—»2 (the remaining 
n — 2 persons must all get the wrong orders). This ac- 
counts for the presence of D,,_> in the recurrence rela- 
tion. We’ll have a solution, provided that the D,_1 term 
appears in the remaining case: Person | gets order 2, 
but person 2 does not get order 1. 


Finding a Solution 


Suppose there are n persons. Let’s summarize the argu- 
ment we’ve developed through our examples. Person | 
gets either order 2, or3,..., orm; so there are n—1 pos- 
sible ways for person 1 to get the wrong order. Suppose 
that person | gets order 2. There are two possibilities: 
Person 2 gets order 1, person 2 does not get order 1. 
If person 2 gets order 1, the number of ways for the 
other persons to get the wrong orders is D,—2. The re- 
maining case is that person 2 does not get order 1. 
Let’s write out carefully what it is we have to 
count. Persons 2, 3, ..., have among themselves or- 
ders 1, 3, 4, ..., (order 2 is missing because person | 
has it). We want to find the number of ways for per- 
sons 2,3,..., each to get the wrong order and for 
person 2 not to get order 1. This is almost the prob- 
lem of n — 1 persons all getting the wrong orders. 
We can turn it into this problem if we tell persons 
2,3,...,n that order 1 is order 2. Now person 2 will 
not get order 1 because person 2 thinks it’s order 2. 
Since there are n — 1 persons, there are D,_; ways 
for persons 2,3,...,n each to get the wrong order 
and for person 2 not to get order 1. It follows that 
there are D,_; + D,—-2 ways for person 1 to get or- 
der 2 and for all the others to get the wrong orders. 
Since there are n — | possible ways for person | to get 
the wrong order, the recurrence relation now follows. 
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The recurrence relation defines D, in terms of 
D,~-1 and D,—2, so it can’t be solved using iteration. 
Also, the recurrence relation does not have constant co- 
efficients (although it is linear), so it can’t be solved us- 
ing Theorem 7.2.11 or 7.2.14. This explains the need to 
make the substitution in part (b). Evidently, after mak- 
ing the substitution, the recurrence relation for C,, can 
be solved by the methods of Section 7.2. 

If we expand 


Dy = (n Ss 1)(Dr-1 lz D,_2), 
we obtain 
1 a) DP is DY (n ane 1)Dn-2. 


If we then move nD,_, to the left side of the equation 
(to obtain an expression equal to C,,), we obtain 


D, a nDy-1 = =D =F (n = 1) Dio. 


Now the left side of the equation is equal to C,, and 
the right side is equal to —C,_1. Thus we obtain the 
recurrence relation 


Cc = =Crai : 
This equation may be solved by iteration. 


Formal Solution 


Part (a): Suppose that n persons all have the wrong or- 
ders. We consider the order that one person p has. Sup- 
pose that p has q’s order. We consider two cases: g has 
p’s order, and q does not have p’s order. 

There are D,_» distributions in which g has p’s or- 
der since the remaining n — 2 orders are in possession 
of the remaining n — 2 persons, but each has the wrong 
order. 

We show that there are D,,_; distributions in which 
q does not have p’s order. Note that the set of orders $ 
that the n — 1 persons (excluding p) possess includes 
all except q’s (which p has). Temporarily assign owner- 
ship of p’s order to qg. Then any distribution of S among 
the n — 1 persons in which no one has his or her own 
order yields a distribution in which g does not have the 
order that is really p’s. Since there are D,_; such dis- 
tributions, there are D,_; distributions in which g does 
not have p’s order. 

It follows that there are D,_; + D,—2 distributions 
in which p has q’s order. Since p can have any one of 
n — | orders, the recurrence relation follows. 

Part (b): Making the given substitution, we obtain 


(Crp = = Ci : 


Using iteration, we obtain 


CG, = (-1)'C,-1 
= (-1)7C,2=-- 
=(-D"C, 
=(-1)"G 
= (S102 = 2D) = (SK. 
Therefore, 
Dy a nDy,-{ — (ly. 


Solving this last recurrence relation by iteration, we 
obtain 


Dy, = (Si 4b AD a 
= (-1)" +al(—D*! + (@- DDy-2] 
= (“1 +a(-1"" 
ye oe Gr 2) ras 


= (-1)"+0(-)" |! +n(n—1)(-)"7 ++ 
—[n(n— 1)---4] + [n(@—1)---3]. 
Summary of Problem-Solving Techniques 


= When examples get too wordy, develop a nota- 
tion for concisely describing the examples. Care- 
fully chosen notation can help enormously in 
solving a problem. 


= When looking at examples, try to see how the 
current problem relates to smaller instances of 
the same problem. 


= It often helps to write out carefully what is to be 
counted. 


= It is sometimes possible to convert a recurrence 
relation that is not a linear homogeneous equa- 
tion with constant coefficients into a linear ho- 
mogeneous equation with constant coefficients. 
Such a recurrence relation can then be solved by 
the methods of Section 7.2. 


Comment 


The technical name of a permutation in which no ele- 
ment is in its original position is a derangement. 


Probability 


We can compute the probability that no one receives 
the correct order if we assume that all permutations are 
equally likely. In this case, since there are n! permuta- 
tions, the probability that no one receives the correct 
order is 


Dy 


n!} 


1 =| 
SG 
ne 


+na—1)(-1)"? +--- 
— [n(n — 1)-+-4] + [n(n — 1) ---3]} 


_ (Si (=e-" (<i ce 
~- g@ " @=m” Gm 
1.14 
=o 


As n increases, the additional terms are so small that 
the probability changes very little. In other words, for 
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sufficiently large n, the probability that no one receives 
the correct order is essentially insensitive to the num- 
ber of orders! 

Calculus tells us that 


© i 
g= ys = 
= i! 
In particular, 
CO 


el= Se 

i=0 

Thus for large n, the probability that no one receives 
the correct order is approximately e~! = 0.368. 


(ir ie Ol aa 
See ei 


i! 


7.3 Applications to the Analysis of Algorithms 


In this section we use recurrence relations to analyze the time algorithms require. The 
technique is to develop a recurrence relation and initial conditions that define a sequence 


a1, 42,. 


.., Where a, is the time (best-case, average-case, or worst-case time) required 


for an algorithm to execute an input of size n. By solving the recurrence relation, we can 
determine the time needed by the algorithm. 

Our first algorithm is a version of the selection sorting algorithm. This algorithm 
selects the largest item and places it last, then recursively repeats this process. 


Algorithm 7.3.1 Selection Sort 


This algorithm sorts the sequence 


Go Online 
For more on 
selection sort, see 
goo.gl/P8Y2SF 


S1,52,---5Sn 


in nondecreasing order by first selecting the largest item and placing it last and then 


recursively sorting the remaining elements. 


Input: 
Output: 


@==1l 
return 


Mile S890 6 


Mig wae a6 


) 


// find largest 
max_index = | // assume initially that s; is largest 
for? = 2ton 

if (8; > Smaxindex) // found larger, so update 


, §, and the length n of the sequence 


., Sy, arranged in nondecreasing order 


selection_sort(s, n) { 
// base case 


9. max_index = i 


// move largest to end 


swap(Sp, Sires faba) 
selection_sort(s,n — 1) 
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Algorithm 7.3.2 


As a measure of the time required by this algorithm, we count the number of com- 
parisons D,, at line 8 required to sort n items. (Notice that the best-case, average-case, and 
worst-case times are all the same for this algorithm.) We immediately obtain the initial 
condition b; = 0. To obtain a recurrence relation for the sequence by, b2,..., we simu- 
late the execution of the algorithm for arbitrary input of size n > 1. We count the number 
of comparisons at each line and then sum these numbers to obtain the total number of 
comparisons b,. At lines 1—7, there are zero comparisons (of the type we are counting). 
At line 8, there are n — | comparisons (since line 7 causes line 8 to be executed n — 1 
times). There are zero comparisons at lines 9-11. The recursive call occurs at line 12, 
where we invoke this algorithm with input of size n— 1. But, by definition, this algorithm 
requires b,_; comparisons for input of size n — 1. Thus there are b,_; comparisons at 
line 12. Therefore, the total number of comparisons is b, = n — 1 + b,_;, which yields 
the desired recurrence relation. 

Our recurrence relation can be solved by iteration: 


by = bn-1 +n—-1 
= (bn-2 +n — 2) + (n— 1) 


= (3 +n — 3) + (n—2)+(n—- IJ) 


Shi peur e Qaim 


=04+14+24+---+(a fe 


5 = O(n’). 


Thus the time required by Algorithm 7.3.1 is O(n’). 

Our next algorithm (Algorithm 7.3.2) is binary search. Binary search looks for a 
value in a sorted sequence and returns the index of the value if it is found or 0 if it is not 
found. The algorithm uses the divide-and-conquer approach. The sequence is divided 
into two nearly equal parts (line 4). If the item is found at the dividing point (line 5), 
the algorithm terminates. If the item is not found, because the sequence is sorted, an 
additional comparison (line 7) will locate the half of the sequence in which the item 
appears if it is present. We then recursively invoke binary search (line 11) to continue 
the search. 


Binary Search 


This algorithm looks for a value in a nondecreasing sequence and returns the index 
of the value if it is found or 0 if it is not found. 


Input: A sequence 5;, Sj11,.--, sj, 1 = 1, sorted in nondecreasing order, a 
value key, i, and j 
Output: The output is an index k for which s; = key, or if key is not in the 
sequence, the output is the value 0. 


binary_search(s, i, j, key) { 

if @ > j) // not found 
return 0 

k= |@+))/2] 

if (key == s,) // found 
return k 

if (key < s,) // search left half 
j=k-1 


OSV SA ge SS 


Example 7.3.3 
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®, else // search right half 


10. t= bel 
lil, return binary_search(s, i, j, key) 
Des 


We illustrate Algorithm 7.3.2 for the input 
5, =‘B, 52 =D’, S93 =F’, “= "S’, 


and key = ‘S’. At line 2, since i > j (1 > 4) is false, we proceed to line 4, where we set 
k to 2. At line 5, since key (*S’) is not equal to s2 (‘D’) , we proceed to line 7. At line 7, 
key < sx ((S’ < ‘D’) is false, so at line 10, we set 7 to 3. We then invoke this algorithm 
with i = 3,7 = 4 to search for key in 


93 ='F, 54 = ‘S’. 


At line 2, since i > j (3 > 4) is false, we proceed to line 4, where we set k to 
3. At line 5, since key (‘S’) is not equal to s3 (‘F’), we proceed to line 7. At line 7, key 
< 5, (S° < ‘F’) is false, so at line 10, we set i to 4. We then invoke this algorithm with 
i =j = 4 to search for key in 


54 = ‘S’. 


At line 2, since i> j (4 > 4) is false, we proceed to line 4, where we set k to 4. At 
line 5, since key (*S”) is equal to s4 (“S’), we return 4, the index of key in the sequence s. 
< 


Next we turn to the worst-case analysis of binary search. We define the worst-case 
time required by binary search to be the number of times the algorithm is invoked in the 
worst case for a sequence containing n items. We let a, denote the worst-case time. 

Suppose that 7 is 1; that is, suppose that the sequence consists of one element s; 
and i = j. In the worst case, the item will not be found at line 5, so the algorithm will be 
invoked a second time at line 11. However, at the second invocation we will have i > j 
and the algorithm will terminate unsuccessfully at line 3. We have shown that if n is 1, 
the algorithm is invoked twice. We obtain the initial condition 


is, (7.3.1) 


Now suppose that n > 1. In this case, i < j, so the condition in line 2 is false. 
In the worst case, the item will not be found at line 5, so the algorithm will be invoked 
at line 11. By definition, the invocation at line 11 will require a total of a,, invocations, 
where m is the size of the sequence that is input at line 11. Since the sizes of the left and 
right sides of the original sequence are | (mn — 1)/2] and |n/2] and the worst case occurs 
with the larger sequence, the total number of invocations at line 11 will be a),/2;. The 
original invocation together with the invocations at line 11 gives all the invocations; thus 
we obtain the recurrence relation 


an = 1 + Q\n/2)- (7.3.2) 


The recurrence relation (7.3.2) is typical of those that result from divide-and- 
conquer algorithms. Such recurrence relations are usually not easily solved explicitly 
(see, however, Exercise 6). Rather, one estimates the growth of the sequence involved 
using theta notation. Our method of deriving a theta notation for the sequence defined 
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Theorem 7.3.4 


by (7.3.1) and (7.3.2) illustrates a general method of handling such recurrence relations. 
First we explicitly solve (7.3.2) in case n is a power of 2. When n is not a power of 2, n 
lies between two powers of 2, say 2‘~! and 2‘, and a, lies between a1 and a>. Since 
explicit formulas are known for az-1 and do, we can estimate a, and thereby derive a 
theta notation for a,. 

First we solve the recurrence relation (7.3.2) in case n is a power of 2. Ifn = Qk 
(7.3.2) becomes 


ax = 1+ ae k= lo 2yaass 
If we let by = ax, we obtain the recurrence relation 


by = 14+ de-1 KH 125 ees, (7.3.3) 


and the initial condition b) = 2. The recurrence relation (7.3.3) can be solved by the 
iterative method: 


by = lt bg = 24+ dp = ++ Hk +b) =k 42. 
Thus, if n = 2‘, 
an = 2+1gn. (7.3.4) 
An arbitrary value of n falls between two powers of 2, say 


Hl en< 2, (7.3.5) 


Since the sequence a is nondecreasing (a fact that can be proved using induction—see 
Exercise 5), 


Ayk-1 < An < aye. (7.3.6) 


Notice that (7.3.5) gives 
k—1<lgn<k. (7.3.7) 


From (7.3.4), (7.3.6), and (7.3.7), we deduce that 
Ign <1L+k=doi-1 < dy < dk =24+k <3 4+1]gn= Ollgn). 


Therefore, a, = ©(lgn), so binary search is @(1gn) in the worst case. This result is 
important enough to highlight as a theorem. 


The worst-case time for binary search for input of size n is O(1gn). 


Proof The proof precedes the statement of the theorem. < 


For our last example, we present and analyze another sorting algorithm known 
as merge sort (Algorithm 7.3.8). We will show that merge sort has worst-case run 
time O(nlgn), so for large input, merge sort is much faster than selection sort (Al- 
gorithm 7.3.1), which has worst-case run time @(n2). In Section 9.7 we will show that 
any sorting algorithm that compares elements and, based on the result of a comparison, 
moves items around in an array is (2 (n 1g n) in the worst case; thus merge sort is optimal 
within this class of sorting algorithms. 

In merge sort the sequence to be sorted, 


S35. 8:4 seg. 
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is divided into two nearly equal sequences, 


Sin eee Sms Sm+lss++9 Sis 


where m = |(i + j)/2]. Each of these sequences is recursively sorted, after which they 
are combined to produce a sorted arrangement of the original sequence. The process of 
combining two sorted sequences is called merging. 


This algorithm combines two nondecreasing sequences into a single nondecreasing 
sequence. 


Input: Two nondecreasing sequences: s;,...,5m and S41,...,5;, and 
indexes i, m, and j 
Output: The sequence c;,...,c; consisting of the elements s;,..., 5, and 
Sm+1, +++, 8; combined into one nondecreasing sequence 
1. merge(s, i,m, j, c) { 
2) // p is the position in the sequence sj, ..., Sm 
3 // q is the position in the sequence Sy,41,..., Sj 
4. // r is the position in the sequence cj, ..., Cj 
5) pa 
6 — all 
7 r=7 
8. // copy smaller of s, and sq 
), while (p < mAq <j) { 
10. if (sp < Sq) { 
11 C2 = 95 
PR, 2 — pel 
13, } 
14 else { 
15 Cr = Sq 
16. a— Gal 
Nes } 
18. r=r+1 
iY), } 
20. // copy remainder of first sequence 
Qe while (p < m) { 
ED). = 5p 
D3, pSjorr il 
24. r=r+1 
Wy, } 
26. // copy remainder of second sequence 
ils while (q < j) { 
28. Cr = Sq 
2), Q@= Or il 
30. r=r+1 
Bir } 
30, 


Figure 7.3.1 shows how Algorithm 7.3.5 merges the sequences 


1,3,4 and 2,4,5,6. 
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Pp p P P Pp P P 
ie { | { | } 
134 134 134 134 134 134 134 

q q q q q q q 


2456 2456 2456 2456 2456 2456 2456 


r r r r r r r 
Cig eel 
nee | | | | | | 
1 12 123 1234 12344 123445 1234456 
Figure 7.3.1 Merging s;,..., 5 and 5,41, ...,5;. The result is cj, ... , cj. < 


Theorem 7.3.7 shows that in the worst case, n— 1 comparisons are needed to merge 
two sequences the sum of whose lengths is n. 


Theorem 7.3.7 In the worst case, Algorithm 7.3.5 requires j — i comparisons. In particular, in the 
worst case, n — 1 comparisons are needed to merge two sequences the sum of whose 
lengths is n. 


Proof In Algorithm 7.3.5, the comparison of elements in the sequences occurs in the 
while loop at line 10. The while loop will execute as long as p < mand q < j. Thus, in 
the worst case, Algorithm 7.3.5 requires j — i comparisons. < 


We next use Algorithm 7.3.5 (merging) to construct merge sort. 


Algorithm 7.3.8 | Merge Sort 


This recursive algorithm sorts a sequence into nondecreasing order by using Algo- 
rithm 7.3.5, which merges two nondecreasing sequences. 


Go Online ; 
Formore-on WWE Macon Sah Culely) 
merge sort, see Output: s;, ..., sj, arranged in nondecreasing order 


goo.gl/P8Y2SF 
merge_sort(s, i, j) { 


1 
2; // base case: i == j 

3 if (i == j) 

4. return 

a // divide sequence and sort 
6 m= (G+ j)/2] 

7 merge_sort(s, i, m) 

8. merge_sort(s, m+ 1, j) 

&), // merge 


10. merge(s, i, m, J, C) 

ili. // copy c, the output of merge, into s 
2, fork =itoj 

13}. Sia—Gr 


Example 7.3.9 


Theorem 7.3.10 
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Figure 7.3.2 shows how Algorithm 7.3.8 sorts the sequence 


12, 30, 21, 8 6, 9, 1, 7. 


12 1 | 8 | 1 
30 30 12 6 
21 ra 21 7 
8 21 | 30 | 8 
6 6 1 9 
9 | 9 6 12 
a ral 7 21 
7 ia 24 aie 

Merge Merge Merge 

one-element two-element four-element 
arrays arrays arrays 
Figure 7.3.2 Sorting by merge sort. < 


We conclude by showing that merge sort (Algorithm 7.3.8) is © (n 1g 1) in the worst 
case. The method of proof is the same as we used to show that binary search is © (lg 7) 
in the worst case. 


Merge sort (Algorithm 7.3.8) is ©(nlgn) in the worst case. 


Proof Let a, be the number of comparisons required by Algorithm 7.3.8 to sort 
n items in the worst case. Then a; = 0. If n > 1, a, is at most the sum of the num- 
bers of comparisons in the worst case resulting from the recursive calls at lines 7 and 8, 
and the number of comparisons in the worst case required by merge at line 10. That is, 


An S Ajn/2) + Q((n+1)/2] t+n—-1. 


In fact, this upper bound is achievable (see Exercise 22), so that 


An = An/2| + Ant1/2) Fa — I. 


First we solve the preceding recurrence relation in case n is a power of 2, say 
n = 2*. The equation becomes 


dye = 2a + 2* — 1, 
We may solve this last equation by using iteration (see Section 7.2): 
ok = 2an-1 + 2* —1 
= 2[2ayx2 + 24! — 174+ 2k -1 
= an. +2-2*—-1-2 
= 2? [Pans +2? —1]4+2-2*-1-2 
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= Vans +3-2*-1-2-72? 


= Han +k-2*—-1-2—-2? —..-— 2! 
= k-2* — (2 — 1) 
= (k—1)2* +1. (7.3.8) 


An arbitrary value of n falls between two powers of 2, say, 


BOP eee (7.3.9) 
Since the sequence a is nondecreasing (see Exercise 25), 
Agk-1 < An < Ak. (7.3.10) 


Notice that (7.3.9) gives 
k-—1<lgn<k. (7.3.11) 


From (7.3.8), (7.3.10), and (7.3.11), we deduce that 


Q(nlgn) = (-24 Ign); 2D 41 = pes 
< dy < ax < kok 41 < (+1gn)2n+ 1 = O(~mlgn). 


Therefore, a, = O(nlgn), so merge sort is O(n 1lgn) in the worst case. < 


As remarked previously, in Section 9.7 we will show that any comparison-based 
sorting algorithm is Q2(nlgn) in the worst case. This result implies, in particular, that 
merge sort is &2(n1g7) in the worst case. If we had already proved this result, to prove 
that merge sort is ©(n lg 7) in the worst case, it would have been sufficient to prove that 
merge sort is O(n lg n) in the worst case. 

Even though merge sort, Algorithm 7.3.8, is optimal, it may not be the algorithm 
of choice for a particular sorting problem. Factors such as the average-case time of the 
algorithm, the number of items to be sorted, available memory, the data structures to 
be used, whether the items to be sorted are in memory or reside on peripheral storage 
devices such as disks or tapes, whether the items to be sorted are already “nearly” sorted, 
and the hardware to be used must be taken into account. 


7.3 Review Exercises 


1. 


Explain how to find a recurrence relation that describes the time 
a recursive algorithm requires. 


. How does selection sort work? 
. What is the time required by selection sort? 


. How does binary search work? What properties must the input 


have? 


. Write a recurrence relation that describes the worst-case time 


required by binary search. 


. What is the worst-case time of binary search? 


. How does the merge algorithm work? What properties must the 


input have? 


8. 
2: 
10. 


iM, 


12. 


13. 


What is the worst-case time of the merge algorithm? 
Explain how merge sort works. 


Write a recurrence relation that describes the worst-case time 
required by merge sort. 


Why is it easy to solve for the worst-case time of merge sort if 
the input size is a power of two? 


Explain, in words, how we can obtain bounds on the worst-case 
time of merge sort for input of arbitrary size if we know the 
worst-case time when the input size is a power of two. 


What is the worst-case time of merge sort? 
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Exercises 1-4 refer to the sequence 


an & WN 


*6. 


11. 


12. 


SHC’, se", 87 "»., Sgn"). sg SX 


. Show how Algorithm 7.3.2 executes in case key = ‘G’. 
. Show how Algorithm 7.3.2 executes in case key = ‘P’. 
. Show how Algorithm 7.3.2 executes in case key = ‘*C’. 
. Show how Algorithm 7.3.2 executes in case key = ‘Z’. 


. Let a, denote the worst-case time of binary search (Algo- 


rithm 7.3.2). Prove that a, < dj41 forn > 1. 


Prove that if a, is the number of times the binary search 
algorithm (Algorithm 7.3.2) is invoked in the worst case for 
a sequence containing n items, then 


Gn = 2+ Ug n| 


for every positive integer n. 


. Give an example to show that if the input to Algorithm 7.3.2 is 


not in nondecreasing order, Algorithm 7.3.2 may not find key 
even if it is present. 


. Suppose that the input to Algorithm 7.3.2 is not in nondecreas- 


ing order. Could Algorithm 7.3.2 erroneously find key even 
though it is not present? 


. Write a nonrecursive version of the binary search algorithm. 
«10. 


Write a nonrecursive version of the binary search algorithm 
that uses at most 1 + [lg] array comparisons. The input is 
a sequence 5}, ...,5,, sorted in nondecreasing order, a value 
key to search for, and n. Assume that the only array compar- 
isons allowed are s; < key and s; == key. 


Assume the conditions of Exercise 10 and assume further that 
one incorrect response to an array comparison is allowed. [An 
“incorrect response to an array comparison” occurs in one of 
four ways: (1) s; < key is true, but in the algorithm, the rela- 
tion s; < key evaluates to false. (2) s; < key is false, but in the 
algorithm, the relation s; < key evaluates to true. (3) s; == key 
is true, but in the algorithm, the relation s; == key evaluates to 
false. (4) s; == key is false, but in the algorithm, the relation 
5; == key evaluates to true.] Write an algorithm that uses at 
most 3 + 2[lgn] array comparisons to determine whether key 
is in the sequence. (The problem can be solved in fewer than 
3 + 2[lgn] array comparisons in the worst case.) 


Professor T. R. S. Eighty proposes the following version of 
binary search: 


binary_search2(s, i, j, key) { 
if (i >j) 
return 0 
k=G4+)/2) 
if (key == sx) 
return k 
k1 = binary_search2(s, i, k — 1, key) 
k2 = binary_search2(s, k + 1, j, key) 
return kl + k2 


13. 


14. 


15. 


(a) Show that binary_search2 is correct; that is, if key is 
present, the algorithm returns its index, but if key is not 
present, it returns 0. 


(b) Find the worst-case running time of binary_search2. 


Professor Larry proposes the following version of binary 
search: 


binary_search3(s, i, j, key) { 
while (i < j) { 
k=G+)/2! 
if (key == sx) 
return k 
if (key < sx) 
j=k 
else 
i=k 
} 
return 0 


} 


Is the Professor’s version correct (i.e., does it find key if it is 
present and return 0 if it is not present)? If the Professor’s ver- 
sion is correct, what is the worst-case time? 


Professor Curly proposes the following version of binary 
search: 


binary_searchA(s, i, j, key) { 
if (i > j) 
return 0 
k=+)/2) 
if (key == x) 
return k 
flag = binary_search4(s, i, k — 1, key) 
if (flag == 0) 
return binary_search4(s, k + 1, j, key) 
else 
return flag 


} 


Is the Professor’s version correct (i.e., does it find key if it is 
present and return 0 if it is not present)? If the Professor’s ver- 
sion is correct, what is the worst-case time? 


Professor Moe proposes the following version of binary 
search: 


binary_search5(s, i, j, key) { 
if (i > /) 
return 0 
k=(G+)/2) 
if (key == sx) 
return k 
if (key < sx) 
return binary_search5(s, i, k, key) 
else 
return binary_search5(s, k + 1, j, key) 
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16. 


17. 


18. 


19. 
20. 


21. 


22. 


23. 
24. 
25. 


26. 


27. 


Let p\,.. 
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Is the Professor’s version correct (i.e., does it find key if it is 
present and return 0 if it is not present)? If the Professor’s ver- 
sion is correct, what is the worst-case time? 


Suppose that we replace the line 
k=lG+))/2] 

with 
k=|i+QG-1/3] 


in Algorithm 7.3.2. Is the resulting algorithm still correct (i.e., 
does it find key if it is present and return 0 if it is not present)? 
If it is correct, what is the worst-case time? 


Suppose that we replace the line 
k= |(G+))/2] 


with 
k=|j-2| 


in Algorithm 7.3.2. Is the resulting algorithm still correct (i.e., 
does it find key if it is present and return 0 if it is not present)? 
If it is correct, what is the worst-case time? 
Suppose that algorithm A requires [n lg] comparisons to sort 
n items and algorithm B requires [n2/4] comparisons to sort 
n items. For which n is algorithm B superior to algorithm A? 
Show how merge sort (Algorithm 7.3.8) sorts the sequence 1, 
OT 
Show how merge sort (Algorithm 7.3.8) sorts the sequence 2, 
Dy Ig 24 By Oy: Ly Dy Ae 
Suppose that we have two sequences each of size n sorted in 
nondecreasing order. 
(a) Under what conditions does the maximum number of 
comparisons occur in Algorithm 7.3.5? 
(b) Under what conditions does the minimum number of 
comparisons occur in Algorithm 7.3.5? 
Let a, be as in the proof of Theorem 7.3.10. Describe input 
for which 
An = Ajn/2) + 44/2) ta— 1. 


What is the minimum number of comparisons required by 
Algorithm 7.3.8 to sort an array of size 6? 


What is the maximum number of comparisons required by 
Algorithm 7.3.8 to sort an array of size 6? 


Let a, be as in the proof of Theorem 7.3.10. Show that a, < 
an41 for alln > 1. 


Let a, denote the number of comparisons required by merge 
sort in the worst case. Show that a, < 3nlgn forn = 


Show that in the best case, merge sort requires ©(nlgn) 
comparisons. 


-+Pn be a permutation of {1, ...,n}. An inversion in p is 


a pair p;, pj where i < j and p; > pj. Informally, the pair pj, p; 
is an inversion if pj and p; are out of order, where order means 
increasing order. Exercises 28-38 are concerned with inversions. 


28. 


List all inversions in the permutation 4, 1, 3, 2. 


29. 
30. 
31. 


32. 


33. 


34. 


35. 


36. 


«37. 


38. 


List all inversions in the permutation 3, 2, 5, 4, 1. 

List all inversions in the permutation 1, 2, 3, 4. 

Show a permutation of {1, 2, 3, 4} having the maximum num- 
ber of inversions. 

What is the maximum number of inversions in a permutation 
OF (1s 5.50570}? 

Show that any sorting algorithm that moves data by swapping 
only adjacent elements has worst-case time Q(n). 

Show that the total number of inversions in all of the permu- 
tations of {1,...,n} is n!n(n — 1)/4. Hint: If pi,..., pn is 
a permutation of {1,...,}, let p> = py,...,p1 denote the 
reverse of p. Consider the total number of permutations in p 
and p®. 


Show that the average number of inversions in permutations 


of {1,..., nm} is n(n — 1)/4. Assume that all permutations are 
equally likely. 
Write an algorithm whose input is a permutation p of 


{1,...,m} and n. The algorithm returns the number of inver- 
sions in p. What is the worst-case time of your algorithm? 


Write an algorithm whose worst-case time is O(n lg n) that re- 
turns the number of inversions in a permutation of {1,..., 7}. 
The algorithm’s input is a permutation of {1,..., m} and n. 


What is wrong with the following argument that purports to 
show that any algorithm that computes the number of in- 
versions in a permutation of n elements has worst-case time 
Q(n7)? Since any algorithm that computes the number of 
inversions in a permutation of n elements must check each 
pair of elements and there are Q(n7) pairs, any such algo- 
rithm has worst-case time Q(n?). (Compare this exercise with 
Exercise 37.) 


Exercises 39-43 refer to Algorithm 7.3.11. 


Algorithm 7.3.11 


Computing an Exponential 
This algorithm computes a” recursively, where a is a real 
number and n is a positive integer. 


Input: 
Output: a” 


a (areal number), n (a positive integer) 


il expl(a,n) { 

2 if (== 1) 

3), return a 

4, m= |n/2] 

5 return exp/(a, m) * expI(a,n —m) 
6 


} 


Let by be the number of multiplications (line 5) required to 
compute a”. 


39. 
40. 


Explain how Algorithm 7.3.11 computes a”. 


Find a recurrence relation and initial conditions for the se- 
quence {b,}. 


41 
42 


43, 


7.3. ® Applications to the Analysis of Algorithms 


. Compute b2, b3, and bq. 


. Solve the recurrence relation of Exercise 40 in case n is a 
power of 2. 


. Prove that b, =n — | for every positive integer n. 


Exercises 44-49, refer to Algorithm 7.3.12. 


Algorithm 7.3.12 


Computing an Exponential 
This algorithm computes a” recursively, where a is a real 
number and n is a positive integer. 


Le 
to 


44. 
45. 


46. 
47. 


48. 


x49 


Input: a (a real number), n (a positive integer) 
Output: a” 
exp2(a, n) { 


il 

2 if@== 1) 

3 return a 

4 m= |n/2] 

St power = exp2(a, m) 

6. power = power * power 
Wf if (n is even) 

8 return power 

8) else 

0 return power * a 
1 


t by, be the number of multiplications (lines 6 and 10) required 
compute a”. 


Explain how Algorithm 7.3.12 computes a”. 
Show that 


if n is odd 
if n is even. 


b= Din-1)/2 +2 
: bnj2 +1 
Find by, b2, b3, and by. 


Solve the recurrence relation of Exercise 45 in case n is a 
power of 2. 


Show, by an example, that b is not nondecreasing. 


. Prove that b, = O(ign). 


Exercises 50-55 refer to Algorithm 7.3.13. 


Algorithm 7.3.13 
Finding the Largest and Smallest Elements 


i 


na Sequence 


This recursive algorithm finds the largest and smallest ele- 
ments in a sequence. 


Input: 
Output: 


Sypris syea andy 


large (the largest element in the 
sequence), small (the smallest element 
in the sequence) 
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1 large_small(s, i,j, large, small) { 
2 if @==/{ 
3 large = sj; 
4. small = 5; 
Bb return 
6 } 
7 m=|G+))/2] 
8. large_small(s, i, m, large_left, small_left) 
OF large_small(s, m + 1, j, large_right, small_right) 
10. if (large_left > large_right) 
iil. large = large_left 
12, else 
IBF large = large_right 
14. if (small_left > small_right) 
iS, small = small_right 
16. else 
Ys small = small_left 
ils} 


Let b,, be the number of comparisons (lines 10 and 14) required for 
an input of size n. 


50. 


51. 
52. 
53. 


54. 


55. 


Explain how Algorithm 7.3.13 finds the largest and smallest 
elements. 


Show that b} = 0 and bz = 2. 
Find b3. 


Establish the recurrence relation 
bn = bin/2) + D\n+1)/2] +2 (7.3.12) 


forn > 1. 


Solve the recurrence relation (7.3.12) in case n is a power of 2 
to obtain 
by, = 2n—2 Se ie sce 
Use induction to show that 
by, = 2n—2 


for every positive integer n. 


Exercises 56-59 refer to Algorithm 7.3.13, with the following in- 
serted after line 6. 


6a. if G==i+1){ 
6b. if (s; > sj) { 
6c. large = 8; 
6d. small = s; 
6e. } 

6f. else { 

6g. small = s; 
6h. large = sj 
6i. } 

6j. return 
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Let b, be the number of comparisons (lines 6b, 10, and 14) for an 
input of size n. 

56. Show that b} = 0 and bz = 1. 

57. Compute b3 and by. 

58. Show that the recurrence relation (7.3.12) holds for n > 2. 
59. Solve the recurrence relation (7.3.12) in case n is a power of 2 


to obtain 


3n 
by, = i 2 
*60. Modify Algorithm 7.3.13 by inserting the lines preceding 


Exercise 56 after line 6 and replacing line 7 with the following. 
Ja. if j —iis odd A (1 +, j — i)/2 is odd) 
7b. m= (Git )/2) -—1 
Tc. else 
7d. m= (it )/2] 


n=2,4,8,.... 


Show that in the worse case, this modified algorithm requires 
at most [(3n/2)—2] comparisons to find the largest and small- 
est elements in an array of size n. 


Exercises 61-65 refer to Algorithm 7.3.14. 


Algorithm 7.3.14 


Insertion Sort (Recursive Version) 
This algorithm sorts the sequence 


$1, 52,...,5n 
in nondecreasing order by recursively sorting the first n— 1 
elements and then inserting s,, in the correct position. 
Input: 
sequence 


Output: 


51, 82,..-,5, and the length n of the 


51, 52, ..., Sy arranged in nondecreasing 


1 insertion_sort(s, n) { 

2, iit == 1) 

3 return 

4. insertion_sort(s, n — 1) 
Bh i=n-1 
6 

7 

8 


temp = Sp 
while (i > 1 A s; > temp) { 
. Cee =) 
9), i=i-l 
10. } 
iil. Sit+1 = temp 
12, 


Let by be the number of times the comparison s; > temp in line 7 
is made in the worst case. Assume that if i < 1, the comparison 
sj; > temp is not made. 


61. Explain how Algorithm 7.3.14 sorts the sequence. 

62. Which input produces the worst-case behavior for Algo- 
rithm 7.3.14? 

63. Find by, bo, and b3. 

64. Find a recurrence relation for the sequence {b,}. 

65. Solve the recurrence relation of Exercise 64. 


Exercises 66-68 refer to Algorithm 7.3.15. 


Algorithm 7.3.15 


Ik Sita andy Haale 


Output: s1,..., Sp 


algorl(s, n) { 

i= 

while (i > 1) { 
s= 54 1 
C— 11/2) 

} 

n= |n/2] 

if (n> 1) 
algorl1(s, n) 


Let b, be the number of times the statement s; = s;+ 1 is executed. 


66. Find a recurrence relation for the sequence {b,,} and compute 
b, bz, and b3. 


67. Solve the recurrence relation of Exercise 66 in case n is a 
power of 2. 

68. Prove that b, = O((lg n)*). 

69. Find a theta notation in terms of n for the number of times 
algor2 is called when invoked as algor2(1, n). 


algor2(i, j) { 
if G==/) 
return 
k= |G+))/2) 
algor2(i, k) 
algor2(k + 1, /) 
} 


Exercises 70-74 refer to Algorithm 7.3.16. 


Algorithm 7.3.16 


Input: A sequence s;, ..., s; of zeros and ones 
Output: 5;,..., 8; where all the zeros precede all 
the ones 
sort(s, i,j) { 
if i == j) 
return 
if (s; == 1) { 
swap(si, Sj) 
sort(s, i,j — 1) 
} 
else 


sort(s,i+1,j) 


70. Use mathematical induction on n, the number of items in the 
input sequence, to prove that sort does produce as output a 
rearranged version of the input sequence in which all of the 
zeros precede all of the ones. (The Basis Step is n = 1.) 


Let by, denote the number of times sort is called when the input 
sequence contains n items. 

71. Find by, bo, and b3. 

72. Write a recurrence relation for by. 

73. Solve your recurrence relation of Exercise 72 for b,. 


74. In theta notation, what is the running time of sort as a function 
of n, the number of items in the input sequence? 


75. Solve the recurrence relation 


Gn = 3a\n/2) +n n>, 

in case n is a power of 2. Assume that a; = 1. 
76. Show that a, = @(n'83), where a, is as in Exercise 75. 
Exercises 77-84 refer to an algorithm that accepts as input the 
sequence 


Sip cee y Sj. 
Ifj > i, the subproblems 


Sines y SLGD/2 ANE S(t /2HL p+ Sj 


are solved recursively. Solutions to subproblems of sizes m and k 
can be combined in time Cm,x to solve the original problem. Let by 
be the time required by the algorithm for an input of size n. 

77. Write a recurrence relation for b, assuming that c»,, = 3. 
78. Write a recurrence relation for b, assuming that Cy,,, = m+k. 


79. Solve the recurrence relation of Exercise 77 in case n is a 
power of 2, assuming that b; = 0. 


80. Solve the recurrence relation of Exercise 77 in case n is a 
power of 2, assuming that b; = 1. 


81. Solve the recurrence relation of Exercise 78 in case n is a 
power of 2, assuming that b; = 0. 


82. Solve the recurrence relation of Exercise 78 in case n is a 
power of 2, assuming that b; = 1. 


7.4 
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«83. Assume that ifm, > m2 and ky > ko, then cm, 4, > Cmy,k- 
Show that the sequence b,, bz, ... is nondecreasing. 

x84. Assuming that c»,=m+k and b; = 0, show that b, < 
4nign. 


Exercises 85—90 refer to the following situation. We let P,, denote 
a particular problem of size n. If P,, is divided into subproblems 
of sizes i and j, there is an algorithm that combines the solutions 
of these two subproblems into a solution to Py, in time at most 
2 + lg(ij). Assume that a problem of size 1 is already solved. 


85. Write a recursive algorithm to solve P,, similar to Algo- 
rithm 7.3.8. 
86. Let a, be the worst-case time to solve P,, by the algorithm of 
Exercise 85. Show that 
Gn S An/2) + aynt1y/2) + 21gn. 


87. Let b, be the recurrence relation obtained from Exercise 86 by 
replacing “<” by “=.” Assume that b} = a; = 0. Show that 
if n is a power of 2, 


b, = 4n—21gn—4. 


88. Show that a, < b, forn = 1,2,3,.... 

89. Show that by, < bj; forn = 1,2,3,.... 

90. Show that a, < 8n forn = 1,2,3,.... 

91. Suppose that {a,,} is a nondecreasing sequence and that when- 
ever m divides n, 

Qn = An/m +d, 

where d is a positive real number and m is an integer satisfying 
m > 1. Show that a, = O(1gn). 


x92. Suppose that {a,,} is a nondecreasing sequence and that when- 


ever m divides n, 
An = Can/m + d, 


where c and d are real numbers satisfying c > | andd > 0, and 


mis an integer satisfying m > 1. Show that a, = @(n'°8°), 


93. [Project] Investigate other sorting algorithms. Consider specif- 
ically complexity, empirical studies, and special features of the 
algorithms (see [Knuth, 1998b]). 


The Closest-Pair Problem* 


Computational geometry is concerned with the design and analysis of algorithms to 


Go Online 

For more on 
computational geometry, 
see g00.g1/P8Y2SF 


design. 


solve geometry problems. Efficient geometric algorithms are useful in fields such as 
computer graphics, statistics, image processing, and very-large-scale-integration (VLSI) 


The closest-pair problem furnishes an example of a problem from computational 
geometry: Given n points in the plane, find a closest pair. (We say a closest pair since it 


is possible that several pairs achieve the same minimum distance.) Our distance measure 
is ordinary Euclidean distance. 


‘This section can be omitted without loss of continuity. 
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Figure 7.4.1 n points in 
the plane. The problem is 
to find a closest pair. For 
this set, the closest pair is 
6 and 8. Line / divides the 
points into two approxi- 
mately equal parts. The 
closest pair in the left half 
is | and 3, which is 5; 
apart. The closest pair in 
the right half is 9 and 12, 
which is dp apart. Any pair 
(e.g., 6 and 8) closer to- 
gether than 5 = min{6z, dr} 
must lie in the vertical 
strip of width 26 centered 
at 1. 


One way to solve this problem is to list the distance between each pair and choose 
the minimum in this list of distances. Since there are C(n, 2) = n(n — 1)/2 = O(n?) 
pairs, this “list-all” algorithm’s time is @(n7). We can do better; we will give a divide- 
and-conquer closest-pair algorithm whose worst-case time is O(n lgn). We first discuss 
the algorithm and then give a more precise description using pseudocode. 

Our algorithm begins by finding a vertical line / that divides the points into two 
nearly equal parts (see Figure 7.4.1). [If n is even, we divide the points into parts each 
having n/2 points. If n is odd, we divide the points into parts, one having (n + 1)/2 
points and the other having (n — 1)/2 points.] 

We then recursively solve the problem for each of the parts. We let 6; be the dis- 
tance between a closest pair in the left part; we let dp be the distance between a closest 
pair in the right part; and we let 6 = min{6_, dg}. Unfortunately, 5 may not be the dis- 
tance between a closest pair from the original set of points because a pair of points, one 
from the left part and the other from the right part, might be closer than 5 (see Figure 
7.4.1). Thus we must consider distances between points on opposite sides of the line /. 

We first note that if the distance between a pair of points is less than 4, the points 
must lie in the vertical strip of width 26 centered at / (see Figure 7.4.1). (Any point not in 
this strip is at least 6 away from every point on the other side of /.) Thus we can restrict 
our search for a pair closer than 6 to points in this strip. 

If there are n points in the strip and we check all pairs in the strip, the worst-case 
time to process the points in the strip is ©(n7). In this case the worst-case time of our 
algorithm will be Q (n”), which is at least as bad as the exhaustive search; thus we must 
avoid checking all pairs in the strip. 

We order the points in the strip in nondecreasing order of their y-coordinates. We 
then examine the points in this order. When we examine a point p in the strip, any point q 
following p whose distance to p is less than 6 must lie strictly within or on the base of the 
rectangle of height 6 whose base contains p and whose vertical sides are at a distance 6 
from / (see Figure 7.4.2). (We need not compute the distance between p and points below 
p. These distances would already have been considered since we are examining the points 
in nondecreasing order of their y-coordinates.) We will show that this rectangle contains 
at most eight points, including p itself, so if we compute the distances between p and the 
next seven points in the strip, we can be sure that we will compute the distance between 
p and all the points in the rectangle. Of course, if fewer than seven points follow p in the 
list, we compute the distances between p and all of the remaining points. By restricting 
the search in the strip in this way, the time spent processing the points in the strip is 
O(n). (Since there are at most n points in the strip, the time spent processing the points 
in the strip is at most 7n.) 
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Figure 7.4.2 Any point g 
following p whose distance to 
pis less than 6 must lie within 
the rectangle. 


Figure 7.4.3 The large rectangle 
contains at most eight points 
because each square contains at 
most one point. 


Example 7.4.1 
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We show that the rectangle of Figure 7.4.2 contains at most eight points. 
Figure 7.4.3 shows the rectangle of Figure 7.4.2 divided into eight equal squares. Notice 
that the length of a diagonal of a square is 


(+) =a 


thus each square contains at most one point. Therefore, the 26 x 6 rectangle contains at 
most eight points. 


Show how the closest-pair algorithm finds a closest pair for the input of Figure 7.4.1. 


SOLUTION The algorithm begins by finding a vertical line / that divides the points 
into two equal parts, 


S; = {1, 2,3,4,5, 6, 7}, So = {8,9, 10, 11, 12, 13, 14}. 


For this input there are many possible choices for the dividing line. The particular line 
chosen here happens to go through point 7. 

Next we recursively solve the problem for S; and Sj. The closest pair of points in 
S; is 1 and 3. We let 6, denote the distance between points 1 and 3. The closest pair of 
points in Sy is 9 and 12. We let dp denote the distance between points 9 and 12. We let 


6= min{6;, dr} = Op. 


We next order the points in the vertical strip of width 26 centered at / in nonde- 
creasing order of their y-coordinates: 


9, 12,4, 10, 7,5, 11, 6, 8. 


We then examine the points in this order. We compute the distances between each point 
and the following seven points, or between each point and the remaining points if fewer 
than seven points follow it. 

We first compute the distances from 9 to each of 12, 4, 10, 7, 5, 11, and 6. Since 
each of these distances exceeds 4, at this point we have not found a closer pair. 

We next compute the distances from 12 to each of 4, 10, 7, 5, 11, 6, and 8. Since 
each of these distances exceeds 4, at this point we still have not found a closer pair. 

We next compute the distances from 4 to each of 10, 7,5, 11, 6, and 8. Since each 
of these distances exceeds 4, at this point we still have not found a closer pair. 

We next compute the distances from 10 to each of 7, 5, 11, 6, and 8. Since the 
distance between points 10 and 7 is less than 5, we have discovered a closer pair. We 
update 6 to the distance between points 10 and 7. 

We next compute the distances from 7 to each of 5, 11, 6, and 8. Since each of 
these distances exceeds 5, we have not found a closer pair. 

We next compute the distances from 5 to each of 11, 6, and 8. Since each of these 
distances exceeds 5, we have not found a closer pair. 

We next compute the distances from 11 to each of 6 and 8. Since each of these 
distances exceeds 5, we have not found a closer pair. 

We next compute the distance from 6 to 8. Since the distance between points 6 and 
8 is less than 5, we have discovered a closer pair. We update 6 to the distance between 
points 6 and 8. Since there are no more points in the strip to consider, the algorithm 
terminates. The closest pair is 6 and 8 and the distance between them is 6. < 


Before we give a formal statement of the closest-pair algorithm, there are several 
technical points to resolve. 

In order to terminate the recursion, we check the number of points in the input 
and if there are three or fewer points, we find a closest pair directly. Dividing the input 
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Algorithm 7.4.2 


Go Online 

For more on the 
closest-pair algorithm, 
see g00.g1/P8Y2SF 


Go Online 

For a C++ program 
implementation 

of this algorithm, 

see g00.81/wshA3i 


and using recursion only if there are four or more points ensures that each of the two 
parts contains at least one pair of points and, therefore, that there is a closest pair in each 
part. 

Before invoking the recursive procedure, we sort the entire set of points by 
x-coordinate. This makes it easy to divide the points into two nearly equal parts. 

We use merge sort (see Section 7.3) to sort by y-coordinate. However, instead of 
sorting each time we examine points in the vertical strip, we assume as in merge sort 
that each half is sorted by y-coordinate. Then we simply merge the two halves to sort all 
of the points by y-coordinate. 

We can now formally state the closest-pair algorithm. To simplify the description, 
our version outputs only the distance between a closest pair, not a closest pair. We leave 
this enhancement as an exercise (Exercise 5). 


Finding the Distance Between a Closest Pair of Points 


Input: pj,..., Pn (n = 2 points in the plane) 
Output: 4, the distance between a closest pair of points 


closest_pair (p, n) { 
sort pi, ..., Pn by x-coordinate 
return rec_cl_pair (p, 1, n) 
} 
rec_cl_pair (p, i, j) { 
// the input is the sequence pj, ... , pj of points in the plane 
// sorted by x-coordinate 


// at termination of rec_cl_pair, the sequence is sorted by 
// y-coordinate 


// rec_cl_pair returns the distance between a closest pair 
// in the input 


// denote the x-coordinate of point p by p.x 


// trivial case (3 or fewer points) 

if gG —i < 3){ 
sort pj, ..., p; by y-coordinate 
directly find the distance 6 between a closest pair 
return 6 

} 

// divide 

k=(|@+))/2] 

[= jayne 

dz, = rec_cl_pair (p, i, k) 

dr = rec_cl_pair (p,k + 1,j) 

6 = min{6;, dr} 


// pi, ».., Pk 18 NOW sorted by y-coordinate 
/! Pei, «++ Pj 18 NOW sorted by y-coordinate 
merge pj, ..., Pk and p41, ..., pj by y-coordinate 


// assume that the result of the merge is stored back in p;,..., Pj 
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// pj, ..., pj iS now sorted by y-coordinate 


// store points in the vertical strip in v 


t=O 
fork =itoj 
if (pyex > 1—6 A pg.x < 146) { 
t=ft+1 
Ve == 1h 
} 
// points in strip are v;,..., Vv; 


// look for closer pairs in strip 
// compare each to next seven points 
fork=1tot—1 
for s =k-+ 1 to min{t, k +7} 
5 = min{6, dist(v;, v,)} 
return 6 


We now consider the worst-case time required by the closest-pair algorithm. The 
procedure closest_pair begins by sorting the points by x-coordinate. If we use an optimal 
sort (e.g., merge sort), the worst-case sorting time will be @(nlgn). Next closest_pair 
invokes rec_cl_pair. We let a, denote the worst-case time of rec_cl_pair for input of 
size n. If n > 3, rec_cl_pair first invokes itself with input size [n/2] and |(n + 1)/2]. 
Merging the points, extracting the points in the strip, and checking the distances in the 
strip each takes time O(n). Thus we obtain the recurrence 


An ZS An/2) + Ayn4n2) +n n> 3. 


This is essentially the same recurrence that merge sort satisfies, so we expect that 
rec_cl_pair has the same worst-case time O(nlgn) as merge sort. Since the worst-case 
time of sorting the points by x-coordinate is O(n lgn), we expect that the worst-case 
time of closest_pair is ©(nlgn). While this is true, the gap in the argument is that in the 
proof that merge sort has worst-case time ©(nlgn), we used the fact that, in the worst 
case, the recurrence for merge sort is actually ad, = @jn/2) + @\(n41)/2) +n — 1. Notice 
that in the recurrence, “=” has replaced “<.” We leave the technical details of filling the 
gap to the exercises (see Exercises 18-21). 

Making reasonable assumptions about what kinds of computations are allowed and 
using a decision tree model together with advanced methods, Preparata (see [Preparata, 
1985: Theorem 5.2, page 188]) shows that any algorithm that solves the closest-pair 
problem is Q(nlgn); thus Algorithm 7.4.2 is asymptotically optimal. 

It can be shown (Exercise 10) that there are at most six points in the rectangle 
of Figure 7.4.2 when the base is included and the other sides are excluded. This re- 
sult is the best possible since it is possible to place six points in the rectangle (Exer- 
cise 8). By considering the possible locations of the points in the rectangle, D. Lerner 
and R. Johnsonbaugh have shown that it suffices to compare each point in the strip 
with the next three points (rather than the next seven). This result is the best possi- 
ble since checking the next two points does not lead to a correct algorithm 
(Exercise 7). 


370 Chapter 7 @ Recurrence Relations 


7.4 Review Exercises 


1. What is computational geometry? 
2. State the closest-pair problem. 


3. Describe a brute-force closest-pair algorithm. 


4. Describe the divide-and-conquer closest-pair algorithm. 


5. Compare the worst-case times of the brute-force and divide- 
and-conquer closest-pair algorithms. 


1. Describe how the closest-pair algorithm finds the closest pair 
of points if the input is (8, 4), (3, 11), (12, 10), (5, 4), C1, 2), 
(17, 10), (8, 7), (8, 9), (11, 3), (1, 5), C11, 7), (5, 9), C1, 9), 
(7, 6), (3, 7), (14, 7). 


2. What can you conclude about input to the closest-pair algo- 
rithm when the output is zero for the distance between a clos- 
est pair? 

3. Give an example of input for which the closest-pair algorithm 
puts some points on the dividing line / into the left half and 
other points on / into the right half. 


4. Explain why in some cases, when dividing a set of points by a 
vertical line into two nearly equal parts, it is necessary for the 
line to contain some of the points. 


5. Write a closest-pair algorithm that finds one closest pair as 
well as the distance between the pair of points. 


6. Write an algorithm that finds the distance between a closest 
pair of points on a (straight) line. 

7. Give an example of input for which comparing each 
point in the strip with the next two points gives incorrect 
output. 


8. Give an example to show that it is possible to place six points 
in the rectangle of Figure 7.4.2 when the base is included and 
the other sides are excluded. 


9. When we compute the distances between a point p in the strip 
and points following it, can we stop computing distances from 
p if we find a point g such that the distance between p and g 
exceeds 5? Explain. 


«10. Show that there are at most six points in the rectangle of 
Figure 7.4.2 when the base is included and the other sides are 
excluded. 

11. Write a ©(nlgn) algorithm that finds the distance 6 
between a closest pair and, if 5 > 0, also finds all pairs 6 
apart. 

12. Write a O(nlgn) algorithm that finds the distance 6 between 
a closest pair and all pairs less than 26 apart. 


13. Write a ©(nlgn) algorithm that finds the distance 5 between 
a closest pair and all pairs less than 26 apart, where each pair 
less than 26 apart is listed exactly one time. 


14. Explain why the following algorithm does not find the distance 
5 between a closest pair and all pairs less than 26 apart. 


exercisel4(p, n) { 
5 = closest_pair (p, n) 
fori=lton—1 
for j =i+ 1 to min{n, i+ 31} 
if (dist(p;, pj) < 2 * 6) 
printing + “” +) 
} 


15. Write an algorithm that combines finding points in the strip 
and searching in the strip. In this way, the strip size changes 
dynamically so that, in general, this version checks fewer 
points in the strip than does Algorithm 7.4.2. 

16. Suppose that the distance between a closest pair in a set of n 
points in the plane is 6 > 0. Show that the number of pairs 
exactly 5 apart is O(n). 

17. Suppose that the distance between a closest pair in a set of n 
points in the plane is 5 > 0. Show that the number of pairs 
less than 26 apart is O(n). 


Let {an}P7°., be a sequence satisfying the recurrence 


An S A\n/2) + A,~n41)/2) + en, 
for some positive constant c and for alln > 3. 
Define the sequence {t,}°°, by the rules ty = az, 3 = 
max{a2, a3}, and 
th = tn/2) + tm+1y/2) +n ne 3: 
Exercises 18-21 deal with these recurrences. 


18. Prove that a, < t, for alln > 2. 
19. Prove that the sequence {f,}°° 


20. Prove that ¢, = O(nlgn). 


, is nondecreasing. 


21. Deduce that rec_cl_pair has worst-case time O(nlgn). 


Chapter 7 Notes 


Recurrence relations are treated more fully in [Liu, 1985; Roberts; and Tucker]. Several ap- 
plications to the analysis of algorithms are presented in [Johnsonbaugh]. 
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[Cull] gives algorithms for solving certain Tower of Hanoi problems with minimum 
space and time complexity. [Hinz] is a comprehensive discussion of the Tower of Hanoi with 
50 references. 

The cobweb in economics first appeared in [Ezekiel]. 

All data structures and algorithms books have extended discussions of searching and 
sorting (see, e.g., [Brassard; Cormen; Johnsonbaugh; Knuth, 1998b; Kruse; and Nyhoff]). 

{de Berg; Preparata, 1985; and Edelsbrunner] are books on computational geometry. 
The closest-pair algorithm in Section 7.4 is due to M. I. Shamos and appears in [Preparata, 
1985]. [Preparata, 1985] also gives a O(n 1g n) algorithm to find the closest pair in an arbitrary 
number of dimensions. 

Recurrence relations are also called difference equations. [Goldberg] contains a dis- 
cussion of difference equations and applications. 
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. Initial condition 
Compound interest 
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. Ackermann’s function 


An RWNE 


Section 7.2 


7. Solving a recurrence relation by iteration 

8. nth-order, linear homogeneous recurrence relation with 
constant coefficients and how to solve a second-order 
recurrence relation 

9. Population growth 
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10. How to find a recurrence relation that describes the time re- 
quired by a recursive algorithm 
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12. Binary search 

13. Merging sequences 

14. Merge sort 
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15. Computational geometry 
16. Closest-pair problem 
17. Closest-pair algorithm 


Chapter 7 Self-Test 


1. Answer parts (a)—(c) for the sequence defined by the rules: 


1. The first term is 3. 

2. The nth term is n plus the previous term. 
(a) Write the first four terms of the sequence. 
(b) Find an initial condition for the sequence. 


(c) Find a recurrence relation for the sequence. 


2. Is the recurrence relation 
An = Ayn—1 + An-3 


a linear homogeneous recurrence relation with constant 
coefficients? 


3. Assume that a person invests $4000 at 17 percent interest 
compounded annually. Let A, represent the amount at the 
end of n years. Find a recurrence relation and an initial con- 
dition for the sequence Ao, Aj,.... 


4. Let P, be the number of partitions of an n-element set. 
Show that the sequence Po, P|, ... satisfies the recurrence 
relation 


n—1 


P, = p> C(n— 1, k)Px. 


k=0 


In Exercises 5 and 6, solve the recurrence relation subject to the 


initial conditions. 


5. dy = —4ay-1 — 44n-23 a9 = 2, ay = 4 

6. An = 3dn—-1 + 10GQn_-23 a9 = 4, ay = 13 

7. Suppose that we have a 2 x n rectangular board divided 
into 2n squares. Let a, denote the number of ways to ex- 


actly cover this board by 1 x 2 dominoes. Show that the 
sequence {a,} satisfies the recurrence relation 


An = Ay—| + Ay-2. 


Show that a, = fn+1, where {f,,} is the Fibonacci sequence. 


8. Let c, denote the number of strings over {0, 1, 2} of length 
n that contain an even number of 1’s. Write a recur- 
rence relation and initial condition that define the sequence 
C1, C2,.... Solve the recurrence relation to obtain an ex- 
plicit formula for c,,. 
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Exercises 9-12 refer to the following algorithm. 


Algorithm 


Polynomial Evaluation 
This algorithm evaluates the polynomial 


Pa) = Dye 
k=0 


at the point f. 


Input: The sequence of coefficients co, 
C1, +--+, Cn, the value t, and n 
Output: p(t) 
poly(c, n, t) { 
a1) 
return Co 
return f * poly(c,n— 1,1) + cy 


} 


Let b, be the number of multiplications required to compute p(t). 


11. 
12. 


13. 


14. 


15. 


16. 


. Find a recurrence relation and an initial condition for the 
sequence {b,}. 


. Compute bj, bo, and b3. 
Solve the recurrence relation of Exercise 9. 


Suppose that we compute p(t) by a straightforward tech- 
nique that requires n — k multiplications to compute c,t"~*. 
How many multiplications would be required to compute 
p(t)? Would you prefer this method or the preceding algo- 
rithm? Explain. 

Describe how the closest-pair algorithm finds the closest 
pair of points for the input (10, 1), (7, 7), (3, 13), (6, 10), 
(16, 4), (10, 5), (7, 13), (13, 8), (4, 4), (2, 2), C1, 8), (10, 13), 
(7, 1), (4, 8), (12, 3), (16, 10), (14, 5), (10, 9). 


In order to terminate the recursion in the closest-pair algo- 
rithm, if there are three or fewer points in the input we find 
the closest pair directly. Why can’t we replace “three” by 
“two”? 


Show that there are at most four points in the lower half of 
the rectangle of Figure 7.4.3. 

What would the worst-case time of the closest-pair algo- 
rithm be if instead of merging pj,..., Px aNd Pets + +5 Pjs 
we used merge sort to sort pj, ... 


Chapter 7 Computer Exercises 


1. 


10. 


11. 


Write a program that prints the amount accumulated yearly 
if a person invests n dollars at p percent compounded 
annually. 


. Write a program that prints the amount accumulated yearly 


if a person invests n dollars at p percent annual interest com- 
pounded m times yearly. 


. Write a program that solves the three-peg Tower of Hanoi 


puzzle. 


. Write a program that solves the four-peg Tower of Hanoi 


puzzle in fewer moves than does the solution to the three- 
peg puzzle. 


. Write a program to display the cobweb of economics. 
. Write a program to compute Ackermann’s function. 


. Write a program that prints a solution to the n-node com- 


munication program (see Exercise 53, Section 7.1). 


. Write a program that prints all the ways to spend n dollars 


under the conditions of Exercise 55, Section 7.1. 


. Write a program that prints all n-bit strings that do not con- 


tain the pattern 010. 


Write a program to compute the Lucas sequence (see Exer- 
cise 68, Section 7.1). 


Write a program to list all rise/fall permutations of length n 
(see the definition before Exercise 74, Section 7.1). 


12 


13. 


14. 
15. 


16. 


17. 


18. 
19. 


. Implement Algorithms 7.3.1 (selection sort) and 7.3.8 
(merge sort) and other sorting algorithms as programs and 
compare the times needed to sort n items. 


Implement binary search (Algorithm 7.3.2) as a program. 
Measure the time used by the program for various keys 
and for various values of n. Compare these results 
with the theoretical estimate for the worst-case time 
@O(gn). 

Write a program to merge two sequences. 


Implement a computation of a" nonrecursively that uses re- 
peated multiplication and Algorithms 7.3.11 and 7.3.12 as 
computer programs, and compare the times needed to exe- 
cute each. 


Implement the methods of computing the largest and small- 
est elements in an array (see Exercises 50-60, Section 7.3) 
and compare the times needed to execute each. 


Implement the closest-pair algorithm (Algorithm 7.4.2) as 
a program. The program should find not only the closest 
distance but also a closest pair. 

Write a program to find all closest pairs in the plane. 


Write a program to find the closest-pair in three- 
dimensional space. An algorithm is given in [Preparata]. 
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Chapter 8 


GRAPH THEORY 


Although the first paper in graph theory goes back to 1736 (see Example 8.2.16) and 
several important results in graph theory were obtained in the nineteenth century, it is 
only since the 1920s that there has been a sustained, widespread, intense interest in graph 
theory. Indeed, the first text on graph theory ([K6nig]) appeared in 1936. Undoubtedly, 
one of the reasons for the recent interest in graph theory is its applicability in many 
diverse fields, including computer science, chemistry, operations research, electrical en- 
gineering, linguistics, and economics. 

We begin with some basic graph terminology and examples. We then discuss some 
important concepts in graph theory, including paths and cycles. Two classical graph 
problems, the existence of Hamiltonian cycles and the traveling salesperson problem, 
are then considered. A shortest-path algorithm is presented that efficiently finds a short- 
est path between two given points. After presenting ways of representing graphs, we 
study the question of when two graphs are essentially the same (i.e., when two graphs 
are isomorphic) and when a graph can be drawn in the plane without having any of its 
edges cross. We conclude by presenting a solution based on a graph model to the Instant 
Insanity puzzle. 


Introduction 


Figure 8.1.1 shows the highway system in Wyoming that a particular person is responsi- 
ble for inspecting. Specifically, this road inspector must travel all of these roads and file 
reports on road conditions, visibility of lines on the roads, status of traffic signs, and so 
on. Since the road inspector lives in Greybull, the most economical way to inspect all of 
the roads would be to start in Greybull, travel each of the roads exactly once, and return 
to Greybull. Is this possible? See if you can decide before reading on. 

The problem can be modeled as a graph. In fact, since graphs are drawn with dots 
and lines, they look like road maps. In Figure 8.1.2, we have drawn a graph G that models 
the map of Figure 8.1.1. The dots in Figure 8.1.2 are called vertices, and the lines that 
connect the vertices are called edges. (Later in this section we will define all of these 
terms carefully.) We have labeled each vertex with the first three letters of the city to 
which it corresponds. We have labeled the edges e1,..., @13. When we draw a graph, 


iThis section can be omitted without loss of continuity. 


373 


374 


Chapter 8 ® Graph Theory 


Sheridan 


Greybull 
Gillette 


Buffalo 


Shoshoni | 


Lander ° 


Douglas 


Muddy Gap 


Figure 8.1.1 Part of the Wyoming highway system. 


the only information of importance is which vertices are connected by which edges. For 
this reason, the graph of Figure 8.1.2 could just as well be drawn as in Figure 8.1.3. 


Wor e2 


Sho =f Gre 


Gre 
e2 
Wor 
6 
Sho 
en 
Lan 


ey 


e7 Buf &3 


Figure 8.1.2 A graph model Figure 8.1.3 An alternative, but 
of the highway system shown equivalent, graph model of the high- 
in Figure 8.1.1. way system shown in Figure 8.1.1. 


If we start at a vertex vo, travel along an edge to vertex vj, travel along another 
edge to vertex v2, and so on, and eventually arrive at vertex v,, we call the complete 
tour a path from vo to v,. The path that starts at She, then goes to Buf, and ends at Gil 
corresponds to a trip on the map of Figure 8.1.1 that begins in Sheridan, goes to Buffalo, 
and ends at Gillette. The road inspector’s problem can be rephrased for the graph model 
G in the following way: Is there a path from vertex Gre to vertex Gre that traverses every 
edge exactly once? 

We can show that the road inspector cannot start in Greybull, travel each of the 
roads exactly once, and return to Greybull. To put the answer in graph terms, there is 
no path from vertex Gre to vertex Gre in Figure 8.1.2 that traverses every edge exactly 
once. To see this, suppose that there is such a path and consider vertex Wor. Each time we 
arrive at Wor on some edge, we must leave Wor on a different edge. Furthermore, every 
edge that touches Wor must be used. Thus the edges at Wor occur in pairs. It follows 
that an even number of edges must touch Wor. Since three edges touch Wor, we have a 


Go Online 

For a C program to 
generate random graphs 
with specified properties, 
see g00.g¢1/G8Eaud 


Example 8.1.2 


Example 8.1.3 


U1 
e 
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e2 U4 es 
€6 U6 
U5 
e7 
Figure 8.1.4 A 
directed graph. 
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contradiction. Therefore, there is no path from vertex Gre to vertex Gre in Figure 8.1.2 

that traverses every edge exactly once. The argument applies to an arbitrary graph G. If 

G has a path from vertex v to v that traverses every edge exactly once, an even number of 

edges must touch each vertex. We discuss this problem in greater detail in Section 8.2. 
At this point we give some formal definitions. 


Definition 8.1.1 > A graph (or undirected graph) G consists of a set V of ver- 
tices (or nodes) and a set E of edges (or arcs) such that each edge e € E is associated 
with an unordered pair of vertices. If there is a unique edge e associated with the ver- 
tices v and w, we write e = (v, w) or e = (w, v). In this context, (v, w) denotes an edge 
between v and w in an undirected graph and not an ordered pair. 

A directed graph (or digraph) G consists of a set V of vertices (or nodes) and a 
set E of edges (or arcs) such that each edge e € E is associated with an ordered pair of 
vertices. If there is a unique edge e associated with the ordered pair (v, w) of vertices, 
we write e = (v, w), which denotes an edge from v to w. 

An edge e in a graph (undirected or directed) that is associated with the pair of 
vertices v and w is said to be incident on v and w, and v and w are said to be incident on 
e and to be adjacent vertices. 

If G is a graph (undirected or directed) with vertices V and edges E, we write 
G=\(V,E). 

Unless specified otherwise, the sets E and V are assumed to be finite and V is 
assumed to be nonempty. < 


In Figure 8.1.2 the (undirected) graph G consists of the set 
V = {Gre, She, Wor, Buf, Gil, Sho, Cas, Dou, Lan, Mud} 


of vertices and the set 
E = {e1, @2,...,e13} 


of edges. Edge e; is associated with the unordered pair {Gre, She} of vertices, and edge 
€i9 is associated with the unordered pair {Cas, Dou} of vertices. Edge e; is denoted 
(Gre, She) or (She, Gre), and edge ej9 is denoted (Cas, Dou) or (Dou, Cas). Edge e4 is 
incident on Wor and Buf, and the vertices Wor and Buf are adjacent. < 


A directed graph is shown in Figure 8.1.4. The directed edges are indicated by arrows. 
Edge e, is associated with the ordered pair (v2, v,) of vertices, and edge e7 is associated 
with the ordered pair (v6, v6) of vertices. Edge e; is denoted (v2, v1), and edge e7 is 
denoted (v6, v6). < 


Definition 8.1.1 allows distinct edges to be associated with the same pair of ver- 
tices. For example, in Figure 8.1.5, edges e; and e2 are both associated with the vertex 
pair {v), v2}. Such edges are called parallel edges. An edge incident on a single vertex 


e3 


v2 U4 Us 


e2 
v3 
E tie 
UI 


Figure 8.1.5 A graph with parallel edges and 
loops. 
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Example 8.1.4 


Example 8.1.5 


is called a loop. For example, in Figure 8.1.5, edge e3 = (v2, v2) is a loop. A vertex, 
such as vertex v4 in Figure 8.1.5, that is not incident on any edge is called an isolated 
vertex. A graph with neither loops nor parallel edges is called a simple graph. 


Since the graph of Figure 8.1.2 has neither parallel edges nor loops, it is a simple graph. 
< 


Some authors do not permit loops and parallel edges when they define graphs. One 
would expect that if agreement has not been reached on the definition of “graph,” most 
other terms in graph theory would also not have standard definitions. This is indeed 
the case. In reading articles and books about graphs, it is necessary to check on the 
definitions being used. 

We turn next to an example that shows how a graph model can be used to analyze 
a manufacturing problem. 


Frequently in manufacturing, it is necessary to bore many holes in sheets of metal (see 
Figure 8.1.6). Components can then be bolted to these sheets of metal. The holes can be 
drilled using a drill press under the control of a computer. To save time and money, the 
drill press should be moved as quickly as possible. Model the situation as a graph. 


° 
° 
° 
O° 
Figure 8.1.6 A sheet of Figure 8.1.7 A graph model 
metal with holes for bolts. of sheet metal in Figure 8.1.6. 


The edge weight is the time to 
move the drill press. 


SOLUTION The vertices of the graph correspond to the holes (see Figure 8.1.7). Every 
pair of vertices is connected by an edge. We write on each edge the time to move the 
drill press between the corresponding holes. A graph with numbers on the edges (such 
as the graph of Figure 8.1.7) is called a weighted graph. If edge e is labeled k, we say 
that the weight of edge e is k. For example, in Figure 8.1.7 the weight of edge (c, e) 
is 5. In a weighted graph, the length of a path is the sum of the weights of the edges 
in the path. For example, in Figure 8.1.7 the length of the path that starts at a, visits c, 
and terminates at b is 8. In this problem, the length of a path that starts at vertex vj and 
then visits v2, v3,..., in this order, and terminates at v, represents the time it takes the 
drill press to start at hole h; and then move to hz, h3,..., in this order, and terminate at 
hy, where hole h; corresponds to vertex v;. A path of minimum length that visits every 
vertex exactly one time represents the optimal path for the drill press to follow. 

Suppose that in this problem the path is required to begin at vertex a and end 
at vertex e. We can find the minimum-length path by listing all possible paths from a 
to e that pass through every vertex exactly one time and choose the shortest one (see 
Table 8.1.1). We see that the path that visits the vertices a, b, c, d, e, in this order, has 
minimum length. Of course, a different pair of starting and ending vertices might produce 
an even shorter path. 


Example 8.1.6 
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TABLE 8.1.1 @ Paths in 
the Graph of Figure 
8.1.7 from a to e That 
Pass Through Every 
Vertex Exactly One 
Time, and Their Lengths 


Path Length 
a,b,c, d,e 21 
a, b, d,c,e 28 
a,c, b,d,e 24 
a, c,d, b,e 26 
a,d,b,c,e 27 
a,d,c, b,e 22 


< 


Listing all paths from vertex v to vertex w, as we did in Example 8.1.5, is a rather 
time-consuming way to find a minimum-length path from v to w that visits every vertex 
exactly one time. Unfortunately, no one knows a method that is much more practical for 
arbitrary graphs. This problem is a version of the traveling salesperson problem. We 
discuss that problem in Section 8.3. 


Bacon Numbers Actor Kevin Bacon has appeared in numerous films including Diner 
and Apollo 13. Actors who have appeared in a film with Bacon are said to have Bacon 
number one. For example, Ellen Barkin has Bacon number one because she appeared 
with Bacon in Diner. Actors who did not appear in a film with Bacon but who appeared 
in a film with an actor whose Bacon number is one are said to have Bacon number 
two. Higher Bacon numbers are defined similarly. For example, Bela Lugosi has Bacon 
number three. Lugosi was in Black Friday with Emmett Vogan, Vogan was in With a 
Song in My Heart with Robert Wagner, and Wagner was in Wild Things with Bacon. 
Develop a graph model for Bacon numbers. 


SOLUTION We let vertices denote actors, and we place one edge between two distinct 
actors if they appeared in at least one film together (see Figure 8.1.8). In an unweighted 


Ellen 
Barkin Dennis 
Kevin Charles 
Bacon Siebert 


Robert Emmett Bela 
Wagner Vogan Lugosi 


Figure 8.1.8 Part of a graph that models Bacon numbers. The vertices denote 
actors. There is an edge between two distinct actors if they appeared in at least one 
film together. For example, there is an edge between Ellen Barkin and Dennis Quaid 
because they both appeared in The Big Easy. An actor’s Bacon number is the length 
of a shortest path from that actor to Bacon. For example, Bela Lugosi’s Bacon 
number is three because the length of a shortest path from Lugosi to Bacon is three. 
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Go Online 
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Example 8.1.7 


VI v2 
v3 

U4, v5 

Figure 8.1.9 A 


similarity graph 
corresponding to the 
programs of 

Table 8.1.2 with 

S = 25. 


graph, the length of a path is the number of edges in the path. Thus an actor’s Bacon 
number is the length of a shortest path from the vertex corresponding to that actor to 
the vertex corresponding to Bacon. In Section 8.4, we discuss the general problem of 
finding shortest paths in graphs. Unlike the situation in Example 8.1.5, there are efficient 
algorithms for finding shortest paths. 

It is interesting that most actors, even actors who died many years ago, have Bacon 
numbers of three or less. See Exercise 30 for a similar graph model. < 


Similarity Graphs This example deals with the problem of grouping “like” objects into 
classes based on properties of the objects. For example, suppose that a particular algo- 
rithm is implemented in C++ by a number of persons and we want to group “like” pro- 
grams into classes based on certain properties of the programs (see Table 8.1.2). Suppose 
that we select as properties 


1. The number of lines in the program 
2. The number of return statements in the program 


3. The number of function calls in the program 


TABLE 8.1.2 m C++ Programs That Implement the Same Algorithm 


Number of Number of Number of 
Program Program Lines return Statements Function Calls 
1 66 20 1 
2 41 10 2 
3 68 5 8 
4 90 34 5 
5 75 12 14 


A similarity graph G is constructed as follows. The vertices correspond to pro- 
grams. A vertex is denoted (pj, p2,p3), Where p; is the value of property 7. We de- 
fine a dissimilarity function s as follows. For each pair of vertices v = (pj, p2, p3) and 
w = (41, 92, q3), We set 


s(v, w) = |p — q1| + |p2 — G2] + |p3 — al. 
If we let v; be the vertex corresponding to program 7, we obtain 


s(V1, V2) = 36, s(v1, v3) = 24, s(v1, v4) = 42, 8(V1, V5) = 30, 
S(V2, V3) = 38, S(V2, V4) = 76, (V2, v5) = 48, 5(V3, v4) = 54, 
5(V3, V5) = 20, (V4, Vs) = 46. 


If v and w are vertices corresponding to two programs, s(v, w) is a measure of 
how dissimilar the programs are. A large value of s(v, w) indicates dissimilarity, while 
a small value indicates similarity. 

For a fixed number S, we insert an edge between vertices v and w if s(v, w) < S. 
(In general, there will be different similarity graphs for different values of S.) We say that 
v and w are in the same class if v = w or there is a path from v to w. In Figure 8.1.9 we 
show the graph corresponding to the programs of Table 8.1.2 with S = 25. In this graph, 
the programs are grouped into three classes: {1, 3, 5}, {2}, and {4}. In a real problem, an 
appropriate value for S might be selected by trial and error or the value of S might be 
selected automatically according to some predetermined criteria. < 


Example 8.1.8 


Go Online 

For more on the 
hypercube, see 
goo.gl/jfjmol 


000 001 


Figure 8.1.10 The 3-cube. 
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The n-Cube (Hypercube) The traditional computer, often called a serial computer, 
executes one instruction at a time. Our definition of “algorithm” also assumes that one 
instruction is executed at a time. Such algorithms are called serial algorithms. As hard- 
ware costs have declined, it has become feasible to build parallel computers with many 
processors that are capable of executing several instructions at a time. Graphs are often 
convenient models to describe these machines. The associated algorithms are known as 
parallel algorithms. Many problems can be solved much faster using parallel comput- 
ers rather than serial computers. We discuss one model for parallel computation known 
as the n-cube or hypercube. 

The n-cube has 2” processors, n > 1, which are represented by vertices (see 
Figure 8.1.10) labeled 0, 1, ...,2” — 1. Each processor has its own local memory. An 
edge connects two vertices if the binary representation of their labels differs in exactly 
one bit. During one time unit, all processors in the n-cube may execute an instruction 
simultaneously and then communicate with an adjacent processor. If a processor needs 
to communicate with a nonadjacent processor, the first processor sends a message that 
includes the route to, and ultimate destination of, the recipient. It may take several time 
units for a processor to communicate with a nonadjacent processor. 

The n-cube may also be described recursively. The 1-cube has two processors, 
labeled 0 and 1, and one edge. Let H; and Hz be two (n — 1)-cubes whose vertices are 
labeled in binary 0,...,2”~! — 1 (see Figure 8.1.11). We place an edge between each 
pair of vertices, one from H; and one from H2, provided that the vertices have identical 
labels. We then change the label L on each vertex in H to OL and we change the label L 
on each vertex in H> to 1L. We obtain an n-cube (Exercise 39). See Exercises 43-45 for 
an alternative way to construct the n-cube. 


010 


1110 1111 


0000 


Figure 8.1.11 Combining two 3-cubes to obtain a 4-cube. 


The n-cube is an important model of computation because several such machines 
have been built and are running. Furthermore, several other parallel computation mod- 
els can be simulated by the hypercube. The latter point is considered in more detail in 
Examples 8.3.5 and 8.6.3. < 
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Example 8.1.10 


Example 8.1.12 


v1 
i - 

v2 
U5 

U3 


Figure 8.1.13 A 
bipartite graph. 


Example 8.1.13 


We conclude this introductory section by defining some special graphs that appear 
frequently in graph theory. 


Definition 8.1.9 >» The complete graph on n vertices, denoted K,,, is the simple 
graph with n vertices in which there is an edge between every pair of distinct vertices. “ 


The complete graph on four vertices, K4, is shown in Figure 8.1.12. 


Figure 8.1.12 The 
complete graph Ky. < 


Definition 8.1.11 > A graph G = (V,E) is bipartite if there exist subsets 
V, and V2 (either possibly empty) of V such that V; N V2 = @, V; U V2 = V, and 
each edge in F is incident on one vertex in V; and one vertex in V2. < 


The graph in Figure 8.1.13 is bipartite since if we let 
Vi = {11, v2, v3} and Vo = {va, Vs}, 


each edge is incident on one vertex in V; and one vertex in V3. < 


Notice that Definition 8.1.11 states that if e is an edge in a bipartite graph, then e is 
incident on one vertex in V; and one vertex in V). It does not state that if v; is a vertex in 
V, and v2 is a vertex in V2, then there is an edge between vy and v2. For example, the graph 
of Figure 8.1.13 is bipartite since each edge is incident on one vertex in V; = {v1, v2, v3} 
and one vertex in V2 = {v4, v5}. However, not all edges between vertices in V; and V2 
are in the graph. For example, the edge (v1, vs) is absent. 


The graph in Figure 8.1.14 is not bipartite. It is often easiest to prove that a graph is not 
bipartite by arguing by contradiction. 


uy 
ae 
U3 
U7 
Uo) 
V6 
U8 v9 


Figure 8.1.14 A graph that is not 
bipartite. 

Suppose that the graph in Figure 8.1.14 is bipartite. Then the vertex set can be 
partitioned into two subsets V; and V2 such that each edge is incident on one vertex in 
V, and one vertex in V>. Now consider the vertices v4, v5, and v6. Since v4 and vs are 
adjacent, one is in V; and the other in V2. We may assume that vq is in V; and that vs is 
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in V2. Since vs and v¢ are adjacent and vs is in V2, v6 is in V;. Since vq and v¢ are adjacent 
and v4 is in Vj, v6 is in V2. But now ve is in both V; and V2, which is a contradiction 
since V, and V> are disjoint. Therefore, the graph in Figure 8.1.14 is not bipartite. <4 


Example 8.1.14 The complete graph K, on one vertex is bipartite. We may let V; be the set containing 
—e-—-———=_ the one vertex and V, be the empty set. Then each edge (namely none!) is incident on 
one vertex in V; and one vertex in V3. < 


Definition 8.1.15 >» Thecomplete bipartite graph on mand n vertices, denoted 
Kyn.n» 18 the simple graph whose vertex set is partitioned into sets V; with m vertices and 
V2 with n vertices in which the edge set consists of all edges of the form (11, v2) with 
yEeV, and v2 € V>. < 


Example 8.1.16 The complete bipartite graph on two and four vertices, K2,4, is shown in Figure 8.1.15. 


Figure 8.1.15 The 
complete bipartite 
graph K> 4. 


8.1 Problem-Solving Tips 


To model a given situation as a graph, you must first decide what the vertices represent. 
Then an edge between two vertices represents some kind of relation. For example, if 
several teams play soccer games, we could let the vertices represent the teams. We could 
then put an edge between two vertices (teams) if the two teams represented by the two 
vertices played at least one game. The graph would then show which teams have played 
each other. 

To determine whether a graph is bipartite, try to separate the vertices into two 
disjoint sets V; and V> so that each edge is incident on one vertex in one set and one 
vertex in the other set. If you succeed, the graph is bipartite and you have discovered the 
sets V; and V>. If you fail, the graph is not bipartite. To try to separate the vertices into 
two disjoint sets, pick a start vertex v. Put v € V;. Put all vertices adjacent to v in Vp. 
Pick a vertex w € V). Put all vertices adjacent to w in V;. Pick a vertex v’ € Vo,’ #v. 
Put all vertices adjacent to v’ in V2. Continue in this way. If you can put each vertex into 
either V; or V2, but not both, the graph is bipartite. If at some point, you are forced to 
put a vertex into both V; and V2, the graph is not bipartite. 


8.1 Review Exercises 


1. Define undirected graph. 3. Define directed graph. 


2. Give an example of something in the real world that can be 4. Give an example of something in the real world that can be 
modeled by an undirected graph. modeled by a directed graph. 


382 


10. 
11. 
12. 
13. 


14. 


Vn 
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. What does it mean for an edge to be incident on a vertex? 


What does it mean for a vertex to be incident on an edge? 


What does it mean for v and w to be adjacent vertices? 


. What are parallel edges? 
. What is a loop? 


What is an isolated vertex? 
What is a simple graph? 
What is a weighted graph? 


Give an example of something in the real world that can be 
modeled by a weighted graph. 


Define length of path in a weighted graph. 


22. 
23. 


. What is a similarity graph? 

. Define n-cube. 

. What is a serial computer? 

. What is a serial algorithm? 

. What is a parallel computer? 
. What is a parallel algorithm? 


. What is the complete graph on n vertices? How is it 


denoted? 
Define bipartite graph. 


What is the complete bipartite graph on m and n vertices? How 
is it denoted? 


8.1 Exercises 


Ina tournament, the Snow beat the Pheasants once, the Skyscrap- 
ers beat the Tuna once, the Snow beat the Skyscrapers twice, the 
Pheasants beat the Tuna once, and the Pheasants beat the Snow 
once. In Exercises 1-4, use a graph to model the tournament. The 
teams are the vertices. Describe the kind of graph used (e.g., undi- 
rected graph, directed graph, simple graph). 


1. There is an edge between teams if the teams played. 


. There is an edge from team #; to team 4; if ¢; beat f; at least one 


time. 


. There is an edge between teams for each game played. 


. There is an edge from team 1; to team 4; for each victory of f 


over fj. 


Explain why none of the graphs in Exercises 5—7 has a path from 
a to a that passes through each edge exactly one time. 


5. 


a 


Show that each graph in Exercises 8-10 has a path from a to a that 
passes through each edge exactly one time by finding such a path 
by inspection. 


8. 


10. 


a 
b c 
d e 
ts 
a b c 


For each graph G = (V, E) in Exercises 11-13, find V, E, all par- 
allel edges, all loops, all isolated vertices, and tell whether G is a 
simple graph. Also, tell on which vertices edge e is incident. 


11. ee 
Uy 
e6 2 
U3 
v2 &3 eg v4 
12: 
er 
v2 
a . Go Online 
6 For more on 
Paul Erdés and Erdés 
a ve numbers, see 
4 eg goo.gl/jfjmol 
13. isi 
Uz © ° U3 


14. Draw K3 and Ks. 


15. Give an example of a bipartite graph different from those in 
the examples of this section. Specify the disjoint vertex sets. 


16. Find a formula for the number of edges in Ky. 


State which graphs in Exercises 17-23 are bipartite graphs. If the 
graph is bipartite, specify the disjoint vertex sets. 


17. 
vy v2 
e2 63 
ey e4 
U4 es US 
18. 
Vy v2 U3 U4 
ey e3 e5 e6 
e e e 
2 4 7 + Us 
U5 V6 U7 eg 
9 
Vv 
Vo 10 


19. Figure 8.1.2 
21. Exercise 11 22. Exercise 12 
23. Exercise 13 24. Draw K23 and K33. 


25. Find a formula for the number of edges in Ky. 


20. Figure 8.1.5 


26. Most authors require that V; and V2 be nonempty in Defini- 
tion 8.1.11. According to these authors, which of the graphs 
in Examples 8.1.12—8.1.14 are bipartite? 
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In Exercises 27-29, find a path of minimum length from v to w in 
the graph of Figure 8.1.7 that passes through each vertex exactly 
one time. 


27. v=b,w=e 
29. v=c,w=d 


30. Paul Erdéds (1913-1996) was one of the most prolific mathe- 
maticians of all time. He was the author or co-author of nearly 
1500 papers. Mathematicians who co-authored a paper with 
Erdés are said to have Erdés number one. Mathematicians who 
did not co-author a paper with Erdés but who co-authored a 
paper with a mathematician whose Erdés number is one are 
said to have Erdés number two. Higher Erdés numbers are de- 
fined similarly. For example, the author of this book has Erdés 
number five. Johnsonbaugh co-authored a paper with Tadao 
Murata, Murata co-authored a paper with A. T. Amin, Amin 
co-authored a paper with Peter J. Slater, Slater co-authored 
a paper with Frank Harary, and Harary co-authored a paper 
with Erdés. Develop a graph model for Erdés numbers. In your 
model, what is an Erd6s number? 


28. v=a,w=b 


31. Is the graph model for Bacon numbers (see Example 8.1.6) a 
simple graph? 

32. Draw the similarity graph that results from setting S = 40 in 
Example 8.1.7. How many classes are there? 


33. Draw the similarity graph that results from setting S = 50 in 
Example 8.1.7. How many classes are there? 


34. In general, is “is similar to” an equivalence relation? 

35. Suggest additional properties for Example 8.1.7 that might be 
useful in comparing programs. 

36. How might one automate the selection of S to group data into 
classes using a similarity graph? 

37. Draw a 2-cube. 

38. Draw a picture like that in Figure 8.1.11 to show how a 3-cube 
may be constructed from two 2-cubes. 


39. Prove that the recursive construction in Example 8.1.8 actually 
yields an n-cube. 

40. How many edges are incident on a vertex in an n-cube? 

41. How many edges are in an n-cube? 

*42. In how many ways can the vertices of an n-cube be labeled 
0,..., 2” — 1 so that there is an edge between two vertices if 
and only if the binary representation of their labels differs in 
exactly one bit? 


[Bain] invented an algorithm to draw the n-cube in the plane. In 
the algorithm, all vertices are on the unit circle in the xy-plane. 
The angle of a point is the angle from the positive x-axis coun- 
terclockwise to the ray from the origin to the point. The input 
is n. 

1. Ifn = 0, put one unlabeled vertex at (—1, 0) and stop. 

2. Recursively invoke this algorithm with input n — 1. 

3. Move each vertex so that its new angle is half the current 

angle, maintaining edge connections. 


4. Reflect each vertex and edge in the x-axis. 
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5. Connect each vertex above the x-axis to its mirror image 
below the x-axis. 

6. Prefix 0 to the label of each vertex above the x-axis, and 
prefix 1 to the label of each vertex below the x-axis. 


The following figures show how the algorithm draws the 2-cube 
and 3-cube. 


01 00 011 001 


111101 


11 10 


43. Show how the algorithm constructs the 2-cube from the 
1-cube. 


44. Show how the algorithm constructs the 3-cube from the 
2-cube. 


45. Show how the algorithm constructs the 4-cube from the 
3-cube. 


Exercises 46-48 refer to the following graph. The vertices repre- 
sent offices. An edge connects two offices if there is a communica- 
tion link between the two. Notice that any office can communicate 
with any other either directly through a communication link or by 
having others relay the message. 


a 


b c 


46. Show, by giving an example, that communication among all 
offices is still possible even if some communication links are 
broken. 


47. What is the maximum number of communication links that 
can be broken with communication among all offices still pos- 
sible? 

48. Show a configuration in which the maximum number of com- 
munication links are broken with communication among all 
offices still possible. 


49. In the following graph the vertices represent cities and the 
numbers on the edges represent the costs of building the indi- 
cated roads. Find a least-expensive road system that connects 
all the cities. 


a 
5 10 
b Cc 
ae | 30 
6 e 
Via a 
f 14 8g 


Ina precedence graph, the vertices model certain actions. For ex- 
ample, a vertex might model a statement in a computer program. 
There is an edge from vertex v to vertex w if the action modeled by 
v must occur before the action modeled by w. Draw a precedence 
graph for each computer program in Exercises 50-52. 


50. x=1 51. x=1 52.. x= 1 
y=2 y= 2 y=2 
Z=xt+y z= 2=y2 
z=z+1 a=x+y w=x+5 
b=y+z x=z+w 
c=x+z 
c=c+1 
x=atb+c 
53. Let G denote the set of simple graphs G = (V,E), where 
V ={1 2,05. n} for some n € Z*. Define a function f from 


G to Z"""*s by the rule f(G) = |E|. Is f one-to-one? Is f onto? 
Explain. 


8.2 Paths and Cycles 


If we think of the vertices in a graph as cities and the edges as roads, a path corresponds 
to a trip beginning at some city, passing through several cities, and terminating at some 
city. We begin by giving a formal definition of path. 


Definition 8.2.1 > Let vo and v, be vertices in a graph. A path from vo to vy, of 
length n is an alternating sequence of n + | vertices and n edges beginning with vertex 
vo and ending with vertex v,, 


(Vo, €1, Vis €2, V25-- 


+> Vn—-15 ens Vn)s 


in which edge e; is incident on vertices v;_; and v; fori=1,...,n. < 


Example 8.2.2 


Example 8.2.3 


Example 8.2.5 


Example 8.2.6 
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The formalism in Definition 8.2.1 means: Start at vertex vo; go along edge e; to 
v1; go along edge e2 to v2; and so on. 


In the graph of Figure 8.2.1, 
(1, e1, 2, e2, 3, €3, 4, e4, 2) (8.2.1) 


is a path of length 4 from vertex 1| to vertex 2. < 


Figure 8.2.1 A connected graph 
with paths (1, e;, 2, e2, 3, e3, 4, e4, 2) 
of length 4 and (6) of length 0. 


In the graph of Figure 8.2.1, the path (6) consisting solely of vertex 6 is a path of length 
0 from vertex 6 to vertex 6. < 


In the absence of parallel edges, in denoting a path we may suppress the edges. 
For example, the path (8.2.1) may also be written (1, 2, 3, 4, 2). 

A connected graph is a graph in which we can get from any vertex to any other 
vertex on a path. The formal definition follows. 


Definition 8.2.4 > A graph G is connected if given any vertices v and w in G, 
there is a path from v to w. < 


The graph G of Figure 8.2.1 is connected since, given any vertices v and w in G, there 
is a path from v to w. < 


The graph G of Figure 8.2.2 is not connected since, for example, there is no path from 
vertex v2 to vertex vs. 


U2 
U4 
e1 (3) ° U6 
e4 
Us 
Vy e3° «U3 
G 


Figure 8.2.2 A graph that is 
not connected. 4 
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Example 8.2.7 


Example 8.2.9 


Example 8.2.10 


Let G be the graph whose vertex set consists of the 50 states of the United States. Put 
an edge between states v and w if v and w share a border. For example, there is an edge 
between California and Oregon and between Illinois and Missouri. There is no edge 
between Georgia and New York, nor is there an edge between Utah and New Mexico. 
(Touching does not count; the states must share a border.) The graph G is not connected 
because there is no path from Hawaii to California (or from Hawaii to any other state). 4 


As we can see from Figures 8.2.1 and 8.2.2, a connected graph consists of one 
“piece,” while a graph that is not connected consists of two or more “pieces.” These 
“pieces” are subgraphs of the original graph and are called components. We give the 
formal definitions beginning with subgraph. 

A subgraph G’ of a graph G is obtained by selecting certain edges and vertices from 
G subject to the restriction that if we select an edge e in G that is incident on vertices v 
and w, we must include v and w in G’. The restriction is to ensure that G’ is actually a 
graph. The formal definition follows. 


Definition 8.2.8 > Let G = (V,E) bea graph. We call (V’, E’) a subgraph 
of G if 
(a) V CVandE’ CE. 


(b) For every edge e’ € E’, if e’ is incident on v’ and w’, then v’, w’ € V’. < 


The graph G’ = (V’, E’) of Figure 8.2.3 is a subgraph of the graph G = (V, £) of 
Figure 8.2.4 since V’ C Vand E’ C E. 


U2 vy 
10 eit 
G 
v3 °8 v4 uy &5 v7 
Figure 8.2.3 A subgraph of the Figure 8.2.4 A graph, one of 
graph of Figure 8.2.4. whose subgraphs is shown in Fig- < 


ure 8.2.3. 
Find all subgraphs of the graph G of Figure 8.2.5 having at least one vertex. 


SOLUTION If we select no edges, we may select one or both vertices yielding the 
subgraphs G;, G2, and G3 shown in Figure 8.2.6. If we select the one available edge 
e,, we must select the two vertices on which e is incident. In this case, we obtain the 
subgraph Gy, shown in Figure 8.2.6. Thus G has the four subgraphs shown in Figure 8.2.6. 


v2 v2 U2 
iy . . 
U1 we 
we v1 "1 
v1 G G, G2 G3 G4 


Figure 8.2.5 The graph Figure 8.2.6 The four subgraphs of 
for Example 8.2.10. the graph of Figure 8.2.5. < 


We can now define “component.” 


Example 8.2.12 


Example 8.2.13 


Example 8.2.15 
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Definition 8.2.11 >» Let Gbea graph and let v be a vertex in G. The subgraph 
G’ of Gconsisting of all edges and vertices in G that are contained in some path beginning 
at v is called the component of G containing v. < 


The graph G of Figure 8.2.1 has one component, namely itself. Indeed, a graph is con- 
nected if and only if it has exactly one component. < 


Let G be the graph of Figure 8.2.2. The component of G containing v3 is the subgraph 
G, = (Vi, £1), Vi = (v1, V2, v3}, E| = {e1, é2, e3}. 
The component of G containing v4 is the subgraph 
Gy = (V2, Er), Vo = {va}, Ey = 2. 
The component of G containing vs is the subgraph 


G3 = (V3, E3), V3 = {Vs5, Ve}, E3 = {eq}. < 


Another characterization of the components of a graph G = (V, E) is obtained by 
defining a relation R on the set of vertices V by the rule 


v;Rv2_ if there is a path from v, to v2. 


It can be shown (Exercise 69) that R is an equivalence relation on V and that if v € V, 
the set of vertices in the component containing v is the equivalence class 


[v] = {we V | wRv}. 


Notice that the definition of “path” allows repetitions of vertices or edges or both. 
In the path (8.2.1), vertex 2 appears twice. 

Subclasses of paths are obtained by prohibiting duplicate vertices or edges or by 
making the vertices vo and v, of Definition 8.2.1 identical. 


Definition 8.2.14 > Let v and w be vertices in a graph G. 
A simple path from v to w is a path from v to w with no repeated vertices. 
A cycle (or circuit) is a path of nonzero length from v to v with no repeated 


edges. 
A simple cycle is a cycle from v to v in which, except for the beginning and ending 
vertices that are both equal to v, there are no repeated vertices. < 


For the graph of Figure 8.2.1, we have the following information. 


Path Simple Path? Cycle? Simple Cycle? 
(6, 5, 2, 4, 3, 2, 1) No No No 
(6, 5, 2, 4) Yes No No 
(2, 6, 5, 2, 4, 3, 2) No Yes No 
(5, 6, 2, 5) No Yes Yes 
(7) Yes No No 


< 


We next reexamine the problem introduced in Section 8.1 of finding a cycle in a 
graph that traverses each edge exactly one time. 
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Example 8.2.16 K6nigsberg Bridge Problem The first paper in graph theory was Leonhard Euler’s in 


Go Online 

For more on the 
Konigsberg Bridge 
Problem, see 
goo.gl/jfjmol 


Go Online 

A biography of 
Euler is at 
goo.gl/jfjmol 


Theorem 8.2.17 


Theorem 8.2.18 


1736. The paper presented a general theory that included a solution to what is now called 
the Konigsberg bridge problem. 

Two islands lying in the Pregel River in K6nigsberg (now Kaliningrad in Russia) 
were connected to each other and the river banks by bridges, as shown in Figure 8.2.7. 
The problem is to start at any location—A, B, C, or D; walk over each bridge exactly 
once; then return to the starting location. 

The bridge configuration can be modeled as a graph, as shown in Figure 8.2.8. 
The vertices represent the locations and the edges represent the bridges. The Konigsberg 
bridge problem is now reduced to finding a cycle in the graph of Figure 8.2.8 that includes 
all of the edges and all of the vertices. In honor of Euler, a cycle in a graph G that includes 
all of the edges and all of the vertices of G is called an Euler cycle.’ From the discussion 
of Section 8.1, we see that there is no Euler cycle in the graph of Figure 8.2.8 because 
the number of edges incident on vertex A is odd. (In fact, in the graph of Figure 8.2.8, 
every vertex is incident on an odd number of edges.) 


A A 


— D 


Figure 8.2.8 A graph 
: F ee model of the bridges of 
Figure 8.2.7 The bridges of K6nigsberg. einigsberd. < 

The solution to the existence of Euler cycles is nicely stated by introducing the 
degree of a vertex. The degree of a vertex v, 5(v), is the number of edges incident on v. 
(By definition, each loop on v contributes 2 to the degree of v.) In Section 8.1 we found 
that if a graph G has an Euler cycle, then every vertex in G has even degree. We can also 
prove that G is connected. 


If a graph G has an Euler cycle, then G is connected and every vertex has even 
degree. 


Proof Suppose that G has an Euler cycle. We argued in Section 8.1 that every vertex 
in G has even degree. If v and w are vertices in G, the portion of the Euler cycle that 
takes us from v to w serves as a path from v to w. Therefore, G is connected. < 


The converse of Theorem 8.2.17 is also true. We give a proof by mathematical 
induction due to [Fowler]. 


If G is a connected graph and every vertex has even degree, then G has an Euler 
cycle. 


Proof The proof is by induction on the number n of edges in G. 


{For technical reasons, if G consists of one vertex v and no edges, we call the path (v) an Euler cycle for G. 
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Basis Step (n=0) 


Since G is connected, if G has no edges, G consists of a single vertex. An Euler cycle 
consists of the single vertex and no edges. 


Inductive Step 


Suppose that G has n edges, n > 0, and that any connected graph with k edges, k < n, 
in which every vertex has even degree, has an Euler cycle. 

It is straightforward to verify that a connected graph with one or two vertices, each 
of which has even degree, has an Euler cycle (see Exercise 70); thus we assume that the 
graph has at least three vertices. 

Since G is connected, there are vertices vj, v2, and v3 in G with edge e; incident on 
v, and v2 and edge e incident on v2 and v3. We delete edges e; and é, but no vertices, and 
add an edge e incident on v, and v3 to obtain the graph G’ [see Figure 8.2.9(a)]. Notice 
that each component of the graph G’ has less than n edges and that in each component 
of the graph G’, every vertex has even degree. We show that G’ has either one or two 
components. 

Let v be a vertex. Since G is connected, there is a path P in G from v to v;. Let P’ 
be the portion of the path P starting at v whose edges are also in G’. Now P’ ends at either 
V1, V2, or v3 because the only way that P could fail to be a path in G’ is that P contains 
one of the deleted edges e, or e. If P’ ends at vj, then v is in the same component as v, 
in G’. If P’ ends at v3 [see Figure 8.2.9(b)], then v is in the same component as v3 in G’, 
which is in the same component as v; in G’ (since edge e in G’ is incident on v, and v3). 
If P’ ends at v2, then v2 is in the same component as v. Therefore, any vertex in G’ is in 
the same component as either v; or v2. Thus G’ has one or two components. 

If G’ has one component, that is, if G’ is connected, we may apply the inductive 
hypothesis to conclude that G’ has an Euler cycle C’. This Euler cycle may be modified 
to produce an Euler cycle in G: We simply replace the occurrence of edge e in C’ by 
edges e; and eo. 

Suppose that G’ has two components [see Figure 8.2.9(c)]. By the inductive hy- 
pothesis, the component containing v; has an Euler cycle C’ and the component con- 
taining v2 has an Euler cycle C” beginning and ending at v2. An Euler cycle in G is ob- 
tained by modifying C’ by replacing (v1, v3) in C’ by (11, v2) followed by C” followed 


(b) 


Figure 8.2.9 The proof of Theorem 8.2.18. In (a), edges e; and e> are deleted and edge ¢ is added. In (b), 
P (shown in color) is a path in G from v to v;, and P’ (shown in heavy color) is the portion of P starting at v 
whose edges are also in G’. As shown, P’ ends at v3. Since edge ¢ is in G’, there is a path in G’ from v to v1. 
Thus v and vy; are in the same component. In (c), C’ (shown with a heavy line) is an Euler cycle for one 
component, and C” (shown with a light, solid line) is an Euler cycle for the other component. If we replace e 
in C’ by e;, C”, e2, we obtain an Euler cycle (shown in color) for G. 
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v1 


U3 


Figure 8.2.10 The graph 
for Example 8.2.19. 
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by (v2, v3) or by replacing (v3, v1) in C’ by (v3, v2) followed by C” followed by (v2, v1). 
The Inductive Step is complete; G has an Euler cycle. < 


If G is a connected graph and every vertex has even degree and G has only a few 
edges, we can usually find an Euler cycle by inspection. 


Let G be the graph of Figure 8.2.10. Use Theorem 8.2.18 to verify that G has an Euler 
cycle. Find an Euler cycle for G. 


SOLUTION We observe that G is connected and that 
5(v1) = b(v2) = 6(¥3) = bs) = 4, 


3(v4) = 6, 5(v6) = 5(v7) = 2. 


Since the degree of every vertex is even, by Theorem 8.2.18, G has an Euler cycle. By 
inspection, we find the Euler cycle 


< 


(V6, V4, V7, V5, V1, V3, V45 Vis V2, V5, V4, V2, V3, Vo). 


A domino is a rectangle divided into two squares with each square numbered one of 
0, 1,..., 6 (see Figure 8.2.11). Two squares on a single domino can have the same num- 
ber. Show that the distinct dominoes can be arranged in a circle so that touching domi- 
noes have adjacent squares with identical numbers. 


Figure 8.2.11 Dominoes. [Photo by 
the author.| 


SOLUTION We model the situation as a graph G with seven vertices labeled 0, 1, ..., 6. 
The edges represent the dominoes: There is one edge between each distinct pair of ver- 
tices and there is one loop at each vertex. Notice that G is connected. Now the domi- 
noes can be arranged in a circle so that touching dominoes have adjacent squares with 
identical numbers if and only if G contains an Euler cycle. Since the degree of each 
vertex is 8 (remember that a loop contributes 2 to the degree), each vertex has even 
degree. By Theorem 8.2.18, G has an Euler cycle. Therefore, the dominoes can be 
arranged in a circle so that touching dominoes have adjacent squares with identical 
numbers. < 


What can be said about a connected graph in which not all the vertices have even 
degree? The first observation (Corollary 8.2.22) is that the number of vertices of odd 
degree is even. This follows from the fact (Theorem 8.2.21) that the sum of all of the 
degrees in a graph is an even number. 


Theorem 8.2.21 


Corollary 8.2.22 


Theorem 8.2.23 


Theorem 8.2.24 


8.2 ® PathsandCycles 391 


If Gis a graph with m edges and vertices {v), v2,..., Vn}, then 


3 5(¥;) = 2m. 
=! 


In particular, the sum of the degrees of all the vertices in a graph is even. 


Proof When we sum over the degrees of all the vertices, we count each edge (14, v;) 
twice—once when we count it as (v;, vj) in the degree of vj and again when we count it 
as (v;, vj) in the degree of v;. The conclusion follows. < 


In any graph, the number of vertices of odd degree is even. 


Proof Let us divide the vertices into two groups: those with even degree x), ..., Xm 
and those with odd degree y1,..., yn. Let 


S = d(x) + 602) +++ +5 Qn), T = (yi) + 8(y2) + +++ +8 0n)- 


By Theorem 8.2.21, S + T is even. Since S is the sum of even numbers, S is even. Thus 
T is even. But T is the sum of n odd numbers, and therefore n is even. < 


Suppose that a connected graph G has exactly two vertices v and w of odd degree. 
Let us temporarily insert an edge e from v to w. The resulting graph G’ is connected and 
every vertex has even degree. By Theorem 8.2.18, G’ has an Euler cycle. If we delete e 
from this Euler cycle, we obtain a path with no repeated edges from v to w containing 
all the edges and vertices of G. We have shown that if a graph has exactly two vertices v 
and w of odd degree, there is a path with no repeated edges containing all the edges and 
vertices from v to w. The converse can be proved similarly. 


A graph has a path with no repeated edges from v to w(v # w) containing all the 
edges and vertices if and only if it is connected and v and w are the only vertices 
having odd degree. 


Proof Suppose that a graph has a path P with no repeated edges from v to w contain- 
ing all the edges and vertices. The graph is surely connected. If we add an edge from v 
to w, the resulting graph has an Euler cycle, namely, the path P together with the added 
edge. By Theorem 8.2.17, every vertex has even degree. Removing the added edge af- 
fects only the degrees of v and w, which are each reduced by 1. Thus in the original 
graph, v and w have odd degree and all other vertices have even degree. 

The converse was discussed just before the statement of the theorem. < 


Generalizations of Theorem 8.2.23 are given as Exercises 43 and 45. 
We conclude by proving a rather special result that we will use in Section 9.2. 


If a graph G contains a cycle from v to v, G contains a simple cycle from v to v. 


Proof Let 


C = (V0, C1, V5 s+ 5 Cis Vig CEs «0 + 5 Sfp Vip C41, Vad y © + 0s Ons Vn) 
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be a cycle from v to v where v = vo = v, (see Figure 8.2.12). If C is not a simple cycle, 
then v; = vj, for some i < j < n. We can replace C by the cycle 


/ 
C = (v0, 41, Vi, -- 


+s Ciy Vin CFs Vit dy «+> Ons Vn) 


If C’ is not a simple cycle from v to v, we repeat the previous procedure. Eventually we 
obtain a simple cycle from v to v. 


en 


Figure 8.2.12 A cycle that either is a simple cycle 
or can be reduced to a simple cycle. < 


8.2 Review Exercises 


—_ 


. What is a path? 

. What is a simple path? 

. Give an example of a path that is not a simple path. 

. What is a cycle? 

. What is a simple cycle? 

. Give an example of a cycle that is not a simple cycle. 
. Define connected graph. 


. Give an example of a connected graph. 


Se FN DH nD FF WB WY 


. Give an example of a graph that is not connected. 


= 
S 


. What is a subgraph? 


= 
—_ 


. Give an example of a subgraph. 


— 
nN 


. What is a component of a graph? 


= 
iv) 


. Give an example of a component of a graph. 


— 
_ 


. Ifa graph is connected, how many components does it have? 


18. 


19. 


20. 


21. 


22. 


23. 


. Define degree of vertex v. 
. What is an Euler cycle? 


. State a necessary and sufficient condition that a graph have an 
Euler cycle. 


Give an example of a graph that has an Euler cycle. Specify 
the Euler cycle. 


Give an example of a graph that does not have an Euler cycle. 
Prove that it does not have an Euler cycle. 


What is the relationship between the sum of the degrees of the 
vertices in a graph and the number of edges in the graph? 


In any graph, must the number of vertices of odd degree be 
even? 


State a necessary and sufficient condition that a graph have a 
path with no repeated edges from v to w (v # w) containing 
all the edges and vertices. 


If a graph G contains a cycle from v to v, must G contain a 
simple cycle from v to v? 


8.2 Exercises 


In Exercises 1-9, tell whether the given path in the graph is 
(a) A simple path 
(b) A cycle 
(c) A simple cycle 


1. (b,b) 2. (a, d, c,d, e) 

3. (e,d,c,b) 4. (d,c, b,e, d) 

5. (b,c, d, e, b, b) 6. (b,c, d, a, b, e, d, c, b) 
7. (a,d,c, b, e) 8. (d,c, b) 

9. (d) 


In Exercises 10-18, draw a graph having the given properties or 
explain why no such graph exists. 


10. 
11. 
12. 
13. 
14. 
15. 
16. 
17. 
18. 
19. 


20. 
21. 


Six vertices each of degree 3 

Four vertices each of degree 1 

Five vertices each of degree 3 

Six vertices; four edges 

Four vertices having degrees 1, 2, 3, 4 

Four edges; four vertices having degrees 1, 2, 3, 4 
Simple graph; six vertices having degrees 1, 2, 3, 4,5, 5 
Simple graph; five vertices having degrees 2, 2, 4, 4, 4 
Simple graph; five vertices having degrees 2, 3, 3, 4, 4 
Find all the simple cycles in the following graph. 


c d 
QO » > e 
a 

g f 


Find all simple paths from a to e in the graph of Exercise 19. 


Find all connected subgraphs of the following graph contain- 
ing all of the vertices of the original graph and having as few 
edges as possible. Which are simple paths? Which are cycles? 
Which are simple cycles? 


c d 
Zee] | 
a 

8 f 


Find the degree of each vertex for the following graphs. 


22. 


23. 


v1 i AI 
4 
V3 v6 v9 
v1 V7 
v4 
v2 U5 U8 v10 


In Exercises 24-27, find all subgraphs having at least one vertex 
of the graph given. 
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24. ey 25, Uy 
VI v2 a é 
v2 v3 
26. ey «27. VI 
<< 
e2 
v2 v3 
In Exercises 28-33, decide whether the graph has an Euler cycle. 


If the graph has an Euler cycle, exhibit one. 


28. Exercise 21 29. Exercise 22 

30. Exercise 23 31. Figure 8.2.4 

32. 

33. 

34. The following graph is continued to an arbitrary, finite depth. 


35. 
36. 


37. 


Does the graph contain an Euler cycle? If the answer is yes, 
describe one. 


When does the complete graph K,, contain an Euler cycle? 


When does the complete bipartite graph K,,,, contain an Euler 
cycle? 
For which values of m and n does the graph contain an Euler 
cycle? 

m vertices 


A 


n vertices 4 
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38. For which values of n does the n-cube contain an Euler cycle? 


In Exercises 39 and 40, verify that the number of vertices of odd 
degree in the graph is even. 


39. 


40. 


Cc d 


41. A sports conference has 11 teams. It was proposed that each 
team play precisely one game against each of exactly nine 
other conference teams. Prove that this proposal is impossible 
to implement. 


42. For the graph of Exercise 39, find a path with no repeated 
edges from d to e containing all the edges. 


43. Let G be a connected graph with four vertices v1, v2, v3, and 
v4 of odd degree. Show that there are paths with no repeated 
edges from 1; to v2 and from v3 to v4 such that every edge in 
Gis in exactly one of the paths. 


44. Illustrate Exercise 43 using the following graph. 
a b c e 


‘a g h 
i 
45. State and prove a generalization of Exercise 43 where the 
number of vertices of odd degree is arbitrary. 


In Exercises 46 and 47, tell whether each assertion is true or false. 
If false, give a counterexample and if true, prove it. 


46. Let G be a graph and let v and w be distinct vertices. If there 
is a path from v to w, there is a simple path from v to w. 


47. Ifa graph contains a cycle that includes all the edges, the cycle 
is an Euler cycle. 


48. Let G be a connected graph. Suppose that an edge e is ina 
cycle. Show that G with e removed is still connected. 


49. Give an example of a connected graph such that the removal 
of any edge results in a graph that is not connected. (Assume 
that removing an edge does not remove any vertices.) 


x50. Cana knight move around a chessboard and return to its origi- 
nal position making every move exactly once? (A move is con- 


sidered to be made when the move is made in either direction.) 


51. Show that if G’ is a connected subgraph of a graph G, then G’ 
is contained in a component. 


52. Show that if a graph G is partitioned into connected subgraphs 
so that each edge and each vertex in G belong to one of the 
subgraphs, the subgraphs are components. 


53. Let G be a directed graph and let G’ be the undirected graph 
obtained from G by ignoring the direction of edges in G. 
Assume that G is connected. If v is a vertex in G, we say 
the parity of v is even if the number of edges of the form 
(v, w) is even; odd parity is defined similarly. Prove that if 
v and w are vertices in G having odd parity, it is possible to 
change the orientation of certain edges in G so that v and w 
have even parity and the parity of all other vertices in G is 
unchanged. 


«54. Show that the maximum number of edges in a simple, discon- 
nected graph with n vertices is (n — 1)(n — 2)/2. 
*55. Show that the maximum number of edges in a simple, bipartite 


graph with n vertices is [n*/4]. 


A vertex v in a connected graph G is an articulation point if the 
removal of v and all edges incident on v disconnects G. 


56. Give an example of a graph with six vertices that has exactly 
two articulation points. 


57. Give an example of a graph with six vertices that has no artic- 
ulation points. 


58. Show that a vertex v in a connected graph G is an articulation 
point if and only if there are vertices w and x in G having the 
property that every path from w to x passes through v. 


Let G be a directed graph and let v be a vertex in G. The indegree 
of v, in(v), is the number of edges of the form (w, v). The outdegree 
of v, out(v), is the number of edges of the form (v, w). A directed 
Euler cycle in G is a sequence of edges of the form 


(vo, V1), (V1, V2), ++ Wn—1s Yn), 


where vo = Vn, every edge in G occurs exactly one time, and all 
vertices appear. 


59. Show that a directed graph G contains a directed Euler cycle 
if and only if the undirected graph obtained by ignoring the 
directions of the edges of G is connected and in(v) = out(v) 
for every vertex v in G. 


Ade Bruijn sequence for n (in 0’s and I’s) is a sequence a,,.. 
of 2" bits having the property that if s is a bit string of length n, for 
some m, 1 < m < 2", 


+5 AQn 


(8.2.2) 


S = Am4m+1°** Gm+n—-1- 
In (8.2.2), we define ay; = aj fori=1,...,2" —1. 


60. Verify that 00011101 is a de Bruijn sequence for n = 3. 


61. Let G be a directed graph with vertices corresponding to all 
bit strings of length n — 1. A directed edge exists from vertex 
X1+++Xp—1 to x2 -++X,. Show that a directed Euler cycle in G 
corresponds to a de Bruijn sequence. 


x62. Show that there is a de Bruijn sequence for everyn = 1, 2,.... 


*63. A closed path is a path from v to v. Show that a connected 
graph G is bipartite if and only if every closed path in G has 
even length. 

64. How many paths of length k > 1 are there in K,,? 


65. Show that there are 
n(n — 1)[(n— 1) — 1] 
n—2 
paths whose lengths are between | and k, inclusive, in Ky, 
n> 2. 


66. Let v and w be distinct vertices in K,,. Let p,, denote the num- 
ber of paths of length m from v to w in K,, 1 <m <n. 


(a) Derive a recurrence relation for py. 
(b) Find an explicit formula for p,». 


67. Let v and w be distinct vertices in K,, n > 2. Show that the 
number of simple paths from v to w is 
n—2 


1 
(n—2)!)> ai 
k=0 ~° 


x68. [Requires calculus] Show that there are |n!e— 1] simple paths 
in K,. (e = 2.71828... is the base of the natural logarithm.) 


69. Let G be a graph. Define a relation R on the set V of vertices 
of G as vRw if there is a path from v to w. Prove that R is an 
equivalence relation on V. 


70. Prove that a connected graph with one or two vertices, each 
of which has even degree, has an Euler cycle. 


Let G be a connected graph. The distance between vertices v and 
w in G, dist(v, w), is the length of a shortest path from v to w. The 
diameter of G is 


d(G) = max{dist(v, w) | v and w are vertices in G}. 


71. Find the diameter of the graph of Figure 8.2.10. 

72. Find the diameter of the n-cube. In the context of parallel 
computation, what is the meaning of this value? 

73. Find the diameter of K,,, the complete graph on n vertices. 

74. Show that the number of paths in the following graph from 


vy to vy of length n is equal to the (n + 1)st Fibonacci 
number fy,+1- 


Problem-Solving Corner 


395 


Problem-Solving Corner: Graphs 


QQ. 


v1 v2 


75. Let G be a simple graph with n vertices in which every vertex 
has degree k and 
n—3 
2 


n-1l 


k> ifn mod 4 = 1 


k> 
—~ 2 
Show that G is connected. 


ifn mod 4 4 1. 


A cycle in a simple directed graph [i.e., a directed graph in which 
there is at most one edge of the form (v, w) and no edges of the 
form (v, v)] is a sequence of three or more vertices 


(V0, Vis ++ +5 Vn) 


in which (vj, vi) is an edge fori = 1,..., n and vo = Vy. A 
directed acyclic graph (dag) is a simple directed graph with no 
cycles. 


76. Show that a dag has at least one vertex with no out edges [i.e., 
there is at least one vertex v such that there are no edges of the 
form (v, w)]. 

77. Show that the maximum number of edges in an n-vertex dag 
is n(n — 1)/2. 

78. An independent set in a graph G is a subset S of the vertices of 
G having the property that no two vertices in S are adjacent. 
(Note that @ is an independent set for any graph.) Prove the 
following result due to [Prodinger]. 

Let P,, be the graph that is a simple path with n vertices. 
Prove that the number of independent sets in P,, is equal to 
Snt2,n = 1,2,..., where {f,} is the Fibonacci sequence. 

79. Let G be a graph. Suppose that for every pair of distinct ver- 
tices vy and v2 in G, there is a unique vertex w in G such that 
v, and w are adjacent and v2 and w are adjacent. 

(a) Prove that if v and w are nonadjacent vertices in G, then 
d(v) = 6(w). 

(b) Prove that if there is a vertex of degree k > 1 and no 
vertex is adjacent to all other vertices, then the degree of 
every vertex is k. 


Problem 


Is it possible in a department of 25 persons, racked by 
dissension, for each person to get along with exactly 
five others? 


Attacking the Problem 


Where do we start? Since this problem is in Chap- 
ter 8, which deals with graphs, it would probably be 


a good idea to try to model the problem as a graph. 
If this problem were not associated with a particu- 
lar section or chapter in the book, we might try several 
approaches—one of which might be to model the prob- 
lem as a graph. Many discrete problems can be solved 
by modeling them using graphs. This is not to say that 
this is the only approach possible. Most of the time 
by taking different approaches, we can solve a single 
problem in many ways. (A nice example is [Wagon].) 
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Finding a Solution and each vertex has degree 5. Before reading on, try to 
determine whether this is possible. 

Corollary 8.2.22 says that the number of vertices 
of odd degree is even. We have a contradiction because 
the number of vertices of odd degree is odd. Therefore, 
it is not possible in a department of 25 persons racked 
by dissension for each person to get along with exactly 
five others. 


A fundamental issue in building a graph model is to 
figure out what the graph is—what are the vertices, and 
what are the edges? In this problem, there’s not much 
choice; we have persons and dissension. Let’s try let- 
ting the vertices be the people. It’s very common in a 
graph model for the edges to indicate a relationship be- 
tween the vertices. Here the relationship is “gets along 
with,” so we’ll put an edge between two vertices (peo- Formal Solution 
ple) if they get along. 

Now suppose that each person gets along with 
exactly five others. For example, in the figure that fol- 
lows, which shows part of our graph, Jeremy gets along 
with Samantha, Alexandra, Lance, Bret, and Tiffany, 
and no others. 


No. It is not possible in a department of 25 persons 
racked by dissension for each person to get along with 
exactly five others. Suppose by way of contradiction 
that it is possible. Consider a graph where the vertices 
are the persons and an edge connects two vertices (peo- 
ple) if the people get along. Since every vertex has odd 
Samantha degree, the number of vertices of odd degree is odd, 
which is a contradiction. 


Summary of Problem-Solving Techniques 


Jeremy Lance = Many discrete problems can be solved by mod- 
eling them using graphs. 


Alexandra 


Bret 
= To build a graph model, determine what the ver- 
Tiffany tices represent and what the edges represent. 


= It’s very common in a graph model for the edges 
It follows that the degree of every vertex is 5. Now to indicate a relationship between the vertices. 
let’s take stock of the situation: We have 25 vertices 


8.3 Hamiltonian Cycles and the Traveling 
Salesperson Problem 


Sir William Rowan Hamilton marketed a puzzle in the mid-1800s in the form of a 
dodecahedron (see Figure 8.3.1). Each corner bore the name of a city and the problem 


Go Online was to Start at any city, travel along the edges, visit each city exactly one time, and return 
For more on to the initial city. The graph of the edges of the dodecahedron is given in Figure 8.3.2. 
Hamilton, Hamiltonian We can solve Hamilton’s puzzle if we can find a cycle in the graph of Figure 8.3.2 
cycles, and the traveling that contains each vertex exactly once (except for the starting and ending vertex that 
salesperson problem, see appears twice). See if you can find a solution before looking at a solution given in 
goo.gl/jfjmol Figure 8.3.3. 


In honor of Hamilton, we call a cycle in a graph G that contains each vertex in G 
exactly once, except for the starting and ending vertex that appears twice, a Hamiltonian 


cycle. 
; Hamilton (1805-1865) was one of Ireland’s greatest scholars. He was professor 
Go Online of astronomy at the University of Dublin, where he published articles in physics and 
For a C program that mathematics. In mathematics, Hamilton is most famous for inventing the quaternions, a 


determines whether a graph _ generalization of the complex number system. The quaternions provided inspiration for 
has a Hamiltonian cycle, see the development of modern abstract algebra. In this connection, Hamilton introduced 
goo.g1/BApR4h the term vector. 
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a b a b 


d d 
Figure 8.3.1 Hamilton’s Figure 8.3.2 The graph Figure 8.3.3 Visiting each vertex 
puzzle. of Hamilton’s puzzle. once in the graph of Figure 8.3.2. 


Example 8.3.1 


Figure 8.3.4 A graph with 
a Hamiltonian cycle. 


Example 8.3.2 


U1 


U3 


Figure 8.3.5 A graph with 
no Hamiltonian cycle. 


ts} 
al 
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Figure 8.3.6 A graph 
with a Hamiltonian cycle. 


v2 


The cycle (a, b, c, d, e, f, g, a) is a Hamiltonian cycle for the graph of Figure 8.3.4. <4 


The problem of finding a Hamiltonian cycle in a graph sounds similar to the prob- 
lem of finding an Euler cycle in a graph. An Euler cycle visits each edge once, whereas 
a Hamiltonian cycle visits each vertex once; however, the problems are actually quite 
distinct. For example, the graph G of Figure 8.3.4 does not have an Euler cycle since 
there are vertices of odd degree, yet Example 8.3.1 showed that G has a Hamiltonian cy- 
cle. Furthermore, unlike the situation for Euler cycles (see Theorems 8.2.17 and 8.2.18), 
no easily verified necessary and sufficient conditions are known for the existence of a 
Hamiltonian cycle in a graph. 

The following examples show that sometimes we can argue that a graph does not 
contain a Hamiltonian cycle. 


Show that the graph of Figure 8.3.5 does not contain a Hamiltonian cycle. 


SOLUTION Since there are five vertices, a Hamiltonian cycle must have five edges. 
Suppose that we could eliminate edges from the graph, leaving just a Hamiltonian cycle. 
We would have to eliminate one edge incident at vz and one edge incident at v4, since 
each vertex in a Hamiltonian cycle has degree 2. But this leaves only four edges—not 
enough for a Hamiltonian cycle of length 5. Therefore, the graph of Figure 8.3.5 does 
not contain a Hamiltonian cycle. < 


We must be careful not to count an eliminated edge more than once when using an 
argument like that in Example 8.3.2 to show that a graph does not have a Hamiltonian 
cycle. Notice in Example 8.3.2 (which refers to Figure 8.3.5) that if we eliminate one 
edge incident at v2 and one edge incident at v4, these edges are distinct. Therefore, we 
are correct in reasoning that we must eliminate two edges from the graph of Figure 8.3.5 
to produce a Hamiltonian cycle. 

As an example of double counting, consider the following faulty argument that 
purports to show that the graph of Figure 8.3.6 has no Hamiltonian cycle. Since there 
are five vertices, a Hamiltonian cycle must have five edges. Suppose that we could elim- 
inate edges from the graph to produce a Hamiltonian cycle. We would have to eliminate 
two edges incident at c and one edge incident at each of a, b, d, and e. This leaves two 
edges—not enough for a Hamiltonian cycle. Therefore, the graph of Figure 8.3.6 does 
not contain a Hamiltonian cycle. The error in this argument is that if we eliminate two 
edges incident at c (as we must do), we also eliminate edges incident at two of a, b, 
d, or e. We must not count the two eliminated edges incident at the two vertices again. 
Notice that the graph of Figure 8.3.6 does have a Hamiltonian cycle. 
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Example 8.3.3 


d 3 c 


Figure 8.3.8 A graph for 
the traveling salesperson 
problem. 


Example 8.3.4 


Example 8.3.5 


Go Online 

For more on Gray 
codes, see 
goo.gl/jfjmol 


Figure 8.3.9 The ring 
model for parallel 
computation. 


Show that the graph G of Figure 8.3.7 does not contain a Hamiltonian cycle. 


a TX c 


LAS. 


Figure 8.3.7 A graph 
with no Hamiltonian 
cycle. 


SOLUTION Suppose that G has a Hamiltonian cycle H. The edges (a, b), (a, g), (b,c), 
and (c, k) must be in H since each vertex in a Hamiltonian cycle has degree 2. Thus edges 
(b, d) and (b, f) are not in H. Therefore, edges (g, d), (d, e), (e, f), and (f, k) are in H. 
The edges now known to be in H form a cycle C. Adding an additional edge to C will 
give some vertex in H degree greater than 2. This contradiction shows that G does not 
have a Hamiltonian cycle. < 


The traveling salesperson problem is related to the problem of finding a 
Hamiltonian cycle in a graph. (We referred briefly to a variant of the traveling salesperson 
problem in Section 8.1.) The problem is: Given a weighted graph G, find a minimum- 
length Hamiltonian cycle in G. If we think of the vertices in a weighted graph as cities 
and the edge weights as distances, the traveling salesperson problem is to find a shortest 
route in which the salesperson can visit each city one time, starting and ending at the 
same city. 


The cycle C = (a,b,c, d, a) is a Hamiltonian cycle for the graph G of Figure 8.3.8. 
Replacing any of the edges in C by either of the edges labeled 11 would increase the 
length of C; thus C is a minimum-length Hamiltonian cycle for G. Thus C solves the 
traveling salesperson problem for G. < 


We next look at Hamiltonian cycles in the n-cube. 


Gray Codes and Hamiltonian Cycles in the n-Cube Consider a ring model for par- 
allel computation that, when represented as a graph, is a simple cycle (see Figure 8.3.9). 
The vertices represent processors. An edge between processors p and gq indicates that 
p and g can communicate directly with one another. We see that each processor can 
communicate directly with exactly two other processors. Nonadjacent processors com- 
municate by sending messages. 

The n-cube (see Example 8.1.7) is another model for parallel computation. The n- 
cube has a greater degree of connectivity among its processors. We consider the question 
of when an n-cube can simulate a ring model with 2” processors. In graph terminology, 
we are asking when the n-cube contains a simple cycle with 2” vertices as a subgraph or, 
since the n-cube has 2” processors, when the n-cube contains a Hamiltonian cycle. [We 
leave to the exercises the question of when an n-cube can simulate a ring model with an 
arbitrary number of processors (see Exercise 18).] 

We first observe that if the n-cube contains a Hamiltonian cycle, we must have 
n > 2 since the 1-cube has no cycles at all. 


Theorem 8.3.6 


Corollary 8.3.7 
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Recall (see Example 8.1.7) that we may label the vertices of the n-cube 0, 1,..., 
2” — 1 in such a way that an edge connects two vertices if and only if the binary rep- 
resentation of their labels differs in exactly one bit. Thus the n-cube has a Hamiltonian 
cycle if and only if n > 2 and there is a sequence, 


S1,82,..., Son (8.3.1) 


where each 5; is a string of n bits, satisfying: 


m Every n-bit string appears somewhere in the sequence. 
@ s; and 5; differ in exactly one bit,i=1,...,2" — 1. 
H so and s; differ in exactly one bit. 


A sequence (8.3.1) is called a Gray code. When n > 2, a Gray code (8.3.1) corre- 
sponds to the Hamiltonian cycle s;, 52,..., So», 5; since every vertex appears and the 
edges (s;, Si41),i= 1,...,2” — 1, and (sm, 51) are distinct. When n = 1, the Gray code 
0, 1 corresponds to the path (0, 1, 0), which is not a cycle because the edge (0, 1) is 
repeated. 

Gray codes have been extensively studied in other contexts. For example, Gray 
codes have been used in converting analog information to digital form (see [Deo]). We 
show how to construct a Gray code for each positive integer n, thus proving that the 
n-cube has a Hamiltonian cycle for every positive integer n > 2. < 


Let G, denote the sequence 0, 1. We define G, in terms of G,,_; by the following 
rules: 


(a) Let G%_, denote the sequence G,_, written in reverse. 


(b) Let G’,_, denote the sequence obtained by prefixing each member of G,_1 
with 0. 


(c) Let G”_, denote the sequence obtained by prefixing each member of Ge 
with 1. 


(d) Let G,, be the sequence consisting of G’,_, followed by G7_,. 


Then G,, is a Gray code for every positive integer n. 


Proof We prove the theorem by induction on n. 


Basis Step (n = 1) 


Since the sequence 0, 1 is a Gray code, the theorem is true when n is 1. 


Inductive Step 


Assume that G,_; is a Gray code. Each string in G/,_, begins with 0, so any difference 
between consecutive strings must result from differing bits in the corresponding strings 
in G,_;. But since G,_; is a Gray code, each consecutive pair of strings in G,_, differs 
in exactly one bit. Therefore, each consecutive pair of strings in G/,_, differs in exactly 
one bit. Similarly, each consecutive pair of strings in G’’ 


| differs in exactly one bit. 
Let a denote the last string in G/,_,, and let 6 denote the first string in G”_,. If we 


n—1? n—-l|* 
delete the first bit from @ and the first bit from £, the resulting strings are identical. Since 
the first bit in w is 0 and the first bit in is 1, the last string in G’,_, and the first string 
in G’’_, differ in exactly one bit. Similarly, the first string in G/_, and the last string in 


n—1 n—1 
G'’_, differ in exactly one bit. Therefore, G, is a Gray code. < 


The n-cube has a Hamiltonian cycle for every positive integer n > 2. 
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Example 8.3.8 We use Theorem 8.3.6 to construct the Gray code G3 beginning with G,. 


Example 8.3.9 
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Figure 8.3.10 The 
knight’s legal moves 
in chess. 
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4 x 4 chessboard and 
the graph GK4. 
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We next examine a problem that goes back some 200 years. 


The Knight's Tour In chess, the knight’s move consists of moving two squares hori- 
zontally or vertically and then moving one square in the perpendicular direction. For 
example, in Figure 8.3.10 a knight on the square marked K can move to any of the squares 
marked X. A knight’s tour of an x n board begins at some square, visits each square 
exactly once making legal moves, and returns to the initial square. The problem is to 
determine for which n a knight’s tour exists. 

We can use a graph to model this problem. We let the squares of the board, al- 
ternately colored black and white in the usual way, be the vertices of the graph, and 
we place an edge between two vertices if the corresponding squares on the board rep- 
resent a legal move for the knight (see Figure 8.3.11). We denote the graph as GK,,. 
Then there is a knight’s tour on the n x n board if and only if GK, has a Hamiltonian 
cycle. 

We show that if GK, has a Hamiltonian cycle, n is even. To see this, note that GK, 
is bipartite. We can partition the vertices into sets V|, those corresponding to the white 
squares, and V>, those corresponding to the black squares; each edge is incident on a 
vertex in V; and V2. Since any cycle must alternate between a vertex in V; and one in 
V2, any cycle in GK, must have even length. But since a Hamiltonian cycle must visit 
each vertex exactly once, a Hamiltonian cycle in GK, must have length n?. Thus n must 
be even. 

In view of the preceding result, the smallest possible board that might have a 
knight’s tour is the 2 x 2 board, but it does not have a knight’s tour because the board is so 
small the knight has no legal moves. The next smallest board that might have a knight’s 
tour is the 4 x 4 board, although, as we shall show, it too does not have a knight’s 
tour. 

We argue by contradiction to show that GK4 does not have a Hamiltonian cycle. 
Suppose that GK, has a Hamiltonian cycle C = (1, v2,..., 17). We assume that v; 
corresponds to the upper-left square. We call the eight squares across the top and bottom 
outside squares, and we call the other eight squares inside squares. Notice that the knight 
must arrive at an outside square from an inside square and that the knight must move from 
an outside square to an inside square. Thus in the cycle C, each vertex corresponding to 
an outside square must be preceded and followed by a vertex corresponding to an inside 
square. Since there are equal numbers of outside and inside squares, vertices v; where i 
is odd correspond to outside squares, and vertices v; where i is even correspond to inside 
squares. But looking at the moves the knight makes, we see that vertices v; where 7 is odd 
correspond to white squares, and vertices v; where 7 is even correspond to black squares. 
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Therefore, the only outside squares visited are white and the only inside squares visited 
are black. Thus C is not a Hamiltonian cycle. This contradiction completes the proof that 
GK, has no Hamiltonian cycle. This argument was given by Louis Pésa when he was a 
teenager. 

The graph Gk, has a Hamiltonian cycle. This fact can be proved by simply 
exhibiting one (see Exercise 21). It can be shown using elementary methods that GK,, has 
a Hamiltonian cycle for all even n > 6 (see [Schwenk]). The proof explicitly constructs 
Hamiltonian cycles for certain smaller boards and then pastes smaller boards together 
to obtain Hamiltonian cycles for the larger boards. < 


Although there are algorithms (see e.g., [Even, 1979]) for finding an Euler cycle 
in an n-edge graph if there is one in time ©(), there are no known polynomial-time 
algorithms for finding a Hamiltonian cycle in a graph. Indeed, it is known that the 
Hamiltonian cycle problem is NP-complete (see Section 4.3), which means that if some- 
one discovers a polynomial-time algorithm to solve the Hamiltonian-cycle problem, 
all NP-complete problems will then have polynomial-time algorithms. The traveling 
salesperson problem is also NP-complete. 

Because of the lack of a polynomial-time algorithm, other approaches to the trav- 
eling salesperson problem have been proposed, for example, finding an approximate 
solution (i.e., a near-minimum-length Hamiltonian cycle). Other approaches to finding 
a Hamiltonian cycle have also been suggested. We discuss a randomized algorithm (see 
Section 4.2) that searches for a Hamiltonian cycle. 

Our randomized algorithm is a version of an algorithm due to Pésa (see [Poésa]). 
The idea is to build a Hamiltonian cycle vertex by vertex. We assume that our graph G 
is a simple graph with n vertices. We begin with an arbitrary vertex, which we call v;. 
Our initial path consists of the vertex v, and no edges. At each successive step, we try to 
extend the current path by adding a new edge and vertex to the end of the current path. 
Thus, at the first step, we select a random, adjacent vertex of v,, which we call v2. Our 
path is now (11, v2). At the ith step, we have the path (v1, v2, ..., vi). If the path contains 
all of the vertices of G, that is, if i = n, we check whether v,, is adjacent to v,. If so, 
we have found a Hamiltonian cycle, namely, (v1, v2,..., Vn, V1). In this case, we return 
true. If i 4 n, we select a random, adjacent vertex of v;, which is not on the path, and 
continue with the new path (v,, v2, ..., vj, vi+1). If all of the vertices adjacent to v; are 
already on the path, we randomly select one of them different from v;_1, which we call 
v;, and change the path by reversing the order of the vertices following v;; that is, the 
path (v1, v2,.-., Vj, Viti, ---», Vi) becomes (V1, V2,..., Vj, Vi, Vi-1, -- +» Vj41) (See Figure 
8.3.12). If the only vertex on the path adjacent to v; is v;_;, the preceding adjustment of 
path would not change the path. In this case, we have found a vertex of degree 1. Since 
there is no Hamiltonian cycle, we return false. 


Vind Vind 


(a) (b) 


Figure 8.3.12 Given a path 

(V1,-.-5 Vj, Vj41, +--+, Vi), aS Shown in (a), where 
v; is adjacent to v;, we change the path to 
(V1,..-, Vj, Vis +--+, Vje1), aS Shown in (b). 
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Algorithm 8.3.10 


Randomized Hamiltonian Cycle 


This algorithm takes as an input a simple graph G = (V, E) and searches for a Hamil- 
tonian cycle. If the algorithm returns true, it has found a Hamiltonian cycle. If the 
algorithm returns false, there is no Hamiltonian cycle (in fact, there is a vertex of 
degree 1). If the algorithm does not terminate, there may or may not be a Hamilto- 
nian cycle. If v is a vertex, N(v) is the set of vertices adjacent to v. 


Input: A simple graph G = (V, E) with n vertices 


Output: If the algorithm terminates, it returns true if it finds a Hamiltonian 
cycle and false otherwise. 


randomized_hamiltonian_ cycle (G, n) { 
if (n == 1 V n == 2) // trivial cases 
return false 
vy, = random vertex in G 
tll 
while (i> =nv vy, ¢ N(v;)) { 
N = N(vj) = {v1, e Soren vi-1} // if iis iF {v1, nos vi-1} is @ 
// N contains the vertices adjacent to v; (the current last vertex 
// of the path) that are not already on the path 
if (NV #2){ 
i=i+l 
vy; = random vertex in NV 
} 
else if (vj € N(v;) for some j, | <j < i— 1) 
(Wilp coon WE) = (On ooee Bip Bip onon Wen) 
else 
return false 


} 


return true 


If randomized_hamiltonian_cycle returns true, it has indeed found a Hamilto- 
nian cycle, which is stored in variables v,, ..., V,, v,. If the graph input to randomized_ 
hamiltonian_cycle has Hamiltonian cycles, it may fail to find one, and, in this case, it 
will not terminate (see Exercise 25). If there are vertices of degree 1, the algorithm may 
not find one and, in this case, it will not terminate (see Exercise 28). If there is no Hamil- 
tonian cycle and all vertices have degree at least 2, the algorithm will not terminate (see 
Exercise 29). Failure to terminate is common for randomized algorithms. One way to 
guarantee termination is to simply stop the algorithm after some specified number of 
iterations. Experiments with the algorithm show that it is very good at finding Hamil- 
tonian cycles in large graphs, and there are theoretical results proving that it is almost 
always successful for graphs of sufficiently large minimum degree. 


8.3 Problem-Solving Tips 


An Euler cycle starts at a vertex, traverses each edge exactly one time, and returns to the 
initial vertex. Theorems 8.2.17 and 8.2.18 allow us to easily determine whether a graph 
has an Euler cycle: A graph G has an Euler cycle if and only if G is connected and every 
vertex has even degree. 

A Hamiltonian cycle starts at a vertex, visits each vertex exactly one time (except 
for the initial vertex, which is visited twice: at the beginning and end of the cycle), 
and returns to the initial vertex. Unlike Theorems 8.2.17 and 8.2.18, no easily verified 
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necessary and sufficient condition is known for a graph to have a Hamiltonian cycle. If a 
relatively small graph has a Hamiltonian cycle, trial and error will discover one. Ifa graph 
does not have a Hamiltonian cycle, you can sometimes use the fact that a Hamiltonian 
cycle in a graph containing n vertices has length n together with proof by contradiction to 
prove that it does not have a Hamiltonian cycle. Two proof-by-contradiction techniques 
were shown in Section 8.3. In the first, we assume that the graph has a Hamiltonian 
cycle. Certain edges cannot appear in the Hamiltonian cycle: If a graph has a vertex v 
of degree greater than 2, only two edges incident on v can appear in the Hamiltonian 
cycle. We can sometimes obtain a contradiction by showing that so many edges must be 
eliminated that the graph cannot have a Hamiltonian cycle (see Example 8.3.2). 

In the second proof-by-contradiction technique shown in Section 8.3, we again 
assume that the graph with n vertices has a Hamiltonian cycle. We then argue that certain 
edges must be in the Hamiltonian cycle. For example, if a vertex v has degree 2, both 
edges incident on v must be in the cycle. We can sometimes obtain a contradiction by 
showing that edges that must be in the Hamiltonian cycle form a cycle of length less 
than n (see Example 8.3.3). 


8.3 Review Exercises 


1. What is a Hamiltonian cycle? 5. Give an example of a graph that has neither a Hamiltonian 
2. Give an example of a graph that has a Hamiltonian cycle and cycle nor an Euler cycle. Prove that the graph has the speci- 
an Euler cycle. Prove that the graph has the specified proper- fied properties. 
ties. 6. What is the traveling salesperson problem? How is it related 
3. Give an example of a graph that has a Hamiltonian cycle but to the Hamiltonian cycle problem? 
not an Euler cycle. Prove that the graph has the specified prop- 7. What is the ring model for parallel computation? 
erties. 8. What is a Gray code? 
4. Give an example of a graph that does not have a Hamiltonian 9. Explain how to construct a Gray code. 


cycle but does have an Euler cycle. Prove that the graph has 


the specified properties 10. Explain how the randomized Hamiltonian algorithm (Algo- 


rithm 8.3.10) works. 


8.3 Exercises 


Find a Hamiltonian cycle in each graph. Show that none of the graphs contains a Hamiltonian cycle. 


1. 3s 
a 
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Determine whether or not each graph contains a Hamiltonian 
cycle. If there is a Hamiltonian cycle, exhibit it; otherwise, give 
an argument that shows there is no Hamiltonian cycle. 


6. 
17. 
7. 
rs N — 
Pq r 
Ss t 19 
8 20 


9. Give an example of a graph that has an Euler cycle but contains 
no Hamiltonian cycle. 


10. Give an example of a graph that has an Euler cycle that is also 
a Hamiltonian cycle. 

11. Give an example of a graph that has an Euler cycle and a 
Hamiltonian cycle that are not identical. 


*12. For which values of m and n does the graph of Exercise 37, 
Section 8.2, contain a Hamiltonian cycle? 


21. 
22. 


23. 


24. 


25. 


. Modify the graph of Exercise 37, Section 8.2, by inserting an 


edge between the vertex in row i, column 1, and the vertex 
in row i, column m, for i = 1,..., n. Show that the resulting 
graph always has a Hamiltonian cycle. 


. Show that if n > 3, the complete graph on n vertices K,, con- 


tains a Hamiltonian cycle. 


. When does the complete bipartite graph Kj, contain a 


Hamiltonian cycle? 


. Show that the cycle (e, b, a, c, d, e) provides a solution to the 


traveling salesperson problem for the graph shown. 


4 ail 
Vas 


Solve the traveling salesperson problem for the graph shown. 
a 
5 
XIN 
b 


Let m and n be integers satisfying | < m < 2”. Prove that the 
n-cube has a simple cycle of length m if and only if m > 4 and 
m is even. 


Use Theorem 8.3.6 to compute the Gray code G4. 


. Let G be a bipartite graph with disjoint vertex sets V; and V2, 


as in Definition 8.1.11. Show that if G has a Hamiltonian cycle, 
V, and V> have the same number of elements. 


Find a Hamiltonian cycle in GK¢ (see Example 8.3.9). 


Describe a graph model appropriate for solving the following 
problem: Can the permutations of {1, 2, ..., } be arranged in 
a sequence so that adjacent permutations 


Pi Pls-++5Pn and gq? y+ ++ An 
satisfy pj # qj fori=1,...,n? 
Solve the problem of Exercise 22 for n = 1, 2,3, 4. (The an- 
swer to the question is “yes” for n > 5; see [Problem 1186] in 
the References.) 
Show that the consecutive labels of the vertices on the unit 
circle in Bain’s depiction of the n-cube (see Exercises 43-45, 
Section 8.1) give a Gray code. 
Show that in the following graph, it is possible for Algorithm 
8.3.10 to fail to find a Hamiltonian cycle even though there is 
one and, in this case, it does not terminate. 


26. 


27. 


28. 


29. 


30. 


31. 


32. 


a b c 
f e d 


Show that in the graph of Exercise 25, if the first two choices 
of vertices for Algorithm 8.3.10 are a and then f, Algorithm 
8.3.10 will always find a Hamiltonian cycle. 


Show that in the following graph, Algorithm 8.3.10 will al- 
ways find a Hamiltonian cycle. 


a b 
d c 


Show that in the following graph, Algorithm 8.3.10 may not 
find the vertex of degree 1, and, in this case, it will not termi- 
nate. 


Prove that if a graph with no Hamiltonian cycle and all vertices 
of degree at least 2 is input to Algorithm 8.3.10, the algorithm 
will not terminate. 


Answer true or false, and prove your response. In Algorithm 
8.3.10, when a vertex is added to the path, it is never removed 
from the path. 


Does Algorithm 8.3.10 always terminate when the input is K;,, 
the complete graph on n vertices? Prove your answer. 


For which n does Algorithm 8.3.10 always find a Hamiltonian 
cycle when the input is K,,, the complete graph on n vertices? 
Prove your answer. 


33. 


34. 


35. 


36. 
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Does Algorithm 8.3.10 always terminate when the input is 
K.n, the complete bipartite graph on m and n vertices? Prove 
your answer. 


Does Algorithm 8.3.10 always find a Hamiltonian cycle if 
there is one when the input is Ky,,, the complete bipartite 
graph on m and n vertices? Prove your answer. 


Is there a graph containing a Hamiltonian cycle for which Al- 
gorithm 8.3.10 always fails; that is, for which whatever the se- 
quence of random guesses made by the algorithm, the Hamil- 
tonian cycle is not found? Prove your answer. 


Suggest ways to improve Algorithm 8.3.10. 


A Hamiltonian path in a graph G is a simple path that contains ev- 
ery vertex in G exactly once. (A Hamiltonian path begins and ends 
at different vertices.) 


a. 


38. 


39. 
40. 
41. 
42. 
43. 
44. 
45. 
46. 
47. 


48. 


49. 


If a graph has a Hamiltonian cycle, must it have a Hamiltonian 
path? Explain. 


If a graph has a Hamiltonian path, must it have a Hamiltonian 
cycle? Explain. 

Does the graph of Figure 8.3.5 have a Hamiltonian path? 
Does the graph of Figure 8.3.7 have a Hamiltonian path? 
Does the graph of Exercise 3 have a Hamiltonian path? 

Does the graph of Exercise 4 have a Hamiltonian path? 

Does the graph of Exercise 5 have a Hamiltonian path? 

Does the graph of Exercise 6 have a Hamiltonian path? 

Does the graph of Exercise 7 have a Hamiltonian path? 

Does the graph of Exercise 8 have a Hamiltonian path? 


For which values of m and n does the graph of Exercise 37, 
Section 8.2, have a Hamiltonian path? 

For which n does the complete graph on n vertices have a 
Hamiltonian path? 


Modify Algorithm 8.3.10 for finding a Hamiltonian cycle to 
search for a Hamiltonian path. 


8.4 A Shortest-Path Algorithm 


Recall (see Section 8.1) that a weighted graph is a graph in which values are assigned to 
the edges and that the length of a path in a weighted graph is the sum of the weights of 
the edges in the path. We let w(i, 7) denote the weight of edge (i, 7). In weighted graphs, 
we often want to find a shortest path (i.e., a path having minimum length) between 
two given vertices. Algorithm 8.4.1, due to E. W. Dijkstra, which efficiently solves this 
problem, is the topic of this section. 

Edsger W. Dijkstra (1930-2002) was born in The Netherlands. He was an early 
proponent of programming as a science. So dedicated to programming was he that when 
he was married in 1957, he listed his profession as a programmer. However, the Dutch 
authorities said that there was no such profession, and he had to change the entry to 
“theoretical physicist.” He won the prestigious Turing Award from the Association for 
Computing Machinery in 1972. He was appointed to the Schlumberger Centennial Chair 
in Computer Science at the University of Texas at Austin in 1984 and retired as Professor 


Emeritus in 1999. 
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Algorithm 8.4.1 


Example 8.4.2 


Throughout this section, G denotes a connected, weighted graph. We assume that 
the weights are positive numbers and that we want to find a shortest path from vertex a 
to vertex z. The assumption that G is connected can be dropped (see Exercise 9). 

Dijkstra’s algorithm involves assigning labels to vertices. We let L(v) denote the 
label of vertex vy. At any point, some vertices have temporary labels and the rest have 
permanent labels. We let T denote the set of vertices having temporary labels. In illus- 
trating the algorithm, we will circle vertices having permanent labels. We will show later 
that if L(v) is the permanent label of vertex v, then L(y) is the length of a shortest path 
from a to v. Initially, all vertices have temporary labels. Each iteration of the algorithm 
changes the status of one label from temporary to permanent; thus we may terminate 
the algorithm when z receives a permanent label. At this point L(z) gives the length of a 
shortest path from a to z. 


Dijkstra's Shortest-Path Algorithm 


This algorithm finds the length of a shortest path from vertex a to vertex z in a con- 
nected, weighted graph. The weight of edge (i, j) is w(i, 7) > O and the label of vertex 
x is L(x). At termination, L(z) is the length of a shortest path from a to z. 


Input: A connected, weighted graph in which all weights are positive; ver- 
tices a and z 


Output: L(z), the length of a shortest path from a to z 


dijkstra(w, a, z, L) { 
(Ga) —0) 
for all vertices x 4a 
IWC?) ies 
T = set of all vertices 
// T is the set of vertices whose shortest distance from a has 
// not been found 
while (z € T) { 
choose v € T with minimum L(v) 
T =T — {v} 
for each x € T adjacent to v 
L(x) = min{L(x), L(v) + wv, x)} 


eee ee 
Be ie Set een ON es) a oe) 


We show how Algorithm 8.4.1 finds a shortest path from a to z in the graph of 
Figure 8.4.1. (The vertices in T are uncircled and have temporary labels. The circled 
vertices have permanent labels.) Figure 8.4.2 shows the result of executing lines 2-5. 
At line 8, z is not circled. We proceed to line 9, where we select vertex a, the uncircled 
vertex with the smallest label, and circle it (see Figure 8.4.3). At lines 11 and 12 we 
update each of the uncircled vertices, b and f, adjacent to a. We obtain the new labels 


L(b) = min{oo, 0+ 2} = 2, L(f) = min{oo,0+ 1} =1 


(see Figure 8.4.3). At this point, we return to line 8. 

Since z is not circled, we proceed to line 9, where we select vertex f, the uncircled 
vertex with the smallest label, and circle it (see Figure 8.4.4). At lines 11 and 12 we 
update each label of the uncircled vertices, d and g, adjacent to f. We obtain the labels 
shown in Figure 8.4.4. 


Theorem 8.4.3 
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f 5 ¢g 


Figure 8.4.1 The graph for Figure 8.4.2 Initialization in Dijkstra’s 
Example 8.4.2. shortest-path algorithm. 


Figure 8.4.3 The first iteration of Figure 8.4.4 The second iteration of 
Dijkstra’s shortest-path algorithm. Dijkstra’s shortest-path algorithm. 


You should verify that the next iteration of the algorithm produces the labeling 
shown in Figure 8.4.5 and that at the termination of the algorithm, z is labeled 5, in- 
dicating that the length of a shortest path from a to z is 5. A shortest path is given by 
(a, b, c, 2). 


Figure 8.4.5 The third iteration of 
Dijkstra’s shortest-path algorithm. < 


We next show that Algorithm 8.4.1 is correct. The proof hinges on the fact that 
Dijkstra’s algorithm finds the lengths of shortest paths from a in nondecreasing order. 


Dijkstra’s shortest-path algorithm (Algorithm 8.4.1) correctly finds the length of a 
shortest path from a to z. 


Proof We use mathematical induction on i to prove that the ith time we arrive at line 
9, L(y) is the length of a shortest path from a to v. When this is proved, correctness of the 
algorithm follows since when z is chosen at line 9, L(z) will give the length of a shortest 
path from a to z. 
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Basis Step (i = 1) 

The first time we arrive at line 9, because of the initialization steps (lines 2-4), L(a) is 
zero and all other L-values are oo. Thus a is chosen the first time we arrive at line 9. 
Since L(a) is zero, L(a) is the length of a shortest path from a to a. 


Inductive Step 


Assume that for all k < i, the kth time we arrive at line 9, L(v) is the length of a shortest 
path from a to v. 

Suppose that we are at line 9 for the ith time and we choose v in T with minimum 
value L(y). 

First we show that if there is a path from a to a vertex w whose length is less than 
L(y), then w is not in T (i.e., w was previously selected at line 9). Suppose by way of 
contradiction that w is in T. Let P be a shortest path from a to w, let x be the vertex 
nearest a on P that is in T, and let u be the predecessor of x on P (see Figure 8.4.6). Then 
u is not in T, so u was chosen at line 9 during a previous iteration of the while loop. By 
the inductive assumption, L(u) is the length of a shortest path from a to u. Now 


L(x) < Lu) + wtu, x) < length of P < L(y). 


But this inequality shows that v is not the vertex in T with minimum L(y) [L(x) is 
smaller]. This contradiction completes the proof that if there is a path from a to a vertex 
w whose length is less than L(v), then w is not in T. 


Figure 8.4.6 The proof of Theorem 8.4.3. 

P is a shortest path from a to w, x is the vertex 
nearest a on P that is in T, and wu is the 
predecessor of x on P. 


The preceding result shows, in particular, that if there were a path from a to v 
whose length is less than L(v), v would already have been selected at line 9 and removed 
from T. Therefore, every path from a to v has length at least L(v). By construction, there 
is a path from a to v of length L(y), so this is a shortest path from a to v. The proof is 
complete. < 


Algorithm 8.4.1 finds the length of a shortest path from a to z. In most applications, 
we would also want to identify a shortest path. A slight modification of Algorithm 8.4.1 
allows us to find a shortest path. 


Find a shortest path from a to z and its length for the graph of Figure 8.4.7. 


SOLUTION We will apply Algorithm 8.4.1 with a slight modification. In addition to 
circling a vertex, we will also label it with the name of the vertex from which it was 
labeled. 

Figure 8.4.7 shows the result of executing lines 2-4 of Algorithm 8.4.1. First, we 
circle a (see Figure 8.4.8). Next, we label the vertices b and d adjacent to a. Vertex b 
is labeled “a, 2” to indicate its value and the fact that it was labeled from a. Similarly, 
vertex d is labeled “a, 1.” 
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Next, we circle vertex d and update the label of the vertex e adjacent to d (see 
Figure 8.4.9). Then we circle vertex b and update the labels of vertices c and e (see Fig- 
ure 8.4.10). Next, we circle vertex e and update the label of vertex z (see Figure 8.4.11). 


Figure 8.4.7 Initialization in Dijkstra’s Figure 8.4.8 The first iteration of 
shortest-path algorithm. Dijkstra’s shortest-path algorithm. 


b 
3 c 
C. 


1 
d 
Figure 8.4.9 The second iteration of Figure 8.4.10 The third iteration of 
Dijkstra’s shortest-path algorithm. Dijkstra’s shortest-path algorithm. 
b 3 re 


= 
i ) 


d e 


Figure 8.4.11 The conclusion of Dijkstra’s 
shortest-path algorithm. 


At this point, we may circle z, so the algorithm terminates. The length of a shortest path 
from a to z is 4. Starting at z, we can retrace the labels to find the shortest path 


(a, d, e, z). < 


Our next theorem shows that Dijkstra’s algorithm is @(n7) in the worst case. 


Theorem 8.4.5 For input consisting of an n-vertex, simple, connected, weighted graph, Dijkstra’s 
algorithm (Algorithm 8.4.1) has worst-case run time O(n?). 


Proof We consider the time spent in the loops, which provides an upper bound on 
the total time. Line 4 is executed O(n) times. Within the while loop, line 9 takes time 
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O(n) [we could find the minimum L(v) by examining all the vertices in T]. The body 
of the for loop (line 12) takes time O(n). Since lines 9 and 12 are nested within a while 
loop, which takes time O(n), the total time for lines 9 and 12 is O(n*). Thus Dijkstra’s 
algorithm runs in time O(n’). 

In fact, for an appropriate choice of z, the time is Q(n’) for K,,, the complete graph 
on n vertices, because every vertex is adjacent to every other. Thus the worst-case run 
time is Q(n”). < 


Any shortest-path algorithm that receives as input K,, the complete graph on n 
vertices, must examine all of the edges of K,, at least once. Since K,, has n(n — 1)/2 
edges (see Exercise 16, Section 8.1), its worst-case run time must be at least n(n—1)/2 = 
Q(n2). It follows from Theorem 8.4.5 that Algorithm 8.4.1 is optimal. 


8.4 Review Exercises 


1. Describe Dijkstra’s shortest-path algorithm. 3. Prove that Dijkstra’s shortest-path algorithm correctly finds a 


hortest path. 
2. Give an example to show how Dijkstra’s shortest-path algo- ee 


rithm finds a shortest path. 


8.4 Exercises 


In Exercises 1-5, find the length of a shortest path and a shortest 
path between each pair of vertices in the weighted graph. 


10. True or false? When a connected, weighted graph and vertices 
a and z are input to the following algorithm, it returns the 
length of a shortest path from a to z. If the algorithm is correct, 
prove it; otherwise, give an example of a connected, weighted 
graph and vertices a and z for which it fails. 


Algorithm 8.4.6 


algor(w, a, z) { 
length = 0 


1. a,f 2. a, g 3. a,Z ae : 
4. b,j 5. hd T = set of all vertices 


while (v7 = z) { 
6. Write an algorithm that finds the length of a shortest path be- 


T =T — {v} 
tween two given vertices in a connected, weighted graph and choose x € T with minimum w(v, x) 
also finds a shortest path. 


length = length + w(v, x) 


7. Write an algorithm that finds the lengths of the shortest paths v=x 
from a given vertex to every other vertex in a connected, } 
weighted graph G. return length 


x8. Write an algorithm that finds the lengths of the shortest paths } 
between all vertex pairs in a simple, connected, weighted 


. . . . 3 
graph having n vertices in time O(n"). 11. True or false? Algorithm 8.4.1 finds the length of a shortest 


. Modify Algorithm 8.4.1 so that it accepts a weighted graph 


that is not necessarily connected. At termination, what is L(z) 
if there is no path from a to z? 


path in a connected, weighted graph even if some weights 
are negative. If true, prove it; otherwise, provide a counter- 
example. 


8.5 Representations of Graphs 


In the preceding sections we represented a graph by drawing it. Sometimes, as for ex- 
ample in using a computer to analyze a graph, we need a more formal representation. 
Our first method of representing a graph uses the adjacency matrix. 


Example 8.5.1 


Go Online 

For more on the 
adjacency matrix, see 
goo.gl/jfjmol 


d e 


Figure 8.5.1 The graph for 
Example 8.5.1. 


Example 8.5.2 


Q 
> 


d e 


Figure 8.5.2 The graph 
for Example 8.5.2. 
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Adjacency Matrix Consider the graph of Figure 8.5.1. To obtain the adjacency matrix 
of this graph, we first select an ordering of the vertices, say a, b, c, d, e. Next, we label 
the rows and columns of a matrix with the ordered vertices. The entry in this matrix in 
row i, column j, i € j, is the number of edges incident on i and j. If i = j, the entry is 
twice the number of loops incident on i. The adjacency matrix for this graph is 


abeceode 
a/fO 1001 
byjJ/1 01 0 1 
c1/0 12 0 1 
dqdi0 0 0 0 2 
e\l 1 12 0 < 


Notice that we can obtain the degree of a vertex v in a graph G by summing row v 
or column v in G’s adjacency matrix. 

The adjacency matrix is not a very efficient way to represent a graph. Since the 
matrix is symmetric about the main diagonal (the elements on a line from the upper- 
left corner to the lower-right corner), the information, except that on the main diagonal, 
appears twice. 


The adjacency matrix of the simple graph of Figure 8.5.2 is 


abeceode 

a/fo 1 0 1 0 

bj/1 01 0 1 

cee OF 4 Oe ol 

dj1 01 0 0 
e\O 1 1 0 0 < 


We will show that if A is the adjacency matrix of a simple graph G, the powers 
of A, 


A, A’, A?,..., 

count the number of paths of various lengths. More precisely, if the vertices of G are 
labeled 1, 2, ..., the ij th entry in the matrix A” is equal to the number of paths from i to j 
of length n. For example, suppose that we square the matrix A of Example 8.5.2 to obtain 

abcede 

0 1 0 1 0\ /fO 101 0\ a/202 0 1 

10101 10101 bj/0 3 12 1 

A*=]0 101 14/0 101 1f=ce]2 13 0 1 

1010 0 101 0 0); d{0 202 1 

0 1 1 0 0/ \0 1:1 °0 07) e\1 1 1 1 2 


Consider the entry for row a, column c in A”, obtained by multiplying pairwise the en- 
tries in row a by the entries in column c of the matrix A and summing: 


Cc 
0 
b d 1]b 
af 10 1 OjJ0 =0-04+1-14+0-04+1-14+0-1=2. 
ljd 
1 


The only way a nonzero product appears in this sum is if both entries to be multiplied 
are 1. This happens if there is a vertex v whose entry in row a is | and whose entry in 


412 Chapter8 @ Graph Theory 


Theorem 8.5.3 


a << 
i 

sas k 
Figure 8.5.3 The proof of 
Theorem 8.5.3. A path from i 
to k of length n + 1 whose 
next-to-last vertex is j 
consists of a path of length n 
from i to j followed by edge 
(j, k). If there are s; paths of 
length n from i to j and ¢; is 1 
if edge (j, k) exists and 0 
otherwise, the sum of sjt; 
over all 7 gives the number of 


paths of length n + | from i 
tok. 


column c is 1. In other words, there must be edges of the form (a, v) and (v, c). Such 
edges form a path (a, v, c) of length 2 from a to c, and each path increases the sum by 
1. In this example, the sum is 2 because there are two paths 


(a,b,c), (a,d,c) 


of length 2 from a to c. In general, the entry in row x and column y of the matrix A? is 
the number of paths of length 2 from vertex x to vertex y. 

The entries on the main diagonal of A” give the degrees of the vertices (when the 
graph is a simple graph). Consider, for example, vertex c. The degree of c is 3 since c 
is incident on the three edges (c, b), (c, d), and (c, e). But each of these edges can be 
converted to a path of length 2 from c to c: 


(c,b,c), (c,d,c), (c,e,c). 


Similarly, a path of length 2 from c to c defines an edge incident on c. Thus the number 
of paths of length 2 from c to c is 3, the degree of c. 

We now use induction to show that the entries in the nth power of an adjacency 
matrix give the number of paths of length n. 


If A is the adjacency matrix of a simple graph, the ijth entry of A” is equal to the 
number of paths of length n from vertex i to vertex j,n = 1,2,.... 


Proof We will use induction on n. 

If n = 1, A! is simply A. The ijth entry is 1 if there is an edge from i to j, which 
is a path of length 1, and 0 otherwise. Thus the theorem is true if n = 1. The Basis Step 
has been verified. 

Assume that the theorem is true for n. Now 


Att =A"A 


so that the ikth entry in A"*t! is obtained by multiplying pairwise the elements in the ith 
row of A” by the elements in the kth column of A and summing: 


kth column of A 


ith row of A" (Sj, 82, «+5 Sjy e+e5 Sm) 


tn 
= Sit) + Soto +--- + sjtj +++ +Smtn 
= ikth entry in A"*!. 


By induction, s; gives the number of paths of length n from i to j in the graph G. Now ¢ 
is either O or 1. If ¢; is 0, there is no edge from to k, so there are s;t; = 0 paths of length 
n+ 1 from i to k, where the last edge is (j,k). If ¢; is 1, there is an edge from vertex j 
to vertex k (see Figure 8.5.3). Since there are s; paths of length n from vertex i to vertex 


j, there are s;t; = s; paths of length n + 1 from i to k, where the last edge is (j, k) (see 


Figure 8.5.3). Summing over all j, we will count all paths of length n + 1 from i to k. 
Thus the ikth entry in A+! gives the number of paths of length n+ | from i to k, and the 
Inductive Step is verified. 

By the Principle of Mathematical Induction, the theorem is established. < 


Example 8.5.4 


Example 8.5.5 


Go Online 

For more on the 
incidence matrix, see 
goo.gl/jfjmolI 


V1 e3 v2 U3 
e 
— // ee od 
e4 7 
e7 
U4 V5 


Figure 8.5.4 The graph for 
Example 8.5.5. 
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After Example 8.5.2, we showed that if A is the matrix of the graph of Figure 8.5.2, then 


4 abecede 
b 202 0 1 
— 03 12 «1 
d 213 01 
" 02 02 1 
tL 2. 21. 2 
By multiplying, 
202 0 1 202 0 1 
03 12 «1 0 3 I 2 1 
A‘ =A7A7=12 1 3:0 1442 1 3:0 1], 
02 0 2 1 0202 1 
tft i-1 2 1 1112 
we find that 
- a bc de 
b 9 3 11 1 6 
Mac 3 15 7 11 8 
d ll 7 15 3 8 
7 1 11 3 9 6 


6 8 8 6 8 


The entry from row d, column e is 6, which means that there are six paths of length 4 
from d to e. By inspection, we find them to be 


(d, a, d,c, e), (d,c,d,c, e), (d, a, b,c, e), 
(d, c, e, c, e), (d, c, e, b, e), (d, c, b,c, e). | 


Another useful matrix representation of a graph is known as the incidence matrix. 


Incidence Matrix To obtain the incidence matrix of the graph in Figure 8.5.4, we label 
the rows with the vertices and the columns with the edges (in some arbitrary order). The 
entry for row v and column e is 1 if e is incident on v and 0 otherwise. Thus the incidence 
matrix for the graph of Figure 8.5.4 is 


ej (23) e3 eg es &6 e7 
vy fl 1 1 0 0 0 90 
wfO O tt 1 1 0 1 
3/0 0 0 0 0 1 90 
vwufl 1 0 1 0 0 O 
vy \0 0 0 0 1 1 O 


A column such as e7 is understood to represent a loop. < 


Notice that in a graph without loops each column has two 1’s and that the sum of 
a row gives the degree of the vertex identified with that row. 


8.5 Review Exercises 


1. What is an adjacency matrix? 


3. What is an incidence matrix? 


2. IfA is the adjacency matrix of a simple graph, what are the val- 


ues of the entries in A”? 
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In Exercises 1-6, write the adjacency matrix of each graph. 


i. b 


e xg d 
2. 4 c 
A 3 
x1 d 
4 v4 
b e 
3. 


4. The graph of Figure 8.2.2 


5. The complete bipartite graph K2,3 
6. The complete graph on five vertices Ks5 


In Exercises 7-12, write the incidence matrix of each graph. 
7. The graph of Exercise 1 
9. The graph of Exercise 3 

11. The complete bipartite graph K2,3 

12. The complete graph on five vertices K5 


In Exercises 13-17, draw the graph represented by each adjacency 


matrix. 

13. abede 14. abede 
a/20010 afo0100 
bjo0 0101 b1/0 2012 
clo 1211 c11 0000 
dj/1 0100 d/o 1001 
e \o 1 100 e102 010 

f\1 0100 

15. abcde 16. abcde 

a/fo1000 a/f4 111 0 
10000 bj1 01 1 =41 
c]0 0011 eji 1011 
d}0 0101 dji 1101 
e\o 0112 ejO 111 0 

f \2 0311 


8. The graph of Exercise 2 
10. The graph of Figure 8.2.1 


COrRPrFPWONYN CCOOFRCRKNY 


17. The 7 x 7 matrix whose ij th entry is | if i+ 1 divides j + 1 


18. 


19. 


20. 


21. 


orj + 1 divides i+ 1, i #7; whose jj th entry is 2 if i = j; and 
whose jj th entry is 0 otherwise 


Write the adjacency matrices of the components of the graphs 
given by the adjacency matrices of Exercises 13-17. 


Compute the squares of the adjacency matrices of Ks and the 
graphs of Exercises | and 2. 


Let A be the adjacency matrix for the graph of Exercise 1. 
What is the entry in row a, column d of A>? 


Suppose that a graph has an adjacency matrix of the form 


A’ 
r=(el4) 
Al’ 


where all entries of the submatrices A’ and A” are 0. What must 
the graph look like? 


22. Repeat Exercise 21 with “adjacency” replaced by “incidence.” 


23. Let A be an adjacency matrix of a graph. Why is A” symmetric 


about the main diagonal for every positive integer n? 


In Exercises 24 and 25, draw the graphs represented by the inci- 

dence matrices. 

24. a f/1 0 0 0 0 1\ 25. a /0 100 1 1 
b1}O 1 10 1 0 b{O 1101 0 
e 1/100 1 0 0 c [0 000 0 1 
d}j0 1010 0 dj1 00 10 0 
e\O0 0101 1 e\l 001 0 0 


26. What must a graph look like if some row of its incidence ma- 


trix consists only of 0’s? 


27. Let A be the adjacency matrix of a graph G with n vertices. Let 


Y=A+A74.--.-4A771, 


If some off-diagonal entry in the matrix Y is zero, what can 
you say about the graph G? 


Exercises 28-31 refer to the adjacency matrix A of Ks. 


28. Let n be a positive integer. Explain why all the diagonal 


elements of A” are equal and all the off-diagonal elements of 
A” are equal. 


Let d, be the common value of the diagonal elements of A" and let 


an 


be the common value of the off-diagonal elements of A”. 


*29. Show that 


dn41 =4an;  An41 = dn +3dn;  dn41 = 3dn + 4ap_1. 


*30. Show that 


1 
dy = ela" + (-1)""1]. 


31. Show that 


4 —1 
dn = 5" + (—1)"]. 
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*32. Derive results similar to those of Exercises 29-31 for the ad- «33. Let A be the adjacency matrix of the graph K,,,,. Find a for- 


jacency matrix A of the graph K,,,. 


8.6 


mula for the entries in A’. 


Isomorphisms of Graphs 


Go Online 

For a C program 

that determines 
whether two graphs are 
isomorphic, see 
goo.gl/Ao2qyq 


The following instructions are given to two persons who cannot see each other’s paper: 
“Draw and label five vertices a, b, c, d, and e. Connect a and b, b and c, c and d, d and 
e, and a and e.” The graphs produced are shown in Figure 8.6.1. Surely these figures 
define the same graph even though they appear dissimilar. Such graphs are said to be 
isomorphic. 


G\ 
A 
yi /\ y2 
Cc. 4 D 
3 5 
E B 
G2 
Figure 8.6.1 


Isomorphic graphs. 


Graphs are isomorphic if they are the same as graphs, even though the names of 
the vertices and edges may be different. As drawn, the graphs G; and G2 in Figure 8.6.1 
do not look the same; but, except for the names of the vertices and edges, they are the 
same graph. To put it another way, if we change the name of vertex a in G; to A, and, 
similarly, b to B, c to C, d to D, and e to E; and change the name of edge x; in G, to y; for 
i= | to5, we obtain graph G2. [Edge x;, which was incident on (e, a), becomes edge y;, 
which is now incident on (E, A); edge x2, which was incident on (a, b), becomes edge 
y2, Which is now incident on (A, B); and so on.] In general, in mathematics, computer 
science, biology, chemistry, and so on, structures are “isomorphic” if they are the same 
in form, but are not necessarily identical. 

There are many practical applications of graph isomorphism. Fingerprints can be 
modeled using graphs in which vertices represent local geometric patterns and edges 
represent relations between the patterns (see [Nandi]). In this way, a fingerprint graph 
can be potentially identified by checking it against a database of fingerprint graphs. A 
similar application of graph isomorphism represents chemical and biological structures 
as graphs and compares them to databases of such representations (see [Willett]). This 
latter work is particularly useful in pharmaceutical research. 


Definition 8.6.1 > Graphs G; and G) are isomorphic if there is a one-to-one, 
onto function f from the vertices of G; to the vertices of Gz and a one-to-one, onto 
function g from the edges of G,; to the edges of G2, so that an edge e is incident on 
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Example 8.6.3 


vand w in G; if and only if the edge g(e) is incident on f(v) and f(w) in G2. The pair of 
functions f and g is called an isomorphism of G; onto G2. < 


An isomorphism for the graphs G; and G2 of Figure 8.6.1 is defined by 


f@=A, fO%)=8. fO=C, FOM=D, fO=E, 
eQprae vj. PH Agee ds 


We can think of functions f and g as “renaming functions.” < 


If we define a relation R on a set of graphs by the rule Gj RG» if G; and Gz are 
isomorphic, R is an equivalence relation. Each equivalence class consists of a set of 
mutually isomorphic graphs. 


The Mesh Model for Parallel Computation Previously, we considered the problem 
of when the n-cube could simulate a ring model for parallel computation (see Example 
8.3.5). We now consider when the n-cube can simulate the mesh model for parallel 
computation. 

The two-dimensional mesh model for parallel computation when described as a 
graph consists of a rectangular array of vertices connected as shown (see Figure 8.6.2). 
The problem “When can an n-cube simulate a two-dimensional mesh?” can be rephrased 
in graph terminology as “When does an n-cube contain a subgraph isomorphic to a two- 
dimensional mesh?” We show that if M is a mesh p vertices by q vertices, where p < 2! 
and q < 2/, then the (i +/)-cube contains a subgraph isomorphic to M. (In Figure 8.6.2, 
we may take p = 6, q = 4,7 = 3, andj = 2. Thus our result shows that the 5-cube 
contains a subgraph isomorphic to the graph in Figure 8.6.2.) 


10 
ese 
oT 
Lt LT | 
000 001 al 010 110 111 
= 011 


Figure 8.6.2 Mesh model for 
parallel computation. 


Let M be a mesh p vertices by q vertices, where p < 2! and q < 2/. We consider M 
to be a rectangular array in ordinary 2-space with p vertices in the horizontal direction 
and q vertices in the vertical direction (see Figure 8.6.2). As coordinates for the vertices, 
we use elements of Gray codes. (Gray codes are discussed in Example 8.3.5.) The co- 
ordinates in the horizontal direction are the first p members of an i-bit Gray code, and 
the coordinates in the vertical direction are the first g members of a j-bit Gray code (see 
Figure 8.6.2). If a vertex v is in the mesh, we let v, denote the horizontal coordinate of v 
and v, denote the vertical coordinate of v. We then define a function f on the vertices of 
M by f(v) = vyvy. (The string v,Vvy is the string v, followed by the string vy.) Notice that 
f is one-to-one. 

If (v, w) is an edge in M, the bit strings v,vy and w,wy differ in exactly one bit. 
Thus (v,vy, Wy) is an edge in the (i + j)-cube. We define a function g on the edges of 


Theorem 8.6.4 


Corollary 8.6.5 
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M by g ((v, w)) = (VxVy, WyWy). Notice that g is one-to-one. The pair f, g of functions is 
an isomorphism of M onto the subgraph (V, £) of the (i + j)-cube where 


V = {f() | vis a vertex in M}, E = {g(e) | eis an edge in M}. 


Therefore, if M is a mesh p vertices by g vertices, where p < 2! and g < 2/, the (i+ j)- 
cube contains a subgraph isomorphic to M. 

The argument given extends to an arbitrary number of dimensions (see 
Exercise 12); that is, if M is ap, x p2 x --+ X pg mesh, where p; < 2" fori=1,...,k, 
then the (¢; + % +---+¢)-cube contains a subgraph isomorphic to M. < 


In general, the adjacency matrix of a graph changes when the ordering of its ver- 
tices is changed. We can show that graphs G; and G2 are isomorphic if and only if for 
some ordering of their vertices, their adjacency matrices are equal. 


Graphs G; and G2 are isomorphic if and only if for some ordering of their vertices, 
their adjacency matrices are equal. 


Proof Suppose that G; and G2 are isomorphic. Then there is a one-to-one, onto func- 
tion f from the vertices of G; to the vertices of Gz and a one-to-one, onto function g from 
the edges of G; to the edges of Gz, so that an edge e is incident on v and w if and only 
if the edge g(e) is incident on f(v) and f(w) in Go. 

Let vj,..., Vv, be an ordering of the vertices of G;. Let A; be the adjacency matrix 
of G; relative to the ordering v;,..., v,, and let Az be the adjacency matrix of G2 relative 
to the ordering f(v), ...,f(v,). Suppose that the entry in row 7, column j, i # j, of A, is 
equal to k. Then there are k edges, say e1, ..., ex, incident on v; and v;. Therefore, there 
are exactly k edges g(e), ..., g(e,) incident on f(v;) and f(v;) in Gz. Thus the entry in 
row i, column j in Az, which counts the number of edges incident on f(v;) and f (vj), is 
also equal to k. A similar argument shows that the entries on the diagonals in A; and A 
are also equal. Therefore, Ay = Ao. 

The converse is similar and is left as an exercise (see Exercise 35). < 


Let G; and G2 be simple graphs. The following are equivalent: 


(a) G; and G2 are isomorphic. 


(b) There is a one-to-one, onto function f from the vertex set of G; to the vertex 
set of G» satisfying the following: Vertices v and w are adjacent in G, if and 
only if the vertices f(v) and f(w) are adjacent in Gp. 


Proof It follows immediately from Definition 8.6.1 that (a) implies (b). 

We prove that (b) implies (a). Suppose there is a one-to-one, onto function f from 
the vertex set of G; to the vertex set of G2 satisfying the following: Vertices v and w are 
adjacent in G, if and only if the vertices f(v) and f(w) are adjacent in Gp. 

Let vj, ..., ¥, be an ordering of the vertices of G;. Let A; be the adjacency matrix 
of G;, relative to the ordering v;,..., v,, and let Az be the adjacency matrix of G2 relative 
to the ordering f(v,),...,f/(v,). Since G; and G are simple graphs, the entries in the 
adjacency matrices are either | (to indicate that vertices are adjacent) or 0 (to indicate 
that vertices are not adjacent). Because vertices v and w are adjacent in G, if and only if 
the vertices f(v) and f(w) are adjacent in Gz, it follows that A; = Az. By Theorem 8.6.4, 
G, and G» are isomorphic. < 
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Example 8.6.6 


Example 8.6.7 


The adjacency matrix of graph G; in Figure 8.6.1 relative to the vertex ordering 
a, b,c, d, e, 


abeode 
a/f0O 100 1 
bi1 01 0 0 
c}]O 1 0 1 OF, 
dqdi0 01 0 1 
e\l 00 1 0 


is equal to the adjacency matrix of graph G2 in Figure 8.6.1 relative to the vertex ordering 
A, B, C, D, E, 


A BBCODE 
A/foO 1 0 0 1 
By 1 0 1 0 0 
c;}o0 1 0 1 =O 
D\|O 0 1 0 1 
E\1 0 0 1 #0 
We see again that G; and G2 are isomorphic. < 


Although the graph isomorphism problem is not known to be NP-complete, there 
are no known worst-case, polynomial-time algorithms to test whether two graphs are 
isomorphic. There are, however, average-case, linear-time algorithms for the graph iso- 
morphism problem (see [Read] and [Babai]). 

The following is one way to show that two simple graphs G; and Gp are not iso- 
morphic. Find a property of G; that Gz does not have but that G2 would have if G, and 
G» were isomorphic. Such a property is called an invariant. More precisely, a property 
P is an invariant if whenever G; and G» are isomorphic graphs: 


If G; has property P, G2 also has property P. 


By Definition 8.6.1, if graphs G; and G2 are isomorphic, there are one-to-one, 
onto functions from the edges (respectively, vertices) of G; to the edges (respectively, 
vertices) of G2. Thus, if G; and G2 are isomorphic, then G; and G2 have the same number 
of edges and the same number of vertices. Therefore, if e and n are nonnegative integers, 
the properties “has e edges” and “has n vertices” are invariants. 


The graphs G; and Gz in Figure 8.6.3 are not isomorphic, since G; has seven edges and 
Gy» has six edges and “has seven edges” is an invariant. 


G, G2 


Figure 8.6.3 Nonisomorphic 
graphs. G, has seven edges and Gz 
has six edges. < 


Example 8.6.8 Show that if k is a positive integer, “has a vertex of degree k” is an invariant. 


Example 8.6.9 


Example 8.6.10 
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SOLUTION Suppose G; and G2 are isomorphic graphs and f (respectively, g) is a 
one-to-one, onto function from the vertices (respectively, edges) of G; onto the vertices 
(respectively, edges) of G2. Suppose that G; has a vertex v of degree k. Then there are 
k edges e;,..., ex incident on v. By Definition 8.6.1, g(e;),..., g(e,) are incident on 
J (v). Because g is one-to-one, d(f(v)) = k. 

Let E be an edge that is incident on f(v) in G2. Since g is onto, there is an edge 
e in G; with g(e) = E. Since g(e) is incident on f(v) in Go, by Definition 8.6.1, e is 


incident on v in Gy. Since e;,..., ex are the only edges in G; incident on v, e = e; for 
some i € {1,...,k}. Now g(e;) = g(e) = E. Thus d(f(v)) = k, so Gp has a vertex, 
namely f(v), of degree k. < 


Since “has a vertex of degree 3” is an invariant, the graphs G; and G2 of Figure 8.6.4 
are not isomorphic; G, has vertices (a and f) of degree 3, but Gz does not have a vertex 
of degree 3. Notice that G; and G2 have the same numbers of edges and vertices. 

h 


l 
G, G2 


Figure 8.6.4 Nonisomorphic 
graphs. G, has vertices of degree 3, 
but G» has no vertices of degree 3. 


< 


Another invariant that is sometimes useful is “has a simple cycle of length k.”’ We 
leave the proof that this property is an invariant to the exercises (Exercise 22). 


Since “has a simple cycle of length 3” is an invariant, the graphs G; and G» of 
Figure 8.6.5 are not isomorphic; the graph G2 has a simple cycle of length 3, but all 
simple cycles in G; have length at least 4. Notice that G; and G2 have the same numbers 
of edges and vertices and that every vertex in G; or G2 has degree 4. 


a b 1 2 


G2 


Figure 8.6.5 Nonisomorphic graphs. 
G» has a simple cycle of length 3, but 
G; has no simple cycles of length 3. < 


It would be easy to test whether a pair of graphs is isomorphic if we could find a 
small number of easily checked invariants that isomorphic graphs and only isomorphic 
graphs share. Unfortunately, no one has succeeded in finding such a set of invariants. 
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8.6 Review Exercises 


= 


. Define what it means for two graphs to be isomorphic. 4. What is an invariant in a graph? 


n 


. Give an example of isomorphic, nonidentical graphs. Explain 5. How is “invariant” related to isomorphism? 


hy they are isomorphic. 
bi ae a ™? 6. How can one determine whether graphs are isomorphic from 


. Give an example of two graphs that are not isomorphic. their adjacency matrices? 
Explain why they are not isomorphic. 


ue 


7. What is the mesh model for parallel computation? 


8.6 Exercises 


In Exercises 1-4, prove that the graphs G, and G2 are isomorphic. Any graph isomorphic to G; and G2 is called the Petersen graph. 
The Petersen graph is much used as an example; in fact, D. A. 
Holton and J. Sheehan wrote an entire book about it (see [Holton]]). 


5. Prove that the following graph is the Petersen graph; that is, 
prove that it is isomorphic to the graphs in Exercise 4. 


6. Draw a graph with 10 vertices. Label each vertex with one of 
the 10 distinct two-element subsets of {1, 2,3, 4,5}. Put an 
edge between two vertices if their labels (i.e., subsets) have no 
elements in common. Prove that your graph is the Petersen 
graph; that is, prove that it is isomorphic to the graphs in 
Exercise 4. 


In Exercises 7-9, prove that the graphs G, and G2 are not iso- 
morphic. 


Gi G2 
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9. a b «14. 


The second graph is G2 of Exercise 3. 


x15. 
In Exercises 10-15, determine whether the graphs G, and G2 are 


a b 1 2 
isomorphic. Prove your answer. 
e 5 
10. a b 1 5 \ 
d c 4 3 
e f 5 6 


G1 G2 
G G 
: é 16. Show that if M is ap, x p2 x +++ X pe mesh, where p; < 2" 
11. a 1 2 fori=1,...,k, then the (t; + tf +--+ + t,)-cube contains a 


subgraph isomorphic to M. 


An r-regular graph is a graph in which all vertices have degree r. 
A regular graph is a graph which is regular for some r. 


17. Show that for each r, all connected, simple, 2-vertex, r-regular 
graphs are isomorphic. 


18. Show that for each r, all connected, simple, 3-vertex, r-regular 
graphs are isomorphic. 


19. Show that for each r, all connected, simple, 4-vertex, r-regular 
graphs are isomorphic. 


20. Show that for each r, all connected, simple, 5-vertex, r-regular 
graphs are isomorphic. 


21. Find a value r and two nonisomorphic, connected, simple, 6- 
vertex, r-regular graphs. 


In Exercises 22-26, show that the property given is an invariant. 


22. Has a simple cycle of length k 
23. Has n vertices of degree k 


24. Is connected 


25. Has n simple cycles of length k 
26. Has an edge (v, w), where 5(v) = i and 6(w) =j 
27. Find an invariant not given in this section or in Exercises 


22-26. Prove that your property is an invariant. 


In Exercises 28-30, tell whether or not each property is an invari- 
ant. If the property is an invariant, prove that it is; otherwise, give 
a counterexample. 


28. Has an Euler cycle 


29. Has a vertex inside some simple cycle 


30. Is bipartite 
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31. Draw all nonisomorphic simple graphs having three vertices. 
32. Draw all nonisomorphic simple graphs having four vertices. 


33. Draw all nonisomorphic, cycle-free, connected graphs having 
five vertices. 


34. Draw all nonisomorphic, cycle-free, connected graphs having 
six vertices. 


35. Show that graphs G; and G2 are isomorphic if their vertices 
can be ordered so that their adjacency matrices are equal. 


The complement of a simple graph G is the simple graph G with 
the same vertices as G. An edge exists in G if and only if it does not 
exist in G. 


36. Draw the complement of the graph G; of Exercise 7. 
37. Draw the complement of the graph G2 of Exercise 7. 


*38. Show that if G is a simple graph, either G or G is connected. 


39. A simple graph G is self-complementary if G and G are 
isomorphic. 


(a) Find a self-complementary graph having five vertices. 
(b) Find another self-complementary graph. 


40. Let G; and G2 be simple graphs. Show that G; and G2 are 
isomorphic if and only if Gy and G are isomorphic. 

41. Given two graphs G; and G2, suppose that there is a one-to- 
one, onto function f from the vertices of G; to the vertices of 
G» and a one-to-one, onto function g from the edges of G; to 
the edges of Go, so that if an edge e is incident on v and w in 
G,, the edge g(e) is incident on f(v) and f(w) in Gz. Are G1 
and G2 isomorphic? 


A homomorphism from a graph G, to a graph G2 is a function f 
from the vertex set of G to the vertex set of G2 with the property 
that if v and w are adjacent in Gy, then f (v) and f(w) are adjacent 
in G2. 


42. Suppose that G; and G2 are simple graphs. Show that if f is a 
homomorphism of G; to Gz and f is one-to-one and onto, G; 
and G2 are isomorphic. 


In Exercises 43-47, for each pair of graphs, give an example of a 
homomorphism from G, to Go. 


43. 
a boc od 1 2 38 


G, G2 


44. 


G, G2 


45. G, = G, of Exercise 44; G2 = G, of Exercise 43 
46. G, = G, of Exercise 43 
1 2 


G2 
47. 


Gy G2 


*48. [Hell] Show that the only homomorphism from the graph to 
itself is the identity function. 


8.7 Planar Graphs 


Three cities, C), C2, and C3, are to be directly connected by expressways to each of three 


Go Online 
For a planar graph 
algorithm, see 
goo.gl/jfjmol 


other cities, Cy, Cs, and Cs. Can this road system be designed so that the expressways 
do not cross? A system in which the roads do cross is illustrated in Figure 8.7.1. If you 
try drawing a system in which the roads do not cross, you will soon be convinced that it 
cannot be done. Later in this section we explain carefully why it cannot be done. 


: 


4 3 
Figure 8.7.2 A 


connected, planar 
graph with f = 4 
faces (A, B, C, D), 
e = 8 edges, and 
v = 6 vertices; 
f=e-v4+2. 


Example 8.7.2 


d 


Figure 8.7.3 The 
nonplanar graph Ks. 
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Ci C2 C3 
C4 Cs Co 
K3 3 


Figure 8.7.1 Cities connected by 
expressways. 


Definition 8.7.1 >» A graph is planar if it can be drawn in the plane without its 
edges crossing. < 


In designing printed circuits it is desirable to have as few lines cross as possible; 
thus the designer of printed circuits faces the problem of planarity. 

If a connected, planar graph is drawn in the plane, the plane is divided into con- 
tiguous regions called faces. A face is characterized by the cycle that forms its boundary. 
For example, in the graph of Figure 8.7.2, face A is bounded by the cycle (5, 2, 3, 4, 5) 
and face C is bounded by the cycle (1, 2, 5, 1). The outer face D is considered to be 
bounded by the cycle (1, 2, 3, 4, 6, 1). The graph of Figure 8.7.2 has f = 4 faces, e = 8 
edges, and v = 6 vertices. Notice that f, e, and v satisfy the equation 


f=e-v4+2. (8.7.1) 


In 1752, Euler proved that equation (8.7.1) holds for any connected, planar graph. At 
the end of this section we will show how to prove (8.7.1), but for now let us show how 
(8.7.1) can be used to show that certain graphs are not planar. 


Show that the graph K3.3 of Figure 8.7.1 is not planar. 


SOLUTION Suppose that K3 3 is planar. Since every cycle has at least four edges, each 
face is bounded by at least four edges. Thus the number of edges that bound faces is at 
least 4f. In a planar graph, each edge belongs to at most two bounding cycles. Therefore, 
2e > 4f. Using (8.7.1), we find that 


2e > 4(e-—v+2). (8.7.2) 
For the graph of Figure 8.7.1, e = 9 and v = 6, so (8.7.2) becomes 
18 =2-9>49—-—64+42) = 20, 


which is a contradiction. Therefore, K3.3 is not planar. < 


By a similar kind of argument (see Exercise 15), we can show that the graph Ks 
of Figure 8.7.3 is not planar. 

Obviously, if a graph contains K33 or Ks as a subgraph, it cannot be planar. The 
converse is not true; however, if we introduce the concept of “homeomorphic graphs,” 
we can obtain a true statement similar to the converse (see Theorem 8.7.7). 


Definition 8.7.3 > If agraph Ghas a vertex v of degree 2 and edges (v, v) and 
(v, v2) with v; # v2, we say that the edges (v, v,) and (v, v2) are in series. A series 
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Example 8.7.4 


Example 8.7.6 


Go Online 
A biography 
of Kuratowski is at 
goo.gl/jfjmol 


Theorem 8.7.7 


reduction consists of deleting the vertex v from the graph G and replacing the edges 
(v, v1) and (v, v2) by the edge (1, v2). The resulting graph G’ is said to be obtained from 
G by a series reduction. By convention, G is said to be obtainable from itself by a series 
reduction. < 


In the graph G of Figure 8.7.4, the edges (v, v) and (v, v2) are in series. The graph G’ 
of Figure 8.7.4 is obtained from G by a series reduction. 


v2 v3 v2 v3 
v 
cl U4 Vy U4 
v5 v5 
G G' 
Figure 8.7.4 G’ is obtained from G by a series 
reduction. < 


Definition 8.7.5 » Graphs G; and Gz are homeomorphic if G, and Gy can be 
reduced to isomorphic graphs by performing a sequence of series reductions. < 


According to Definitions 8.7.3 and 8.7.5, any graph is homeomorphic to itself. 
Also, graphs G; and G2 are homeomorphic if G; can be reduced to a graph isomorphic 
to G2 or if G2 can be reduced to a graph isomorphic to Gj. 


The graphs G, and G2 of Figure 8.7.5 are homeomorphic since they can both be reduced 
to the graph G’ of Figure 8.7.5 by a sequence of series reductions. 


en Go G’ 


Figure 8.7.5 G, and G» are homeomorphic; each can be reduced to G’. < 


If we define a relation R on a set of graphs by the rule G; RG» if G; and G» are 
homeomorphic, R is an equivalence relation. Each equivalence class consists of a set of 
mutually homeomorphic graphs. 

We now state a necessary and sufficient condition for a graph to be planar. The 
theorem was first stated and proved by Kuratowski in 1930. The proof appears in [Even, 
1979]. 


Kuratowski's Theorem 


A graph G is planar if and only if G does not contain a subgraph homeomorphic to 
Ks or Ke3 5 


Example 8.7.8 
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Show that the graph G of Figure 8.7.6 is not planar by using Kuratowski’s Theorem. 


SOLUTION Let us try to find K3,3 in the graph G of Figure 8.7.6. We first note that 
the vertices a, b, f, and e each have degree 4. In K3.3 each vertex has degree 3, so let us 
eliminate the edges (a, b) and (f, e) so that all vertices have degree 3 (see Figure 8.7.6). 
We note that if we eliminate one more edge, we will obtain two vertices of degree 2 and 
we can then carry out two series reductions. The resulting graph will have nine edges; 
since K33 has nine edges, this approach looks promising. Using trial and error, we fi- 
nally see that if we eliminate edge (g, 4) and carry out the series reductions, we obtain 
an isomorphic copy of K3,3 (see Figure 8.7.7). Therefore, the graph G of Figure 8.7.6 is 
not planar, since it contains a subgraph homeomorphic to K3,3. 


—_—> 


Eliminate edges 


(a, b) (fie) 


Figure 8.7.6 Eliminating edges to obtain a subgraph. 


a a a 
if b = f b ~ f b 
& Eliminate edge & Series 
(g, h) reductions 
e e e 
¢ c c 
d d d 
K3,3 


Figure 8.7.7 Elimination of an edge to obtain a subgraph, followed by series reductions. 


Although Theorem 8.7.7 does give an elegant characterization of planar graphs, 
it does not lead to an efficient algorithm for recognizing planar graphs. However, algo- 
rithms are known that can determine whether a graph having n vertices is planar in time 
O(n) (see [Even, 1979]). 

We will conclude this section by proving Euler’s formula. 
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Theorem 8.7.9 


f =2,e=l1v=1 


Figure 8.7.8 The 
Basis Step of 
Theorem 8.7.9. 


Euler's Formula for Graphs 
If Gis a connected, planar graph with e edges, v vertices, and f faces, then 


f=e-v+2. (8.7.3) 


Proof We will use induction on the number of edges. 

Suppose that e = 1. Then G is one of the two graphs shown in Figure 8.7.8. In 
either case, the formula holds. We have verified the Basis Step. 

Suppose that the formula holds for connected, planar graphs with n edges. Let G 
be a graph with n+ 1 edges. First, suppose that G contains no cycles. Pick a vertex v and 
trace a path starting at v. Since G is cycle-free, every time we trace an edge, we arrive at a 
new vertex. Eventually, we will reach a vertex a, with degree 1, that we cannot leave (see 
Figure 8.7.9). We delete a and the edge x incident on a from the graph G. The resulting 
graph G’ has n edges; hence, by the inductive assumption, (8.7.3) holds for G’. Since G 
has one more edge than G’, one more vertex than G’, and the same number of faces as 


G’, it follows that (8.7.3) also holds for G. 


G G’ 


Figure 8.7.9 The proof of Theorem 8.7.9 for the case that G has no 
cycles. We find a vertex a of degree | and delete a and the edge x 
incident on it. 


Now suppose that G contains a cycle. Let x be an edge in a cycle (see Figure 8.7.10). 
Now x is part of a boundary for two faces. This time we delete the edge x but no vertices 
to obtain the graph G’ (see Figure 8.7.10). Again G’ has n edges; hence, 


G G' 
Figure 8.7.10 The proof of Theorem 8.7.9 for the case that G 


has a cycle. We delete edge x in a cycle. 


by the inductive assumption, (8.7.3) holds for G’. Since G has one more face than G’, 
one more edge than G’, and the same number of vertices as G’, it follows that (8.7.3) 
also holds for G. 
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Since we have verified the Inductive Step, by the Principle of Mathematical In- 


duction, the theorem is proved. < 
8.7 Review Exercises 
1. What is a planar graph? 5. What is a series reduction? 
2. What is a face? 6. Define homeomorphic graphs. 
3. State Euler’s equation for a connected, planar graph. 7. State Kuratowski’s theorem. 
4. What are series edges? 
8.7 Exercises 
In Exercises 1-3, show that each graph is planar by redrawing it 5. b 
so that no edges cross. a 
1. 
b c 
& 
a c d 
f 
e 
e d 


In Exercises 6-8, determine whether each graph is planar. If the 


2. graph is planar, redraw it so that no edges cross; otherwise, find a 
2 an é subgraph homeomorphic to either Ks or K3,3. 
NA - 6. a b 
dé X f 
f c 
3. 
e d 
i a b 
In Exercises 4 and 5, show that each graph is not planar by finding d bs 


a subgraph homeomorphic to either Ks or K3,3. 


4. , 
a b 
a h c 
d 
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9. A connected, planar graph has nine vertices having degrees 
2, 2, 2, 3, 3, 3, 4, 4, and 5. How many edges are there? How 
many faces are there? 

10. Show that adding or deleting loops, parallel edges, or edges in 
series does not affect the planarity of a graph. 


11. Show that any graph having four or fewer vertices is planar. 


12. Show that any graph having five or fewer vertices and a vertex 
of degree 2 is planar. 


13. Show that in any simple, connected, planar graph, e < 3v—6. 
14. Give an example of a simple, connected, nonplanar graph for 
which e < 3v — 6. 
15. Use Exercise 13 to show that Ks is not planar. 
x16. Show that if a simple graph G has 11 or more vertices, then 
either G or its complement G is not planar. 


*17. Prove that if a planar graph has an Euler cycle, it has an Eu- 
ler cycle with no crossings. A path P in a planar graph has a 
crossing if a vertex v appears at least twice in P and P crosses 


itself at v; that is, 
P=(...,W1,V, W2,...,W3,V,W4,--.), 


where the vertices are arranged so that w1, v, w2 crosses w3, 
v, w4 at v as in the following figure. 


Wt W4 


W3 w2 


A coloring of a graph G by the colors C,, C2, ..., Cn assigns to 
each vertex a color C; so that any vertex has a color different from 
that of any adjacent vertex. For example, the following graph is 
colored with three colors. The rest of the exercises deal with col- 
oring planar graphs. 


Cy C2 Cy 


A planar map is a planar graph where the faces are interpreted as 
countries, the edges are interpreted as borders between countries, 
and the vertices represent the intersections of borders. The prob- 
lem of coloring a planar map G, so that no countries with adjoining 


boundaries have the same color, can be reduced to the problem of 


coloring a graph by first constructing the dual graph G' of G in 
the following way. The vertices of the dual graph G' consist of one 
point in each face of G, including the unbounded face. An edge in 
G’ connects two vertices if the corresponding faces in G are sepa- 
rated by a boundary. Coloring the map G is equivalent to coloring 
the vertices of the dual graph G’. 


18. Prove that any planar graph whose boundary is a polygon, all 
of whose vertices lie on the boundary, and whose interior con- 
sists of triangles (bounded by straight-line edges) can be col- 
ored using three colors. The following figure shows such a 
graph. 


Ci C2 
C3 Ci - 
(on 3 
Cy 
19. Find the dual of the following map. 
B Cc 
A 
(ND 
oe gaa ls 
E F G 


20. Show that the dual of a planar map is a planar graph. 


21. Show that any coloring of the map of Exercise 19, excluding 
the unbounded region, requires at least three colors. 


22. Color the map of Exercise 19, excluding the unbounded re- 
gion, using three colors. 


23. Find the dual of the following map. 


“ G 
E ag 
FAI 
E 
D Cc 


24. Show that any coloring of the map of Exercise 23, excluding 
the unbounded region, requires at least four colors. 


25. Color the map of Exercise 23, excluding the unbounded re- 
gion, using four colors. 


A triangulation of a simple, planar graph G is obtained from G 
by connecting as many vertices as possible while maintaining pla- 
narity and not introducing loops or parallel edges. 


26. Find a triangulation of the following graph. 


| 


27. Show that if a triangulation G’ of a simple, planar graph G can 
be colored with n colors, so can G. 


28. Show that ina triangulation of a simple, planar graph, 3f = 2e. 


Appel and Haken proved (see [Appel]) that every simple, planar 
graph can be colored with four colors. The problem had been posed 
in the mid-1800s and for years no one had succeeded in giving a 
proof. Those working on the four-color problem in recent years had 
one advantage their predecessors did not—the use of fast electronic 
computers. The following exercises show how the proof begins. 

Suppose there is a simple, planar graph that requires more 
than four colors to color. Among all such graphs, there is one 
with the fewest number of vertices. Let G be a triangulation of 
this graph. Then G also has a minimal number of vertices and by 
Exercise 27, G requires more than four colors to color. 


29. If the dual of a map has a vertex of degree 3, what must the 
original map look like? 
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30. Show that G cannot have a vertex of degree 3. 
x31. Show that G cannot have a vertex of degree 4. 
x32. Show that G has a vertex of degree 5. 


The contribution of Appel and Haken was to show that only a finite 
number of cases involving the vertex of degree 5 needed to be con- 
sidered and to analyze all of these cases and show that all could 
be colored using four colors. The reduction to a finite number of 
cases was facilitated by using the computer to help find the cases 
to be analyzed. The computer was then used again to analyze the 
resulting cases. 


x33. Show that any simple, planar graph can be colored using five 
colors. 


8.8 Instant Insanity? 


Instant Insanity is a puzzle consisting of four cubes each of whose faces is painted one 
of four colors: red, white, blue, or green (see Figure 8.8.1). (There are different versions 
of the puzzle, depending on which faces are painted which colors.) The problem is to 
stack the cubes, one on top of the other, so that whether the cubes are viewed from front, 
back, left, or right, one sees all four colors (see Figure 8.8.2). Since 331,776 different 
stacks are possible (see Exercise 12), a solution by hand by trial and error is impractical. 
We present a solution, using a graph model, that makes it possible to discover a solution, 


if there is one, in a few minutes! 


Cube 1 


Cube 2 


Cube 3 


Cube 4 


Figure 8.8.2 A 
solution to the Instant 
Insanity puzzle of 
Figure 8.8.1. 


Figure 8.8.1 An Instant Insanity puzzle. 


First, notice that any particular stacking can be represented by two graphs, one 
representing the front/back colors and the other representing the left/right colors. For 
example, in Figure 8.8.3 we represent the stacking of Figure 8.8.2. The vertices represent 
the colors, and an edge connects two vertices if the opposite faces have those colors. For 
example, in the front/back graph, the edge labeled 1 connects R and W, since the front 
and back faces of cube | are red and white. As another example, in the left/right graph, 
W has a loop, since both the left and right faces of cube 3 are white. 

We can also construct a stacking from a pair of graphs such as those in Figure 
8.8.3, which represent a solution of the Instant Insanity puzzle. Begin with the front/back 
graph. Cube | is to have red and white opposing faces. Arbitrarily assign one of these 


colors, say red, to the front. Then cube 1 has a white back face. The other edge incident 
on W is 2, so make cube 2’s front face white. This gives cube 2 a blue back face. The 
other edge incident on B is 3, so make cube 3’s front face blue. This gives cube 3 a green 
back face. The other edge incident on G is 4. Cube 4 then gets a green front face and a 
red back face. The front and back faces are now properly aligned. At this point, the left 
and right faces are randomly arranged; however, we will show how to correctly orient 
the left and right faces without altering the colors of the front and back faces. 


iThis section can be omitted without loss of continuity. 
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Example 8.8.1 


34 
7 
Ww G W G 


(a) Front/back —(b) Left/right 


Figure 8.8.3 Graphs that represent Figure 8.8.4 Rotating a cube to 
the stacking of Figure 8.8.2. obtain a left/right orientation, without 
changing the front/back colors. 


Cube | is to have red and green opposing left and right faces. Arbitrarily assign 
one of these colors, say green, to the left. Then cube | has a red right face. Notice that by 
rotating the cube, we can obtain this left/right orientation without changing the colors of 
the front and back (see Figure 8.8.4). We can similarly orient cubes 2, 3, and 4. Notice 
that cubes 2 and 3 have the same colors on opposing sides. The stacking of Figure 8.8.2 
has been reconstructed. 

It is apparent from the preceding discussion that a solution to the Instant Insanity 
puzzle can be obtained if we can find two graphs like those of Figure 8.8.3. The properties 
needed are 


= Each vertex should have degree 2. (8.8.1) 
= Each cube should be represented by an edge exactly once in each graph. (8.8.2) 


m The two graphs should not have any edges in common. (8.8.3) 


Property (8.8.1) assures us that each color can be used twice, once on the front (or left) 
and once on the back (or right). Property (8.8.2) assures us that each cube is used exactly 
once. Property (8.8.3) assures us that, after orienting the front and back sides, we can 
successfully orient the left and right sides. 

To obtain a solution, we first draw a graph G that represents all of the faces of all 
of the cubes. The vertices of G represent the four colors, and an edge labeled i connects 
two vertices (colors) if the opposing faces of cube 7 have those colors. In Figure 8.8.5 
we have drawn the graph that represents the cubes of Figure 8.8.1. Then, by inspection, 
we find two subgraphs of G satisfying properties (8.8.1)—(8.8.3). Try your hand at the 
method by finding another solution to the puzzle represented by Figure 8.8.5. 


Figure 8.8.5 A graph representation of the 
Instant Insanity puzzle of Figure 8.8.1. 


Find a solution to the Instant Insanity puzzle of Figure 8.8.6. 


Figure 8.8.6 The Instant 
Insanity puzzle for 
Example 8.8.1. 
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SOLUTION We begin by trying to construct one subgraph having properties (8.8.1) and 
(8.8.2). We arbitrarily choose a vertex, say B, and choose two edges incident on vertex B. 
Suppose that we select the two edges shown as solid lines in Figure 8.8.7. Now consider 
the problem of picking two edges incident on vertex R. We cannot select any edges 
incident on B or G since B and G must each have degree 2. Since each cube must appear 
in each subgraph exactly once, we cannot select any of the edges labeled | or 2 since 
we already have selected edges with these labels. Edges incident on R that cannot be 
selected are shown dashed in Figure 8.8.7. This leaves only the edge labeled 4. Since we 
need two edges incident on R, our initial selection of edges incident on B must be revised. 

For our next attempt at choosing two edges incident on vertex B, let us select the 
edges labeled 2 and 3, as shown in Figure 8.8.8. Since this choice includes one edge inci- 
dent on R, we must choose one additional edge incident on R. We have three possibilities 
for selecting the additional edge (shown in color in Figure 8.8.8). (The loop incident at 
R counts as two edges and so cannot be chosen.) If we select the edge labeled 1 incident 
on R and G, we would need a loop at W labeled 4. Since there is no such loop, we do not 
select this edge. If we select the edge labeled | incident on R and W, we can then select 
the edge labeled 4 incident on W and G (see Figure 8.8.9). We have obtained one of the 
graphs. 


2 
2 = 2 
- te ei R B 
vee R B 
R mS, 
4} ‘4 a 1 
' 1 
Ww Ww G Ww G 
Figure 8.8.7 Trying to find a Figure 8.8.8 Another attempt to Figure 8.8.9 A subgraph of 
subgraph of Figure 8.8.6 find a subgraph of Figure 8.8.6 Figure 8.8.6 satisfying (8.8.1) 
satisfying (8.8.1) and (8.8.2). satisfying (8.8.1) and (8.8.2). and (8.8.2). 


Figure 8.8.10 Edges 
incident on B not used 
in Figure 8.8.9. 


We now look for a second graph having no edges in common with the graph just 
chosen. Let us again begin by picking two edges incident on B. Because we cannot 
reuse edges, our choices are limited to three edges (see Figure 8.8.10). Choosing the 
edges labeled 1 and 4 leads to the graph of Figure 8.8.11. The graphs of Figures 8.8.9 
and 8.8.11 solve the Instant Insanity puzzle of Figure 8.8.6. 


: 


3 


Figure 8.8.11 Another subgraph of Figure 8.8.6, with no edges in 
common with Figure 8.8.9, satisfying (8.8.1) and (8.8.2). This graph and 
that of Figure 8.8.9 solve the Instant Insanity puzzle of Figure 8.8.6. < 
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8.8 Review Exercises 


1. Describe the Instant Insanity puzzle. 2. Describe how to solve the Instant Insanity puzzle. 


8.8 Exercises 


Find solutions to the following Instant Insanity puzzles. 5. 


i 


2. 
6. 
3. 7. (a) Find all subgraphs of Figure 8.8.5 satisfying properties 
(8.8.1) and (8.8.2). 
(b) Find all the solutions to the Instant Insanity puzzle of 
Figure 8.8.5. 
8. (a) Represent the Instant Insanity puzzle by a graph. 
1 2 3 4 
4. 


(b) Find a solution to the puzzle. 

(c) Find all subgraphs of your graph of part (a) satisfying 
properties (8.8.1) and (8.8.2). 

(d) Use (c) to show that the puzzle has a unique solution. 

9. Show that the following Instant Insanity puzzle has no solu- 
tion by giving an argument to show that no subgraph satisfies 
properties (8.8.1) and (8.8.2). Notice that there is no solution 
even though each cube contains all four colors. 


x10. 


11. 
12. 


+13. 


Give an example of an Instant Insanity puzzle satisfying: 
(a) There is no solution. 
(b) Each cube contains all four colors. 


(c) There is a subgraph satisfying properties (8.8.1) and 
(8.8.2). 


Show that there are 24 orientations of a cube. 


Number the cubes of an Instant Insanity puzzle 1, 2, 3, and 
4. Show that the number of stackings in which the cubes 
are stacked 1, 2, 3, and 4, reading from bottom to top, is 
331,776. 


How many Instant Insanity graphs are there; that is, how many 
graphs are there with four vertices and 12 edges—three of each 
of four types? 


Exercises 14-21 refer to a modified version of Instant Insanity 
where a solution is defined to be a stacking that, when viewed from 
the front, back, left, or right, shows one color. (The front, back, left, 
and right are of different colors.) 


14. 


Give an argument that shows that if we graph the puzzle as in 
regular Instant Insanity, a solution to modified Instant Insan- 
ity consists of two subgraphs of the form shown in the figure, 
with no edges or vertices in common. 


1 


is 


4 
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Find solutions to modified Instant Insanity for the following 
puzzles. 


15. 


16. 


17. 


18. 
19. 


Graph of Exercise 6 


Show that for Figure 8.8.5, Instant Insanity, as given in the 
text, has a solution, but the modified version does not have a 
solution. 


. Show that if modified Instant Insanity has a solution, the ver- 


sion given in the text must also have a solution. 


. Is it possible for neither version of Instant Insanity to have a 


solution even if each cube contains all four colors? If the an- 
swer is yes, prove it; otherwise, give a counterexample. 


Chapter 8 Notes 


Virtually any reference on discrete mathematics contains one or more chapters on graph 
theory. Books specifically on graph theory are [Berge; Bondy; Chartrand; Deo; Even, 1979; 
Gibbons; Harary; Konig; Ore; West; and Wilson]. [Deo; Even, 1979; and Gibbons] empha- 
size graph algorithms. [Brassard, Cormen, and Johnsonbaugh] also treat graphs and graph 


algorithms. 


[Akl; Leighton; Lester; Lewis; Miller; and Quinn] discuss parallel computers and al- 
gorithms for parallel computers. Our results on subgraphs of the hypercube are from [Saad]. 
Euler’s original paper on the K6nigsberg bridges, edited by J. R. Newman, was reprinted 


as [Newman]. 


In [Gardner, 1959], Hamiltonian cycles are related to the Tower of Hanoi puzzle. 
Concerning the traveling salesperson problem, [Applegate] is accurately self-described 
as “the definitive book on the subject.” 
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In many cases, so called branch-and-bound methods (see, e.g., [Tucker]) often give 
solutions to the traveling salesperson problem more efficiently than will exhaustive search. 

Dijkstra’s shortest-path algorithm appears in [Dijkstra, 1959]. 

The complexity of the graph isomorphism problem is discussed in [Kobler]. 

Appel and Haken published their solution to the four-color problem in [Appel]. 


Chapter 8 Review 


Section 8.1 


. Graph G = (V, E) (undirected and directed) 
. Vertex 


Edge 
Edge e is incident on vertex v 


. Vertex v is incident on edge e 


v and w are adjacent vertices 


. Parallel edges 


Loop 


. Isolated vertex 
. Simple graph 

. Weighted graph 
12. 
. Length of a path in a weighted graph 

. Similarity graph 

. Dissimilarity function 

. n-cube (hypercube) 

. Serial computer 

. Serial algorithm 

. Parallel computer 

. Parallel algorithm 

. Complete graph on n vertices, K;, 

. Bipartite graph 

. Complete bipartite graph on m and n vertices, Kin 


Weight of an edge 


Section 8.2 


24. 
25. 
26. 
27. 
28. 
29. 
30. 
31. 
32. 
33. 
34. 


35. 


36. 
37. 


Path 

Simple path 

Cycle 

Simple cycle 

Connected graph 

Subgraph 

Component of a graph 

Degree of a vertex 5(v) 

KOonigsberg bridge problem 

Euler cycle 

A graph G has an Euler cycle if and only if G is connected 
and every vertex has even degree. 

The sum of the degrees of all vertices in a graph is equal to 
twice the number of edges. 

In any graph, the number of vertices of odd degree is even. 
A graph has a path with no repeated edges from v to w 
(v € w) containing all the edges and vertices if and only 
if it is connected and v and w are the only vertices having 
odd degree. 


38. 


If a graph G contains a cycle from v to v, G contains a simple 
cycle from v to v. 


Section 8.3 


39. 
40. 
41. 
42. 


Hamiltonian cycle 

Traveling salesperson problem 

Ring model for parallel computation 
Gray code 


Section 8.4 


43. 


Dijkstra’s shortest-path algorithm 


Section 8.5 


44. 
45. 


46. 


Adjacency matrix 

If A is the adjacency matrix of a simple graph, the ij th entry 
of A” is equal to the number of paths of length n from vertex 
i to vertex j. 

Incidence matrix 


Section 8.6 


47. 


48. 


49. 
50. 


Graphs G; = (V,, £)) and Gy = (V2, Ey) are isomorphic 
if there are one-to-one, onto functions f: V; — V2 and 
g: E, — E) such that e € E; is incident on v, w € V if 
and only if g(e) is incident on f(v) and f(w). 

Graphs G and G2 are isomorphic if and only if, for some or- 
derings of their vertices, their adjacency matrices are equal. 
Mesh model for parallel computation 

Invariant 


Section 8.7 


51. 
52. 
53. 
54. 
55. 
56. 
57. 


Planar graph 

Face 

Euler’s equation for connected, planar graphs: f = e—v+2 
Edges in series 

Series reduction 

Homeomorphic graphs 

Kuratowski’s theorem: A graph is planar if and only if it 
does not contain a subgraph homeomorphic to Ks or K3,3. 


Section 8.8 


58. 
59. 


Instant Insanity 
How to solve an Instant Insanity puzzle 
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Chapter 8 Self-Test 


1. For the graph G = (V, E), find V, E, all parallel edges, all 
loops, and all isolated vertices, and state whether G is a sim- 
ple graph. Also state on which vertices edge e3 is incident 
and on which edges vertex v2 is incident. 


U3 
UL. v2 


2. Tell whether the path (v2, v3, v4, v2, V6, V1, V2) In the graph 
is a simple path, a cycle, a simple cycle, or none of these. 


é@ Vv é 
Vv} 1 2 2 v3 
es\ €4 
e7 v7 e3 
eu [€8 
U6 U4 


. Find a Hamiltonian cycle in the graph of Exercise 2. 
. Write the adjacency matrix of the graph of Exercise 2. 


. Write the incidence matrix of the graph of Exercise 2. 


nn & WwW 


. IfA is the adjacency matrix of the graph of Exercise 2, what 
does the entry in row v2 and column v3 of A? represent? 


In Exercises 7 and 8, determine whether the graphs G, and G2 
are isomorphic. Prove your answer. 


Ts 
al v2 Wi w2 
. i‘ en 
‘i G2 
Ww Ww 
G, 4 5 
8. 
z 2 Wh 
=. : 
v5 5 wo 


G1 Go 


9. Explain why the following graph does not have a path from 
a to a that passes through each edge exactly one time. 


10. Draw Ks, the complete bipartite graph on 2 and 5 vertices. 
11. Draw all subgraphs containing exactly two edges of the fol- 
lowing graph. 


Uy 


12. Find a connected subgraph of the graph of Exercise 2 con- 
taining all of the vertices of the original graph and having 
as few edges as possible. 


13. Find a Hamiltonian cycle in the 3-cube. 


14. Show that the graph has no Hamiltonian cycle. 


b 
a c 


f 


15. Cana column of an incidence matrix consist only of zeros? 
Explain. 


16. Prove that the n-cube is bipartite for all n > 1. 


17. Does the graph of Exercise 2 contain an Euler cycle? 
Explain. 


18. Show that the cycle (a, b, c,d, e,f, g, h, i,j, a) provides a 
solution to the traveling salesperson problem for the graph 
shown. 


a 5 b 4 ¢ 6 4d 
Is. ~3 
glk 2 g 


19. Draw all nonisomorphic, simple graphs having exactly five 
vertices and two edges. 

20. Draw all nonisomorphic, simple graphs having exactly five 
vertices, two components, and no cycles. 
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Exercises 21-24 refer to the following graph. 


21. 
22. 
23. 
24. 


Find the length of a shortest path from a to i. 
Find the length of a shortest path from a to z. 
Find a shortest path from a to z. 


Find the length of a shortest path from a to z that passes 
through c. 


In Exercises 25 and 26, determine whether the graph is planar. If 
the graph is planar, redraw it so that no edges cross; otherwise, 
find a subgraph homeomorphic to either Ks or K3,3. 


25. 
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26. 


27. 


28. 


29. 


30. 
31. 


32. 


Show that any simple, connected graph with 31 edges and 
12 vertices is not planar. 


Show that the n-cube is planar if n < 3 and not planar if 
n> 3. 


Represent the Instant Insanity puzzle by a graph. 


Find a solution to the puzzle of Exercise 29. 


Find all subgraphs of the graph of Exercise 29 satisfying 
properties (8.8.1) and (8.8.2). 


Use Exercise 31 to determine how many solutions the puz- 
zle of Exercise 29 has. 


1. 


Write a program that accepts as input any of the following: 
© a listing of edges of a graph given as pairs of positive 
integers 
@ the adjacency matrix 
@ the incidence matrix 


and outputs the other two. 


. Write a program that determines whether a graph contains 


an Euler cycle. 


. Write a program that finds an Euler cycle in a connected 


graph in which all vertices have even degree. 


. Write a program that randomly generates ann xn adjacency 


matrix. Have your program print the adjacency matrix, the 
number of edges, the number of loops, and the degree of 
each vertex. 


. Write a program that determines whether a graph is a bipar- 


tite graph. If it is a bipartite graph, the program should list 
the disjoint sets of vertices. 


6. 


13. 


14. 


Write a program that accepts as input the edges of a graph 
and then draws the graph using a computer graphics display. 


. Write a program that lists all simple paths between two 


given vertices. 


. Write a program that determines whether a path is a simple 


path, a cycle, or a simple cycle. 


. Write a program that checks whether a proposed cycle is a 


Hamiltonian cycle. 


. Write a program that checks whether a proposed path is a 


Hamiltonian path. 


. Write a program that constructs a Gray code. 


. Implement Dijkstra’s shortest-path algorithm as a pro- 


gram. The program should find a shortest path and its 
length. 


Write a program that tests whether a proposed isomorphism 
is an isomorphism. 


Write a program to determine whether a graph is planar. 


15. Write a program that solves an arbitrary Instant Insanity 
puzzle. 


16. Implement randomized_hamiltonian_cycle (Algorithm 
8.3.10) as a computer program. Add a condition to stop 
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the algorithm after some number of iterations to guarantee 
that it will always terminate. Try your program out on sev- 
eral sample graphs. (There is a random graph generator at 
the website that accompanies this book.) 


9.1 Introduction 

9.2 Terminology and 
Characterizations 
of Trees 

9.3 Spanning Trees 

9.4 Minimal Spanning Trees 

9.5 Binary Trees 

9.6 Tree Traversals 

9.7 Decision Trees and the 
Minimum Time for 
Sorting 

9.8 l|somorphisms of Trees 

19.9 Game Trees 

9.1 
Go Online 


For more on trees, see 
goo.gl/oc4uly 


438 


Trees form one of the most widely used subclasses of graphs. Computer science, in par- 
ticular, makes extensive use of trees. In computer science, trees are useful in organizing 
and relating data in a database (see Example 9.1.7). Trees also arise in theoretical prob- 
lems such as the optimal time for sorting (see Section 9.7). 

In this chapter we begin by giving the requisite terminology. We look at subclasses 
of trees (e.g., rooted trees and binary trees) and many applications of trees (e.g., spanning 
trees, decision trees, and game trees). Our discussion of tree isomorphisms extends the 
discussion of Section 8.6 on graph isomorphisms. 


Introduction 


Figure 9.1.1 shows the results of the semifinals and finals of a classic tennis competition 
at Wimbledon, which featured four of the greatest players in the history of tennis. At 
Wimbledon, when a player loses, she is out of the tournament. Winners continue to play 
until only one person, the champion, remains. (Such a competition is called a single- 
elimination tournament.) Figure 9.1.1 shows that in the semifinals Monica Seles defeated 
Martina Navratilova and Steffi Graf defeated Gabriela Sabatini. The winners, Seles and 
Graf, then played, and Graf defeated Seles. Steffi Graf, being the sole undefeated player, 
became Wimbledon champion. 

If we regard the single-elimination tournament of Figure 9.1.1 as a graph (see 
Figure 9.1.2), we obtain a tree. If we rotate Figure 9.1.2, it looks like a natural tree (see 
Figure 9.1.3). The formal definition follows. 


Definition 9.1.1 > A (free) tree T is a simple graph satisfying the following: If 
v and w are vertices in 7, there is a unique simple path from v to w. 
A rooted tree is a tree in which a particular vertex is designated the root. < 


iThis section can be omitted without loss of continuity. 


Example 9.1.2 


v1 


v3 


U4 v7 


Figure 9.1.4 
The tree 
Figure 9.1.3(a) 
with the root at 
the top. 


Example 9.1.3 
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Seles 
Seles 
Navratilova 
Graf ] 
Sabatini 
WIMBLEDON 
Graf CHAMPION 
Graf 
FINALS 
SEMIFINALS 


U7 U6 US v4 eo 

U4 ant ® ce 

v2 Orgy: 
we U3 v2 

Uy 
U6 
Vy WM 

U3 

“7 (a) (b) 


Figure 9.1.2 The tournament 
of Figure 9.1.1 as a tree. 


Figure 9.1.3 The tree of Figure 9.1.2 rotated (a) 
compared with a natural tree (b). 


If we designate the winner as the root, the single-elimination tournament of Figure 9.1.1 
(or Figure 9.1.2) is a rooted tree. Notice that if v and w are vertices in this graph, there 
is a unique simple path from v to w. For example, the unique simple path from v2 to v7 
is (V2, Vj, V3, V7). < 


In contrast to natural trees, which have their roots at the bottom, in graph theory 
rooted trees are typically drawn with their roots at the top. Figure 9.1.4 shows the way 
the tree of Figure 9.1.2 would be drawn (with v; as root). First, we place the root v; at 
the top. Under the root and on the same level, we place the vertices, v2 and v3, that can 
be reached from the root on a simple path of length 1. Under each of these vertices and 
on the same level, we place the vertices, v4, vs, V6, and v7, that can be reached from the 
root on a simple path of length 2. We continue in this way until the entire tree is drawn. 
Since the simple path from the root to any given vertex is unique, each vertex is on a 
uniquely determined level. We call the level of the root level 0. The vertices under the 
root are said to be on level 1, and so on. Thus the level of a vertex v is the length of the 
simple path from the root to v. The height of a rooted tree is the maximum level number 
that occurs. 


The vertices v1, v2, V3, V4, Vs, Vo, v7 in the rooted tree of Figure 9.1.4 are on (respectively) 
levels 0, 1, 1, 2, 2, 2, 2. The height of the tree is 2. < 


440. Chapter9 @ Trees 


Example 9.1.4 _ If we designate e as the root in the tree T of Figure 9.1.5, we obtain the rooted tree 7’ 
———a—=_ shown in Figure 9.1.5. The vertices a, b, c, d, e, f, g, h, i, j are on (respectively) levels 2, 
1,2, 1,0, 1, 1, 2, 2, 3. The height of 7” is 3. 


T 


Figure 9.1.5 A tree T and a rooted tree 7’. T’ is obtained from T by designating 
eas the root. 


Example 9.1.5 A rooted tree is often used to specify hierarchical relationships. When a tree is used 
—e—= in this way, if vertex a is on a level one less than the level of vertex b and a and b 
are adjacent, then a is “just above” b and a logical relationship exists between a and 
b: a dominates b or b is subordinate to a in some way. An example of such a tree, 
which is the administrative organizational chart of a hypothetical university, is given in 


Figure 9.1.6. 
President | 
Vice-President Vice-President 
for Academic for Administrative 
Affairs Affairs 
| 
Dean of Dean of Director of Director of 
Arts/ Business Academic Purchasing 
Sciences Planning 
Chair oe in Chair |... 
of C 9 t of 
Mathematics Sera Accounting 
Science 
Figure 9.1.6 An administrative organizational chart. < 


Example 9.1.6 Computer File Systems Modern computer operating systems organize folders and files 
=~ using a tree structure. A folder contains other folders and files. Figure 9.1.7 shows the 
Windows Explorer display of folders on the left and files on the right on a particular 
computer. Figure 9.1.8 shows the same structure as a rooted tree. The root is called 
Desktop. Under Desktop are My Documents, My Computer, and others. Under My Doc- 
uments are Fax, My Data Sources, My Pictures, and others. Under My Pictures are 
archived, basement water, and My eBooks. Under basement water, which is highlighted, 
are the files 11-18-03, DSCO1007 11-18-03, and others, which appear on the right of 
Figure 9.1.7. 


Example 9.1.7 
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Figure 9.1.7 Windows Explorer display of folders and files on a 
particular computer. The folders appear on the left and files appear on the 
right. Under basement water, which is highlighted, are the files //-/8-03, 
DSC01007 11-18-03, and others, which appear on the right. 


Desktop 


My Documents My Computer My Network Places 


Fax My Data Sources My Pictures 
archived basement water My eBooks 


11-18-03 = DSC01007 11-18-03 DSC01008 11-18-03 


Figure 9.1.8 The structure of Figure 9.1.7 shown as a rooted tree. < 


Hierarchical Definition Trees Figure 9.1.9 is an example of a hierarchical definition 
tree. Such trees are used to show logical relationships among records in a database. 
[Recall (see Section 3.6) that a database is a collection of records that are manipulated by 
a computer.] The tree of Figure 9.1.9 might be used as a model for setting up a database 
to maintain records about books housed in several libraries. 


Book | 


- 
Author ] Location ] | Publisher | 


Availability 


i 


Figure 9.1.9 A hierarchical definition tree. | 
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Example 9.1.8 Huffman Codes The most common way to represent characters internally in a com- 
_—_—__ puter is by using fixed-length bit strings. For example, ASCH (American Standard Code 


Go Online for Information Interchange) represents each character by a string of seven bits. Exam- 

For more on Huffman ples are given in Table 9.1.1. 

codes, see Huffman codes, which represent characters by variable-length bit strings, provide 

goo.gl/oc4uly alternatives to ASCII and other fixed-length codes. The idea is to use short bit strings to 
represent the most frequently used characters and to use longer bit strings to represent 

TABLE 9.1.1 @ A Portion less frequently used characters. In this way it is generally possible to represent strings of 

of the ASCII Table characters, such as text and programs, in less space than if ASCII were used. For exam- 


ple, Huffman codes are used with other techniques to compress data for fax machines. 


Character ASCH Code A Huffman code is most easily defined by a rooted tree (see Figure 9.1.10). To 


A 100 0001 decode a bit string, we begin at the root and move down the tree until a character is 
B 100 0010 encountered. The bit, 0 or 1, tells us whether to move right or left. As an example, let us 
Cc 100 0011 decode the string 

1 011 0001 

2 011 0010 01010111. (9.1.1) 
! 010 0001 . : Bs Ses a3 

fe We begin at the root. Since the first bit is 0, the first move is right. Next, we move left 


010 1010 ? . ‘ 
and then right. At this point, we encounter the first character R. To decode the next 


character, we begin again at the root. The next bit is 1, so we move left and encounter 
the next character A. The last bits 0111 decode as T. Therefore, the bit string (9.1.1) 
represents the word RAT. 

Given a tree that defines a Huffman code, such as Figure 9.1.10, any bit string 
[e.g., (9.1.1)] can be uniquely decoded even though the characters are represented by 
variable-length bit strings. For the Huffman code defined by the tree of Figure 9.1.10, 
the character A is represented by a bit string of length 1, whereas S and T are repre- 
sented by bit strings of length 4. (A is represented as 1, S is represented as 0110, and T 
is represented as 0111.) < 


Huffman gave an algorithm (Algorithm 9.1.9) to construct a Huffman code from 
a table giving the frequency of occurrence of the characters to be represented so that 
the code constructed represents strings of characters in minimal space, provided that 


Figure 9.1.10 A the strings to be represented have character frequencies identical to the character fre- 
Huffman code. quencies in the table. A proof that the code constructed is optimal may be found in 
{Johnsonbaugh]. 


Algorithm 9.1.9 | Constructing an Optimal Huffman Code 


This algorithm constructs an optimal Huffman code from a table giving the frequency 
of occurrence of the characters to be represented. The output is a rooted tree with the 
vertices at the lowest levels labeled with the frequencies and with the edges labeled 
with bits as in Figure 9.1.10. The coding tree is obtained by replacing each frequency 
with a character having that frequency. 


Input: A sequence of n frequencies, n > 2 


Output: A rooted tree that defines an optimal Huffman code 


Lad huffman(f,n) { 
if (n == 2) { 


fi ha let f, and fo denote the frequencies 
Figure 9.1.11 let T be as in Figure 9.1.11 
The case n = 2 for return T 


Algorithm 9.1.9. } 


fi fj 
Figure 9.1.12 


The case n > 2 for 
Algorithm 9.1.9. 


Example 9.1.10 
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let f; and f; denote the smallest frequencies 

replace f; and fj in the list f by f; + fj 

T’ = huffman(f,n — 1) 

replace a vertex in T’ labeled f; + f; by the tree shown in Figure 9.1.12 
to obtain the tree T 

return T 


Show how Algorithm 9.1.9 constructs an optimal Huffman code using Table 9.1.2. 


TABLE 9.1.2 @ Input for Example 9.1.10 


Character Frequency 
! 2 
@ 3 
# T 
$ 8 
% 12 


SOLUTION The algorithm begins by repeatedly replacing the smallest two frequencies 
with the sum until a two-element sequence is obtained: 


2,3, 7,8, 12 > 243,7, 8, 12 
5,7, 8,12 + 5+7,8, 12 
8, 12,12 > 8+ 12, 12 

12, 20 


The algorithm then constructs trees working backward beginning with the two-element 
sequence 12, 20 as shown in Figure 9.1.13. For example, the second tree is obtained 
from the first by replacing the vertex labeled 20 by the tree of Figure 9.1.14 since 20 
arose as the sum of 8 and 12. Finally, to obtain the optimal Huffman coding tree, we 
replace each frequency by a character having that frequency (see Figure 9.1.15). 


Figure 9.1.13 Constructing an optimal Huffman code. 


Notice that the Huffman tree for Table 9.1.2 is not unique. When 12 is replaced 
by 5, 7, because there are two vertices labeled 12, there is a choice. In Figure 9.1.13, we 
arbitrarily chose one of the vertices labeled 12. If we choose the other vertex labeled 12, 
we will obtain the tree of Figure 9.1.16. Either of the Huffman trees gives an optimal 
code; that is, either will encode text having the frequencies of Table 9.1.2 in exactly the 
same (optimal) space. 
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1 0 
8 12 
Figure 9.1.14 Figure 9.1.15 The final Figure 9.1.16 
The tree that replaces tree of Figure 9.1.13 with Another optimal 
the vertex labeled 20 each frequency replaced Huffman tree for 
in Figure 9.1.13. by a character having that Example 9.1.10. 
frequency. < 
9.1 Review Exercises 
1. Define free tree. 6. Explain how files and folders in a computer system are orga- 
ends wibied wee. nized into a rooted tree structure. 
i ? 
3. What is the level of a vertex in a rooted tree? ee ene 
8. Explain how t truct timal Huff , 
4. What is the height of a rooted tree? mg ein Hye te construe anh pets al akon cede 
5. Give an example of a hierarchical definition tree. 


9.1 Exercises 


Which of the graphs in Exercises 1-4 are trees? Explain. 9. Find the height of the tree of Exercise 8. 
10. Draw the tree T of Figure 9.1.5 as a rooted tree with a as root. 


1. 2. What is the height of the resulting tree? 
11. Draw the tree T of Figure 9.1.5 as a rooted tree with b as root. 
What is the height of the resulting tree? 
12. Give an example similar to Example 9.1.5 of a tree that is used 
to specify hierarchical relationships. 
3: ae bi < 13. Give an example different from Example 9.1.7 of a hierarchical 


definition tree. 


Decode each bit string using the Huffman code given. 


5. For which values of n is the complete graph on n vertices a tree? 


6. For which values of m and n is the complete bipartite graph on 
m and n vertices a tree? 


7. For which values of n is the n-cube a tree? 


8. Find the level of each vertex in the tree shown. 


14. 011000010 15. 01111001001110 
16. 01110100110 17. 1110011101001111 
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Encode each word using the preceding Huffman code. 


18. 
20. 
22. 


23. 


24. 


25. 


26. 


27. 


28. 


DEN 19. LEADEN 

NEED 21. PENNED 

What factors in addition to the amount of memory used should 
be considered when choosing a code, such as ASCII or a 
Huffman code, to represent characters in a computer? 


What techniques in addition to the use of Huffman codes 
might be used to save memory when storing text? 


Construct an optimal Huffman code for the set of letters in the 
table. 


Letter Frequency Letter Frequency 
a P 6 11 
B 6 € 20 
Y 6 


Construct an optimal Huffman code for the set of letters in the 
table. 


Letter Frequency Letter Frequency 
I 75 Cc 5.0 
U 20.0 H 10.0 
B 2.5 M 25 
S 27.5 P 25.0 


Use the code developed in Exercise 25 to encode the follow- 
ing words (which have frequencies consistent with the table 
of Exercise 25): 

BUS, CUPS, MUSH, PUSS, SIP, PUSH, 

CUSS, HIP, PUP, PUPS, HIPS. 
Construct two optimal Huffman coding trees for the table of 
Exercise 24 of different heights. 


Construct an optimal Huffman code for the set of letters in the 
table. 


Letter Frequency Letter Frequency 
a 2, d 8 
b 3 e 13 
c 5 f 21 


29. 


30. 
31. 


32. 
33. 
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Professor Ter A. Byte needs to store text made up of the 
characters A, B, C, D, E, which occur with the following 
frequencies: 


Character Frequency Character Frequency 


A 6 D ) 
B 2 E 8 
Cc 3 


Professor Byte suggests using the variable-length codes 


Character Code 


moOaAwPS 
= 


which, he argues, store the text in less space than that used by 
an optimal Huffman code. Is the professor correct? Explain. 


Show that a tree is a planar graph. 


Show that any tree with two or more vertices has a vertex of 
degree 1. 


Show that a tree is a bipartite graph. 


Show that the vertices of a tree can be colored with two colors 
so that each edge is incident on vertices of different colors. 


The eccentricity of a vertex v in a tree T is the maximum length of 
a simple path that begins at v. 


34. 


Find the eccentricity of each vertex in the tree of Figure 9.1.5. 


A vertex v in a tree T is a center for T if the eccentricity of v is 
minimal. 


35. 
x36. 
«37. 

38. 


39. 


Find the center(s) of the tree of Figure 9.1.5. 

Show that a tree has either one or two centers. 

Show that if a tree has two centers they are adjacent. 

Define the radius r of a tree using the concepts of eccentricity 
and center. The diameter d of any graph was defined before 


Exercise 71, Section 8.2. Is it always true, according to your 
definition of radius, that 2r = d? Explain. 


Give an example of a tree T that does not satisfy the follow- 
ing property: If v and w are vertices in 7, there is a unique path 
from v to w. 


9.2 Terminology and Characterizations of Trees 


A portion of the family tree of the ancient Greek gods is shown in Figure 9.2.1. (Not all 
children are listed.) As shown, we can regard a family tree as a rooted tree. The vertices 
adjacent to a vertex v and on the next-lower level are the children of v. For example, 
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Kronos’s children are Zeus, Poseidon, Hades, and Ares. The terminology adapted from 
a family tree is used routinely for any rooted tree. The formal definitions follow. 


Uranus 


Aphrodite Kronos Atlas Prometheus 


Eros Zeus Poseidon Hades Ares Es 


Apollo Athena Hermes Heracles 


Figure 9.2.1 A portion of the family tree of ancient Greek gods. 


Definition 9.2.1 >» Let T be a tree with root vo. Suppose that x, y, and z are 
vertices in T and that (vo, vj, ..., ¥,) is a simple path in 7: Then 


(a) V__1 is the parent of vy. 

(b) vo, ..., Vp—1 are ancestors of vy. 

(c) Vv, is a child of vy_1. 

(d) If x is an ancestor of y, y is a descendant of x. 

(e) If x and y are children of z, x and y are siblings. 

(f) Ifx has no children, x is a terminal vertex (or a leaf). 

(g) Ifx is not a terminal vertex, x is an internal (or branch) vertex. 


(h) The subtree of Trooted at x is the graph with vertex set V and edge set E, where 
V is x together with the descendants of x and 


E = {e | e is an edge on a simple path from x to some vertex in V}. 4 


Example 9.2.2 _ In the rooted tree of Figure 9.2.1, 


(a) The parent of Eros is Aphrodite. 
(b) The ancestors of Hermes are Zeus, Kronos, and Uranus. 
(c) The children of Zeus are Apollo, Athena, Hermes, and Heracles. 


(d) The descendants of Kronos are Zeus, Poseidon, Hades, Ares, Apollo, Athena, 
Hermes, and Heracles. 


(e) Aphrodite and Prometheus are siblings. 


(f) The terminal vertices are Eros, Apollo, Athena, Hermes, Heracles, Poseidon, 
Hades, Ares, Atlas, and Prometheus. 


(g) The internal vertices are Uranus, Aphrodite, Kronos, and Zeus. 


(h) The subtree rooted at Kronos is shown in Figure 9.2.2. 


v1 
VO=Un 


v2 
tis Up d 
Y 
: Vi+1 
Un-1 + - 
ee 


Yel Uj 


Figure 9.2.3 A simple cycle. 


Theorem 9.2.3 


Go Online 

For more on equivalent 
definitions of trees, see 
goo.gl/oc4uly 


i er 
ie w ~P 


Figure 9.2.4 The proof of 
Theorem 9.2.3 [if (b), then (c)]. 
P is a simple path. v and the edge 
incident on v are removed so that 
the inductive hypothesis can be 
invoked. 


9.2 @ Terminology and Characterizations of Trees 447 


Kronos 


Zeus Poseidon Hades Ares 


Apollo Athena Hermes Heracles 


Figure 9.2.2 The subtree rooted at Kronos of the 
tree of Figure 9.2.1. < 


The remainder of this section is devoted to providing alternative characterizations 
of trees. Let T be a tree. We note that T is connected since there is a simple path from 
any vertex to any other vertex. Further, we can show that 7 does not contain a cycle. 
To see this, suppose that T contains a cycle C’. By Theorem 8.2.24, T contains a simple 
cycle (see Figure 9.2.3) 


C= (vo, a) Vn)s 


Vo = Vy. Since T is a simple graph, C cannot be a loop; so C contains at least two distinct 
vertices v; and v,;, i < j. Now 


(Vis Vins «+ Vp) (Vis Vi-ts «+ +s VOs Vn-1s e+ Yj) 


are distinct simple paths from v; to v;, which contradicts the definition of a tree. There- 
fore, a tree cannot contain a cycle. 

A graph with no cycles is called an acyclic graph. We just showed that a tree is a 
connected, acyclic graph. The converse is also true; every connected, acyclic graph is a 
tree. The next theorem gives this characterization of trees as well as others. 


Let T be a graph with n vertices. The following are equivalent. 


(a) T is a tree. 

(b) T is connected and acyclic. 

(c) T is connected and has n — 1 edges. 
(d) T is acyclic and has n — 1 edges. 


Proof To show that (a)-(d) are equivalent, we will prove four results: If (a), 
then (b); if (b), then (c); if (c), then (d); and if (d), then (a). 

[If (a), then (b).] The proof of this result was given before the statement of the 
theorem. 

[If (b), then (c).] Suppose that T is connected and acyclic. We will prove that T 
has n — 1| edges by induction on n. 

If n= 1, T consists of one vertex and zero edges, so the result is true ifn = 1. 

Now suppose that the result holds for a connected, acyclic graph with n vertices. 
Let T be aconnected, acyclic graph with n+ | vertices. Choose a path P with no repeated 
edges of maximum length. Since T is acyclic, P contains no cycles. Therefore, P contains 
a vertex v of degree 1 (see Figure 9.2.4). Let T* be T with v and the edge incident on 
v removed. Then 7* is connected and acyclic, and because 7* contains n vertices, by 
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the inductive hypothesis 7* contains n — 1 edges. Therefore, T contains n edges. The 
inductive argument is complete and this portion of the proof is complete. 

[If (c), then (d).] Suppose that 7 is connected and has n — | edges. We must show 
that T is acyclic. 

Suppose that T contains at least one cycle. Since removing an edge from a cycle 
does not disconnect a graph, we may remove edges, but no vertices, from cycle(s) in T 
until the resulting graph 7* is connected and acyclic. Now 7T* is an acyclic, connected 
graph with n vertices. We may use our just proven result, (b) implies (c), to conclude 
that T* has n — 1 edges. But now T has more than n — | edges. This is a contradiction. 
Therefore, T is acyclic. This portion of the proof is complete. 

[If (d), then (a).] Suppose that 7 is acyclic and has n — 1 edges. We must show that 
T is a tree, that is, that T is a simple graph and that 7 has a unique simple path from any 
vertex to any other vertex. 

The graph 7 cannot contain any loops because loops are cycles and T is acyclic. 
Similarly, T cannot contain distinct edges e; and e2 incident on v and w because we 
would then have the cycle (v, e1, w, é2, v). Therefore, T is a simple graph. 

Suppose, by way of contradiction, that T is not connected (see Figure 9.2.5). Let 
T,, Tn, ..., Ty be the components of T: Since T is not connected, k > 1. Suppose that 7; 
has n; vertices. Each T; is connected and acyclic, so we may use our previously proven 
result, (b) implies (c), to conclude that 7; has n; — 1 edges. Now 


n-1l=(m—-1)+(m-1)+---+(m%-—1) (counting edges) 
< (ny tng +--- +n) -1 (since k > 1) 


=n-l, (counting vertices) 


which is impossible. Therefore, T is connected. 


Nn vertices Np Vertices Ng vertices 
n,— 1 edges nz — 1 edges ng — 1 edges 


n1+n2+-+++Nnx total vertices 
(n1 — 1)+(n2 — 1)+-+-+(nx — 1) total edges 


Figure 9.2.5 The proof of Theorem 9.2.3 [if (d), then (a)]. 
The 7; are components of T: T; has n; vertices and n; — | edges. 
A contradiction results from the fact that the total number of 
edges must equal n — 1. 


Suppose that there are distinct simple paths P; and P> from a to b in T (see Fig- 
ure 9.2.6). Let c be the first vertex after a on P, that is not in P>; let d be the ver- 
tex preceding c on P;; and let e be the first vertex after d on P, that is also on Po. 
Let (Vo, V1, ---  Vn—1, Vn) be the portion of P; from d = vo to e = vy. Let (Wo, W1,..-; 
Wm-—1, Wm) be the portion of P2 from d = wo to e = Wy». Now 


(vo, sees Vn = Wns Wm-1s ++ +s Ws Wo) (9.2.1) 


is acycle in T; which is a contradiction. [In fact, (9.2.1) is a simple cycle since no vertices 
are repeated except for vo and wo.] Thus there is a unique simple path from any vertex 
to any other vertex in 7: Therefore, T is a tree. This completes the proof. 
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Po 


Figure 9.2.6 The proof of Theorem 9.2.3 [if (d), then (a)]. P; 
(shown dashed) and P, (shown in color) are distinct simple paths 
from a to b. c is the first vertex after a on P| not in P3. d is the vertex 
preceding c on Pj. e is the first vertex after d on P, that is also on 
P». As shown, a cycle results, which gives a contradiction. 


9.2 Problem-Solving Tips 


This section introduces some useful terminology and provides several different charac- 
terizations of trees. If T is a graph with n vertices, the following are equivalent (Theo- 
rem 9.2.3): 


A 


(a) T is a tree. 

(b) If v and w are vertices in T, there is a unique simple path from v to w (definition 
of tree). 

(c) T is connected and acyclic. 

(d) T is connected and has n — | edges. 

(e) T is acyclic and has n — 1 edges. 


You can use the preceding characterizations in various ways. For example, a graph with 
four edges and six vertices cannot be a tree because it violates parts (d) and (e). Further- 
more, the graph is either not connected or contains a cycle. (If it is both connected and 
acyclic, it would be a tree and thus have five edges.) A connected graph with n vertices 
and more than n— 1 edges must contain a cycle. (If it were acyclic, it would be a tree and, 
therefore, have n — | edges.) 


9.2 Review Exercises 


1. Define parent in a rooted tree. 5. Define internal vertex in a rooted tree. 

2. Define descendant in a rooted tree. 6. Define acyclic graph. 

3. Define sibling in a rooted tree. 7. Give alternative characterizations of trees. 
4. Define terminal vertex in a rooted tree. 


Answer the questions in Exercises 1-6 for the tree in Figure 9.2.1. 4. Find the descendants of Zeus. 
1. Find the parent of Poseidon. 5. Find the siblings of Ares. 
2. Find the children of Uranus. 6. Draw the subtree rooted at Aphrodite. 


3. Find the ancestors of Eros. 
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Answer the questions in Exercises 7-15 for the following tree. 


7. Find the parents of c and of h. 
8. Find the children of d and of e. 


9. Find the ancestors of c and of j. 


10. 
11. 
12. 
13. 
14. 
15. 
16. 


1. 


18. 


19. 


20. 


21. 


Find the descendants of c and of e. 
Find the terminal vertices. 

Find the siblings of f and of h. 
Find the internal vertices. 

Draw the subtree rooted at e. 
Draw the subtree rooted at j. 


Answer the questions in Exercises 7—15 for the following tree. 


What can you say about two vertices in a rooted tree that have 
the same parent? 


What can you say about a vertex in a rooted tree that has no 
ancestors? 


What can you say about two vertices in a rooted tree that have 
the same ancestors? 


What can you say about a vertex in a rooted tree that has no 
descendants? 


What can you say about two vertices in a rooted tree that have 
a descendant in common? 


Problem-Solving Corner 


Problem 


Let T be a simple graph. Prove that T is a tree if and 
only if T is connected but the removal of any edge (but 
no vertices) from T disconnects 7: 


In Exercises 22-26, draw a graph having the given properties or 
explain why no such graph exists. 

22. Six edges; eight vertices 

23. Tree; all vertices of degree 2 

24. Acyclic; four edges, six vertices 

25. Tree; six vertices having degrees 1, 1, 1, 1, 3,3 

26. Tree; four internal vertices; six terminal vertices 


27. Explain why if we allow cycles of length 0, a graph consisting 
of a single vertex and no edges is not acyclic. 


28. The connected graph shown has a unique simple path from any 
vertex to any other vertex, but it is not a tree. Explain. 


ae v2 
val 


29. Explain why if we allow cycles to repeat edges, a graph con- 
sisting of a single edge and two vertices is not acyclic. 


A forest is a simple graph with no cycles. 


30. Explain why a forest is a union of trees. 
31. Ifa forest F consists of m trees and has n vertices, how many 
edges does F have? 


32. If P; = (vo, ...; Vp) and Pz = (Wo, ..-, Wm) are distinct sim- 
ple paths from a to b in a simple graph G, is 


(V0, «++. Vn = Wms Wm-1, +++ W1, WO) 


necessarily a cycle? Explain. (This exercise is relevant to the 
last paragraph of the proof of Theorem 9.2.3.) 


33. Show that a graph G with n vertices and fewer than n— 1 edges 
is not connected. 


*34. Prove that T is a tree if and only if T is connected and when 
an edge is added between any two vertices, exactly one cycle 
is created. 


35. Show that if G is a tree, every vertex of degree 2 or more is 
an articulation point. (“Articulation point” is defined before 
Exercise 56, Section 8.2.) 


36. Give an example to show that the converse of Exercise 35 is 
false, even if G is assumed to be connected. 


Attacking the Problem 


Let’s be clear about what we have to prove. Since 
the statement is “if and only if,’ we must prove two 
statements: 


If T is a tree, then T is connected but the removal of 
any edge (but no vertices) from T disconnects T. (1) 


If T is connected but the removal of any edge (but no 
vertices) from T disconnects T, then JT isatree. (2) 


In (1), from the assumption that T is a tree, we must de- 
duce that T is connected but the removal of any edge 
(but no vertices) from T disconnects 7: In (2), from the 
assumption that T is connected but the removal of any 
edge (but no vertices) from T disconnects 7; we must 
deduce that T is a tree. 

In developing a proof, it is usually helpful to 
review definitions and other results related to the state- 
ment to be proved. Here of direct relevance are the 
definition of a tree and Theorem 9.2.3, which gives 
equivalent conditions for a graph to be a tree. 

Definition 9.1.1 states: 


A tree T is a simple graph satisfying the following: 
If v and w are vertices in 7, there is a unique simple 
path from v to w. (3) 


Theorem 9.2.3 states that the following are 
equivalent for an n-vertex graph T: 


T is a tree. (4) 
T is connected and acyclic. (5) 
T is connected and has n — | edges. (6) 
T is acyclic and has n — | edges. (7) 


Finding a Solution 


Let’s first try to prove (1). We assume that T is a tree. 
We must prove two things: T is connected, and the 
removal of any edge (but no vertices) from T discon- 
nects 7: 

Statements (5) and (6) immediately tell us that T 
is connected. None of the statements (3) through (7) 
directly tells us anything about removal of edges or 
about a graph that’s not connected. However, if we ar- 
gue by contradiction and assume that the removal of 
some edge (but no vertices) from T does not disconnect 
T, then when we remove that edge from T, the graph 7’ 
that results is connected. In this case, for the graph 7’, 
(5) is true, but (6) and (7) are false, which is a contra- 
diction since either (5), (6), and (7) are all true (and the 
graph is a tree), or (5) , (6) , and (7) are all false (and 
the graph is a not a tree). 
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Now consider proving (2). We assume that T is 
connected and that the removal of any edge (but no ver- 
tices) from T disconnects 7: We must show that T is a 
tree. Let’s try to show that T is connected and acyclic. 
We can then appeal to (5) to conclude that T is a tree. 

Since T is connected, all we have to do is show 
that T is acyclic. Again, we’ll approach this by contra- 
diction. Suppose that T has a cycle. Keeping in mind 
what we’re assuming (the removal of any edge from T 
disconnects 7), try to figure out how to deduce a con- 
tradiction from the assumption that T contains a cycle 
before reading on. 

If we delete an edge from T’s cycle, T will remain 
connected. This contradiction shows that T is acyclic. 
By (5), T is a tree. 


Formal Solution 


Suppose that T has n vertices. 

Suppose that T is a tree. Then by Theorem 9.2.3, 
T is connected and has n — 1 edges. Suppose that 
we can remove an edge from T to obtain 7” so that 
T’ is connected. Since T contains no cycles, T’ also 
contains no cycles. By Theorem 9.2.3, 7” is a tree. 
Again by Theorem 9.2.3, T’ has n — 1 edges. This 
is a contradiction. Therefore, T is connected, but 
the removal of any edge (but no vertices) from T 
disconnects T. 

If T is connected and the removal of any edge (but 
no vertices) from T disconnects T, then T contains no 
cycles. By Theorem 9.2.3, T is a tree. 


Summary of Problem-Solving Techniques 


m= When trying to construct a proof, write out 
carefully what is assumed and what is to be 
proved. 


m= When trying to construct a proof, consider using 
closely related definitions and theorems. 


m When trying to construct a proof, review the 
proofs of similar and related theorems. 


= If none of the conditions of potentially useful 
definitions and theorems applies, try proof by 
contradiction. When you assume the negation 
of the hypotheses, additional statements become 
available that might make some of the conditions 
of the definitions and theorems apply. 
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9.3 Spanning Trees 


In this section we consider the problem of finding a subgraph T of a graph G such that T 
is a tree containing all of the vertices of G. Such a tree is called a spanning tree. We will 


Go Online : see that the methods of finding spanning trees may be applied to other problems as well. 
For more on spanning 
trees, see sires 
goo.gl/oc4uly Definition 9.3.1 > A tree T is a spanning tree of a graph G if T is a subgraph 
of G that contains all of the vertices of G. < 
Example 9.3.2 A spanning tree of the graph G of Figure 9.3.1 is shown in black. < 


Example 9.3.3 In general, a graph will have several spanning trees. Another spanning tree of the graph 
== Gof Figure 9.3.1 is shown in Figure 9.3.2. 


Figure 9.3.2 Another span- 
Figure 9.3.1 A graph anda ning tree (in black) of the 
spanning tree shown in black. graph of Figure 9.3.1. < 


Suppose that a graph G has a spanning tree T. Let a and b be vertices of G. Since 
a and b are also vertices in T and T is a tree, there is a path P from a to b. However, P 
also serves as a path from a to b in G; thus G is connected. The converse is also true. 


Theorem 9.3.4 A graph Ghas a spanning tree if and only if G is connected. 


Proof We have already shown that if G has a spanning tree, then G is connected. 
Suppose that G is connected. If G is acyclic, by Theorem 9.2.3, G is a tree. 

Suppose that G contains a cycle. We remove an edge (but no vertices) from this 
cycle. The graph produced is still connected. If it is acyclic, we stop. If it contains a cycle, 
we remove an edge from this cycle. Continuing in this way, we eventually produce an 
acyclic, connected subgraph 7: By Theorem 9.2.3, T is a tree. Since T contains all the 
vertices of G, T is a spanning tree of G. < 


An algorithm for finding a spanning tree based on the proof of Theorem 9.3.4 
would not be very efficient; it would involve the time-consuming process of finding cy- 
cles. We can do much better. We shall illustrate the first algorithm for finding a spanning 
tree by an example and then we will state the algorithm. 


Example 9.3.5 Find a spanning tree for the graph G of Figure 9.3.1. 


SOLUTION We will use a method called breadth-first search (Algorithm 9.3.6). The 
idea of breadth-first search is to process all the vertices on a given level before moving 
to the next-higher level. 


Go Online 


For more on breadth-first 
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Algorithm 9.3.6 
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First, select an ordering, say abcdefgh, of the vertices of G. Select the first vertex 
a and label it the root. Let T consist of the single vertex a and no edges. Add to T all 
edges (a, x) and vertices on which they are incident, for x = b to h, that do not produce a 
cycle when added to 7: We would add to T edges (a, b), (a, c), and (a, g). (We could use 
either of the parallel edges incident on a and g.) Repeat this procedure with the vertices 
on level 1 by examining each in order: 


b: Include (b, d). 
c: Include (c, e). 


g: None 
Repeat this procedure with the vertices on level 2: 


d: Include (d, f). 


e: None 
Repeat this procedure with the vertices on level 3: 
f: Include (f, h). 


Since no edges can be added to the single vertex h on level 4, the procedure ends. We 
have found the spanning tree shown in Figure 9.3.1. < 


We formalize the method of Example 9.3.5 as Algorithm 9.3.6. 


Breadth-First Search for a Spanning Tree 
This algorithm finds a spanning tree using the breadth-first search method. 


Input: A connected graph G with vertices ordered 
V1, V2,+++5Vn 


Output: A spanning tree T 


bfs(V, E) { 
// V = vertices ordered vj, ..., Vn; E = edges 
// V' = vertices of spanning tree T; E’ = edges of spanning tree T 
// yy is the root of the spanning tree 
// S is an ordered list 


S= (1) 
V = {vy} 
i SB) 


while (true) { 
for each x € S, in order, 
for each y € V — V’, in order, 
if ((x, y) is an edge) 
add edge (x, y) to E’ and y to V’ 
if (no edges were added) 
return T 
S = children of S ordered consistently with the original vertex ordering 


} 
} 
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Algorithm 9.3.7 


Example 9.3.8 


Exercise 20 is to prove that Algorithm 9.3.6 finds a spanning tree. 

Breadth-first search can be used to test whether an arbitrary graph G with n vertices 
is connected (see Exercise 30). We use the method of Algorithm 9.3.6 to produce a tree T. 
Then G is connected if and only if T has n vertices. 

Breadth-first search can also be used to find minimum-length paths in an un- 
weighted graph from a fixed vertex v to all other vertices (see Exercise 24). We use 
the method of Algorithm 9.3.6 to generate a spanning tree rooted at v. We note that the 
length of a shortest path from v to a vertex on level i of the spanning tree is i. Dijkstra’s 
shortest-path algorithm for weighted graphs (Algorithm 8.4.1) can be considered as a 
generalization of breadth-first search (see Exercise 25). 

An alternative to breadth-first search is depth-first search, which proceeds to suc- 
cessive levels in a tree at the earliest possible opportunity. 


Depth-First Search for a Spanning Tree 


This algorithm finds a spanning tree using the depth-first search method. 
Input: A connected graph G with vertices ordered 
V1, V2,+-+5Vn 
Output: A spanning tree T 
dfs(V, E) { 
// V' = vertices of spanning tree T; E’ = edges of spanning tree T 
// v, is the root of the spanning tree 


Vi = {vi} 
ji =o 
w=vV 


while (true) { 
while (there is an edge (w, v) that when added to T does not create a cycle 
in T) { 
choose the edge (w, v,) with minimum k that when added to T 
does not create a cycle in T 
add (w, vz) to E” 
add vy; to V’ 
W= Vz 
} 
if (w == vy) 
return T 
w = parent of w in T // backtrack 


Exercise 21 is to prove that Algorithm 9.3.7 finds a spanning tree. 


Use depth-first search (Algorithm 9.3.7) to find a spanning tree for the graph of 
Figure 9.3.2 with the vertex ordering abcdefgh. 


SOLUTION We select the first vertex a and call it the root (see Figure 9.3.2). Next, we 
add the edge (a, x), with minimal x, to our tree. In our case we add the edge (a, b). 

We repeat this process. We add the edges (b, d), (d, c), (c, e), (e, f), and (f, h). At 
this point, we cannot add an edge of the form (h, x), so we backtrack to the parent f of 
h and try to add an edge of the form (f, x). Again, we cannot add an edge of the form 
(f, x), so we backtrack to the parent e of f. This time we succeed in adding the edge 


Example 9.3.9 
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Algorithm 9.3.10 
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(e, g). At this point, no more edges can be added, so we finally backtrack to the root and 
the procedure ends. < 


Because of the line in Algorithm 9.3.7 where we retreat along an edge toward the 
initially chosen root, depth-first search is also called backtracking. In the following 
example, we use backtracking to solve a puzzle. 


Four-Queens Problem The four-queens problem is to place four tokens on a 4 x 4 grid 
so that no two tokens are on the same row, column, or diagonal. Construct a backtrack- 
ing algorithm to solve the four-queens problem. (To use chess terminology, this is the 
problem of placing four queens on a 4 x 4 board so that no queen attacks another queen.) 


SOLUTION The idea of the algorithm is to place tokens successively in the columns. 
When it is impossible to place a token in a column, we backtrack and adjust the token 
in the preceding column. < 


Solving the Four-Queens Problem Using Backtracking 


This algorithm uses backtracking to search for an arrangement of four tokens on a 
4 x 4 grid so that no two tokens are on the same row, column, or diagonal. 


Input: An array row of size 4 


Output: true, if there is a solution 
false, if there is no solution 
[If there is a solution, the kth queen is in column k, row row(k).] 


four_queens(row) { 
k = 1//start in column | 


// start in row 1 
// since row(k) is incremented prior to use, set row(1) to 0 
row(1) = 0 
while (k > 0) { 
row(k) = row(k) + 1 
// \ook for a legal move in column k 
while (row(k) < 4 A column k, row(k) conflicts) 
// try next row 
row(k) = row(k) + 1 
if (row(k) < 4) 
We (Us = 25) 
return true 
else { //next column 
k=k+1 
row(k) = 0 
} 
else // backtrack to previous column 
k=k—-1 
} 


return false // no solution 


The tree that Algorithm 9.3.10 generates is shown in Figure 9.3.3. The numbering 
indicates the order in which the vertices are generated. The solution is found at vertex 8. 
The n-queens problem is to place n tokens on ann x n grid so that no two tokens 
are on the same row, column, or diagonal. Checking that there is no solution to the 
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Figure 9.3.3 The tree generated by the backtracking algorithm 
(Algorithm 9.3.10) in the search for a solution to the four-queens 
problem. 


two- or three-queens problem (see Exercise 10) is straightforward. We have just seen that 
Algorithm 9.3.10 generates a solution to the four-queens problem. Many constructions 
have been given to generate solutions to the n-queens problem for all n > 4 (see, e.g., 
[Johnsonbaugh]). 

Backtracking or depth-first search is especially attractive in a problem such as that 
in Example 9.3.9, where all that is desired is one solution. Since a solution, if one exists, 
is found at a terminal vertex, by moving to the terminal vertices as rapidly as possible, 
in general we can avoid generating some unnecessary vertices. 


9.3 Problem-Solving Tips 


Depth-first search and breadth-first search are the basis of many graph algorithms. For 
example, either can be used to determine whether a graph is connected: If we can visit 
every vertex in a graph from an initial vertex, the graph is connected; otherwise, it is 
not connected (see Exercises 30 and 31). Depth-first search can be used as a searching 
algorithm, in which case it is called backtracking. In Algorithm 9.3.10, backtracking is 
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used to search for solutions to the four-queens problem. Backtracking can also be used 
to search for Hamiltonian cycles in a graph, to generate permutations, to solve Sudoku 
puzzles, and to determine whether two graphs are isomorphic. 


9.3 Review Exercises 


1. What is a spanning tree? 


nN 


. State a necessary and sufficient condition for a graph to have a 
spanning tree. 


eo 


. Explain how breadth-first search works. 


4. Explain how depth-first search works. 
5. What is backtracking? 


1. Use breadth-first search (Algorithm 9.3.6) with the vertex 
ordering hgfedcba to find a spanning tree for graph G of 
Figure 9.3.1. 


2. Use breadth-first search (Algorithm 9.3.6) with the vertex 
ordering chbgadfe to find a spanning tree for graph G of 
Figure 9.3.1. 


3. Use breadth-first search (Algorithm 9.3.6) with the vertex 
ordering hfdbgeca to find a spanning tree for graph G of 
Figure 9.3.1. 


4. Use depth-first search (Algorithm 9.3.7) with the vertex or- 
dering hgfedcba to find a spanning tree for graph G of 
Figure 9.3.1. 


5. Use depth-first search (Algorithm 9.3.7) with the vertex or- 
dering dhcbefag to find a spanning tree for graph G of 
Figure 9.3.1. 


6. Use depth-first search (Algorithm 9.3.7) with the vertex or- 
dering hfdbgeca to find a spanning tree for graph G of 
Figure 9.3.1. 


In Exercises 7-9, find a spanning tree for each graph. 


ie 


a b c d 
e 
f 8 7 
i j k 1 
8. 
a e h 
Cc 
d f 
b Ll 
&§ 


9. 


10. Show that there is no solution to the two-queens or the three- 
queens problem. 


11. Show all solutions to the four-queens problem. 


12. Find a solution to the five-queens and six-queens problems. 


13. Show all solutions to the five-queens problem in which one 
queen is in the first column, second row. 


14. How many solutions are there to the five-queens problem? 


15. Show all solutions to the six-queens problem in which one 
queen is in row 2, column 1, and a second queen is in row 4, 
column 2. 


16. True or false? If G is a connected graph and T is a spanning 
tree for G, there is an ordering of the vertices of G such that 
Algorithm 9.3.6 produces T as a spanning tree. If true, prove 
it; otherwise, give a counterexample. 


17. True or false? If G is a connected graph and T is a spanning 
tree for G, there is an ordering of the vertices of G such that 
Algorithm 9.3.7 produces T as a spanning tree. If true, prove 
it; otherwise, give a counterexample. 


18. Show, by an example, that Algorithm 9.3.6 can produce iden- 
tical spanning trees for a connected graph G from two distinct 
vertex orderings of G. 


19. Show, by an example, that Algorithm 9.3.7 can produce iden- 
tical spanning trees for a connected graph G from two distinct 
vertex orderings of G. 
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20. 
21. 
22. 


23. 


24. 


25. 


26. 
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Prove that Algorithm 9.3.6 is correct. 
Prove that Algorithm 9.3.7 is correct. 


Under what conditions is an edge in a connected graph G con- 
tained in every spanning tree of G? 
Let T and 7’ be two spanning trees of a connected graph G. 
Suppose that an edge x is in T but not in 7’. Show that there 
is an edge y in 7” but not in T such that (T — {x}) U {y} and 
(T’ — {y}) U {x} are spanning trees of G. 
Write an algorithm based on breadth-first search that finds the 
minimum length of each path in an unweighted graph from a 
fixed vertex v to all other vertices. 
Let G be a weighted graph in which the weight of each edge 
is a positive integer. Let G’ be the graph obtained from G by 
replacing each edge 

k 


in G of weight k by k unweighted edges in series: 


k edges 


Show that Dijkstra’s algorithm for finding the minimum length 
of each path in the weighted graph G from a fixed vertex v to 
all other vertices (Algorithm 8.4.1) and performing a breadth- 
first search in the unweighted graph G’ starting with vertex v 
are, in effect, the same process. 


Let T be a spanning tree for a graph G. Show that if an edge 
in G, but not in T, is added to 7; a unique cycle is produced. 


A cycle as described in Exercise 26 is called a fundamental cycle. 
The fundamental cycle matrix of a graph G has its rows indexed by 
the fundamental cycles of G relative to a spanning tree T for G and 
its columns indexed by the edges of G. The ijth entry is 1 if edge j 
is in the ith fundamental cycle and 0 otherwise. For example, the 
fundamental cycle matrix of the graph G of Figure 9.3.1 relative to 
the spanning tree shown in Figure 9.3.1 is 


e7 &6 €11 C10 2 e1 €3 C4 C5 eg eg 12 
(abdca) 100 00110000 1 
(efdbace)} 0 1 0 0 01 11010 1 
(ageca) 00 1 00000011 1 
(aga) 000 1000000 1 0 
(abga) 00 0 0110000 1 ~0 


Find the fundamental cycle matrix of each graph. The spanning 
tree to be used is drawn in black. 


27. 
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29. 


30. 


31. 


32. 


33. 


34. 


35. 


36. 


37. 


38. 


39. 
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Write a breadth-first search algorithm to test whether a graph 
is connected. 

Write a depth-first search algorithm to test whether a graph is 
connected. 

Write a depth-first search algorithm that finds all solutions to 
the four-queens problem. 

Modify Algorithm 9.3.6 so that it tracks the parent p of a vertex 
c (p is the parent of c if c was visited from p). 


Write an algorithm that uses the output of your algorithm in 
Exercise 33 to print each vertex and its parent. 

Modify Algorithm 9.3.7 so that it tracks the parent p of a vertex 
c (p is the parent of c if c was visited from p). 

Write an algorithm that uses the output of your algorithm in 
Exercise 35 to print each vertex and its parent. 

Write a backtracking algorithm that outputs all permutations 
Of 152) cag 

Write a backtracking algorithm that outputs all subsets of 
sf Bat errr | 

Sudoku is a puzzle in which the goal is to fill in a9 x 9 grid so 
that each of the numbers | through 9 appears in each column, 
each row, and each 3 x 3 box delineated by the heavy lines: 


As shown, in each puzzle some numbers are given. Solve the 
preceding Sudoku puzzle. 


40. Write a backtracking algorithm that solves an arbitrary Su- 


doku puzzle. 


41. The minimum-queens problem asks for the minimum number 


of queens that can attack all of the squares of ann x n board 
(i.e., the minimum number of queens such that each row, col- 
umn, and diagonal contains at least one queen). Write a back- 
tracking algorithm that determines whether k queens can at- 
tack all squares of ann x n board. 
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42. The subset-sum problem is: Given a set {c1, ..., Cn} of positive 
integers and a positive integer M, find all subsets {cx,,..., cx,} 
of {c1,..., Cn} satisfying 


j 
S Ck, = M. 
i=l 


Write a backtracking algorithm to solve the subset-sum 
problem. 


9.4 Minimal Spanning Trees 


The weighted graph G of Figure 9.4.1 shows six cities and the costs of building roads 
between certain pairs of cities. We want to build the lowest-cost road system that will 
connect the six cities. The solution can be represented by a subgraph. This subgraph 
must be a spanning tree since it must contain all the vertices (so that each city is in the 
road system), it must be connected (so that any city can be reached from any other), and 
it must have a unique simple path between each pair of vertices (since a graph containing 
multiple simple paths between a vertex pair could not represent a minimum-cost system). 
Thus what is needed is a spanning tree the sum of whose weights is a minimum. Such a 
tree is called a minimal spanning tree. 
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Figure 9.4.1 Six cities 1-6 
and the costs of building roads 
between certain pairs of them. 


Definition 9.4.1 > Let G be a weighted graph. A minimal spanning tree of G 
is a spanning tree of G with minimum weight. < 


Example 9.4.2 The tree 7’ shown in Figure 9.4.2 is a spanning tree for graph G of Figure 9.4.1. The 
———= _ weight of 7’ is 20. This tree is not a minimal spanning tree since spanning tree T shown 
in Figure 9.4.3 has weight 12. We will see later that T is a minimal spanning tree for G. 


5 6 


Figure 9.4.2 A spanning tree of weight 20 of 
the graph of Figure 9.4.1. 


Figure 9.4.3 A spanning tree of 
weight 12 of the graph of < 
Figure 9.4.1. 
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The algorithm to find a minimal spanning tree that we will discuss is known as 
Prim’s Algorithm (Algorithm 9.4.3). This algorithm builds a tree by iteratively adding 
edges until a minimal spanning tree is obtained. The algorithm begins with a single 
vertex. Then at each iteration, it adds to the current tree a minimum-weight edge that 
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does not complete a cycle. Another algorithm to find a minimal spanning tree, known as 
Kruskal’s Algorithm, is presented in the exercises (see Exercises 20-22). 


This algorithm finds a minimal spanning tree in a connected, weighted graph. 


Input: A connected, weighted graph with vertices 1,..., and start 
For a C program vertex s. If (i, 7) is an edge, w(i, j) is equal to the weight of (i, 7); 
implementing this if (7, 7) 1s not an edge, w(i, /) is equal to oo (a value greater than 
algorithm, see any actual weight). 
goo.gl/Ywc/7Sb Output: The set of edges EF in a minimal spanning tree (mst) 


prim(w,n, s) { 
// v(i) = 1 if vertex i has been added to mst 
// v(i) = 0 if vertex i has not been added to mst 


ik, fori=lton 
2) Va) =10 
// add start vertex to mst 
3h VCS) pall 
// begin with an empty edge set 
4. 15h = iB} 


// put n — | edges in the minimal spanning tree 
3); fori=1lton—1{ 
// add edge of minimum weight with one vertex in mst and one vertex 
// not in mst 


6. min = oo 
Te forj=1ton 
8. if (VY) == 1) // if] is a vertex in mst 
g), fork =1ton 
10. if (vik) == OA wj, k) < min) { 
iil. add_vertex = k 
122, aa (ak) 
iL}, min = w(j, k) 
14. } 
// put vertex and edge in mst 
ilps v(add_vertex) = 1 
16. EE eh 
ie } 
18. return E 
IS, |; 


Show how Prim’s Algorithm finds a minimal spanning tree for the graph of Figure 9.4.1. 
Assume that the start vertex s is 1. 


At line 3 we add vertex 1 to the minimal spanning tree. The first time we 
execute the for loop in lines 7-14, the edges with one vertex in the tree and one vertex 
not in the tree are 


Edge Weight 


(1, 2) 4 
(1, 3) 2 
(1, 5) 3 


Cc 


Figure 9.4.4 A graph that shows 
that selecting an edge having 
minimum weight incident on the 
most recently added vertex does 
not necessarily yield a shortest 
path. Starting at a, we obtain 

(a, c, z), but the shortest path from 
ato zis (a, b, z). 
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The edge (1, 3) with minimum weight is selected. At lines 15 and 16, vertex 3 is added 
to the minimal spanning tree and edge (1, 3) is added to E. 

The next time we execute the for loop in lines 7-14, the edges with one vertex in 
the tree and one vertex not in the tree are 


Edge Weight 
(1,2) 4 
(1, 5) 3 
(3, 4) 1 
(3, 5) 6 
(3, 6) 3 


The edge (3, 4) with minimum weight is selected. At lines 15 and 16, vertex 4 is added 
to the minimal spanning tree and edge (3, 4) is added to E. 

The next time we execute the for loop in lines 7-14, the edges with one vertex in 
the tree and one vertex not in the tree are 


Edge Weight 
(, 2) 4 
d, 5) 3 
(2, 4) 5 
(3, 5) 6 
(3, 6) 3 
(4, 6) 6 


This time two edges have minimum weight 3. A minimal spanning tree will be con- 
structed when either edge is selected. In this version, edge (1, 5) is selected. At lines 
15 and 16, vertex 5 is added to the minimal spanning tree and edge (1, 5) is added to E. 

The next time we execute the for loop in lines 7-14, the edges with one vertex in 
the tree and one vertex not in the tree are 


Edge Weight 
d, 2) 4 
(2, 4) 5 
(3, 6) 3 
(4, 6) 6 
(5, 6) 2 


The edge (5, 6) with minimum weight is selected. At lines 15 and 16, vertex 6 is added 
to the minimal spanning tree and edge (5, 6) is added to E. 

The last time we execute the for loop in lines 7-14, the edges with one vertex in 
the tree and one vertex not in the tree are 


Edge Weight 
(2) 4 
(2, 4) 5 


The edge (1, 2) with minimum weight is selected. At lines 15 and 16, vertex 2 is added 
to the minimal spanning tree and edge (1, 2) is added to E. The minimal spanning tree 
constructed is shown in Figure 9.4.3. < 


Prim’s Algorithm furnishes an example of a greedy algorithm. A greedy algo- 
rithm is an algorithm that optimizes the choice at each iteration. The principle can be 
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Theorem 9.4.5 


summarized as “doing the best locally.” In Prim’s Algorithm, since we want a minimal 
spanning tree, at each iteration we simply add an available edge with minimum weight. 

Optimizing at each iteration does not necessarily give an optimal solution to the 
original problem. We will show shortly (Theorem 9.4.5) that Prim’s Algorithm is correct; 
that is, we do obtain a minimal spanning tree. As an example of a greedy algorithm that 
does not lead to an optimal solution, consider a “shortest-path algorithm” in which at 
each step we select an available edge having minimum weight incident on the most 
recently added vertex. If we apply this algorithm to the weighted graph of Figure 9.4.4 
to find a shortest path from a to z, we would select the edge (a, c) and then the edge 
(c, z). Unfortunately, this is not the shortest path from a to z. 

We next show that Prim’s Algorithm is correct. 


Prim’s Algorithm (Algorithm 9.4.3) is correct; that is, at the termination of Algo- 
rithm 9.4.3, T is a minimal spanning tree. 


Proof We let 7; denote the graph constructed by Algorithm 9.4.3 after the ith iteration 
of the for loop, lines 5-17. More precisely, the edge set of T; is the set EF constructed after 
the ith iteration of the for loop, lines 5—17, and the vertex set of T; is the set of vertices on 
which the edges in E are incident. We let Ty be the graph constructed by Algorithm 9.4.3 
just before the for loop at line 5 is entered for the first time; J consists of the single vertex 
s and no edges. Subsequently in this proof, we suppress the vertex set and refer to a graph 
by specifying its edge set. 

By construction, at the termination of Algorithm 9.4.3, the resulting graph, T,,_1, 
is a connected, acyclic subgraph of the given graph G containing all the vertices of G; 
hence 7;,_; is a spanning tree of G. 

We use induction to show that for alli = 0,...,n—1, T; is contained in a minimal 
spanning tree. It then follows that at termination, 7,_; is a minimal spanning tree. 

If i = 0, Tp consists of a single vertex. In this case Tp is contained in every minimal 
spanning tree. We have verified the Basis Step. 

Next, assume that 7; is contained in a minimal spanning tree 7’. Let V be the set 
of vertices in 7;. Algorithm 9.4.3 selects an edge (j, k) of minimum weight, where j € V 
andk ¢ V, and adds it to T; to produce T;,,. If (7, k) is in T’, then T;,; is contained in the 
minimal spanning tree 7’. If (7, k) is not in T’, T’ U {(j, k)} contains a cycle C. Choose 
an edge (x, y) in C, different from (j,k), with x € V and y ¢ V. Then 


w(x, y) = w@, k). (9.4.1) 


Because of (9.4.1), the graph T” = [T’ U{(, k)}] — {@, y)} has weight less than or equal 
to the weight of 7’. Since T” is a spanning tree, T” is a minimal spanning tree. Since 
T;41 is contained in T”, the Inductive Step has been verified. The proof is complete. < 


Our version of Prim’s Algorithm examines @(n>) edges in the worst case (see 
Exercise 6) to find a minimal spanning tree for a graph having n vertices. It is possible 
(see Exercise 8) to implement Prim’s Algorithm so that only ©(n) edges are examined 
in the worst case. Since K,, has @(n?) edges, the latter version is optimal. 


9.4 Review Exercises 


1. What is a minimal spanning tree? 


3. What is a greedy algorithm? 


2. Explain how Prim’s Algorithm finds a minimal spanning tree. 
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9.4 Exercises 


In Exercises 1-5, find the minimal spanning tree given by Algo- 5. 
rithm 9.4.3 for each graph. 
a 
Ls 
1 5 
4 
9 
7 
13 14 9 15 5 16 
5 : 
6. Show that Algorithm 9.4.3 examines ©(n*) edges in the worst 
case. 
2. Exercises 7-9 refer to an alternate version of Prim’s Algorithm 
2 (Algorithm 9.4.6). 


Algorithm 9.4.6 


Alternate Version of Prim's Algorithm 

This algorithm finds a minimal spanning tree in a con- 
nected, weighted graph G. At each step, some vertices have 
temporary labels and some have permanent labels. The la- 
bel of vertex i is denoted L;. 


Input: A connected, weighted graph with vertices 
3: 1,..., and start vertex s. If (i, j) is an edge, 
w(i, j) is equal to the weight of (i, j); if (i,j) 
is not an edge, w(i, j) is equal to oo (a value 
greater than any actual weight). 


Output: A minimal spanning tree T 
prim_alternate(w, n, s) { 
let T be the graph with vertex s and no edges 
forj = 1 ton{ 
L; = w(s, j) // these labels are temporary 
back(j) = s 


4. } 
1 3 2 #4 3 L, =0 
make L, permanent 
while (temporary labels remain) { 
choose the smallest temporary label L; 
make L; permanent 
add edge (i, back(i)) to T 
add vertex i to T 
for each temporary label Ly 
if (WG, k) < Lx) { 
+ Ly = w(i, k) 
10 5: dt 2 D2 back(k) = i 


return T 
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7. 


8. 


10. 


11. 


12. 


13. 
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Show how Algorithm 9.4.6 finds a minimal spanning tree for 
the graphs of Exercises 1-5. 

Show that Algorithm 9.4.6 examines ©(n7) edges in the worst 
case. 


. Prove that Algorithm 9.4.6 is correct; that is, at the termination 


of Algorithm 9.4.6, T is a minimal spanning tree. 

Let G be a connected, weighted graph, let v be a vertex in G, 
and let e be an edge of minimum weight incident on v. Show 
that e is contained in some minimal spanning tree. 

Let G be a connected, weighted graph and let v be a vertex 
in G. Suppose that the weights of the edges incident on v are 
distinct. Let e be the edge of minimum weight incident on v. 
Must e be contained in every minimal spanning tree? 

Show that any algorithm that finds a minimal spanning tree in 
Ky, when all the weights are the same, must examine every 
edge in Ky. 

Show that if all weights in a connected graph G are distinct, G 
has a unique minimal spanning tree. 


In Exercises 14-16, decide if the statement is true or false. If the 
statement is true, prove it; otherwise, give a counterexample. In 
each exercise, G is a connected, weighted graph. 


14. 


15. 


16. 


17. 


«18. 


If all the weights in G are distinct, then distinct spanning trees 
of G have distinct weights. 

If e is an edge in G whose weight is less than the weight of 
every other edge, e is in every minimal spanning tree of G. 

If T is a minimal spanning tree of G, there is a labeling of the 
vertices of G so that Algorithm 9.4.3 produces T: 

Let G be a connected, weighted graph. Show that if, as long as 
possible, we remove an edge from G having maximum weight 
whose removal does not disconnect G, the result is a minimal 
spanning tree for G. 

Write an algorithm that finds a maximal spanning tree in a 
connected, weighted graph. 


19. Prove that your algorithm in Exercise 18 is correct. 
Go Online Kruskal’s Algorithm finds a minimal spanning 
For more on tree in a connected, weighted graph G having n 
Kruskal’s vertices as follows. The graph T initially consists 
Algorithm, of the vertices of G and no edges. At each 
see iteration, we add an edge e to T having minimum 


goo.gl/oc4uly 


20. 
21. 


22. 


23. 


weight that does not complete a cycle in T. When 
T has n — | edges, we stop. 


Formally state Kruskal’s Algorithm. 

Show how Kruskal’s Algorithm finds minimal spanning trees 
for the graphs of Exercises 1-S. 

Show that Kruskal’s Algorithm is correct; that is, at the termi- 
nation of Kruskal’s Algorithm, T is a minimal spanning tree. 
Let V be a set of n vertices and let s be a “dissimilar- 
ity function” on V x V (see Example 8.1.7). Let G be the 
complete, weighted graph having vertices V and weights 
w(vi, vj) =8(vi, vj). Modify Kruskal’s Algorithm so that it 
groups data into classes. This modification is known as the 
method of nearest neighbors (see [Gose]). 


Exercises 24-31 refer to the following situation. Suppose that we 
have stamps of various denominations and that we want to choose 


the minimum number of stamps to make a given amount of postage. 
Consider a greedy algorithm that selects stamps by choosing as 
many of the largest denomination as possible, then as many of the 
second-largest denomination as possible, and so on. 


24. 


«25, 


26. 


«27, 


«28. 


x29, 


«30. 


31. 


32. 


Show that if the available denominations are 1, 8, and 10 cents, 
the algorithm does not always produce the fewest number of 
stamps to make a given amount of postage. 

Show that if the available denominations are 1, 5, and 25 cents, 
the algorithm produces the fewest number of stamps to make 
any given amount of postage. 

Find positive integers a; and a2 such that a; > 2a2 > 1, a2 
does not divide a1, and the algorithm, with available denomi- 
nations 1, a), a2, does not always produce the fewest number 
of stamps to make a given amount of postage. 

Find positive integers a; and a2 such that a; > 2a2 > 1, a2 
does not divide a), and the algorithm, with available denom- 
inations 1, aj, a2, produces the fewest number of stamps to 
make any given amount of postage. Prove that your values do 
give an optimal solution. 

Suppose that the available denominations are 


l=a, < a2 <-++: <a. 


Show, by giving counterexamples, that the condition 


aj = 2aj-| — aj-2, 3<i<n, 


is neither necessary nor sufficient for the greedy algorithm to 
be optimal for all amounts of postage. 

Suppose that the available denominations are 1 = a, < 
a2 < a3. Prove that the greedy algorithm is not optimal for 
all amounts of postage if and only if there is an integer k sat- 
isfying (k — l)az < a3 < k(az — 1). 

Suppose that the available denominations are 


l=a, < a2 <-+++ < dm. 


Prove that if the greedy algorithm is optimal for all amounts of 
postage less than ad,,—1 + dm, then it is optimal for all amounts 
of postage. 

Show that the bound a,—,; + a» in Exercise 30 cannot be 
lowered. 

What is wrong with the following “proof” that the greedy al- 
gorithm is optimal for all amounts of postage for the denomi- 
nations 1, 5, and 6? 

We will prove that for all i > 1, the greedy algorithm is 
optimal for all amounts of postage n < 6i. The Basis Step is 
i = 1, which is true by inspection. 

For the Inductive Step, assume that the greedy algo- 
rithm is optimal for all amounts of postage n < 6i. We must 
show that the greedy algorithm is optimal for all amounts of 
postage n < 6(i + 1). We may assume that n > 6i. Now 
n— 6 < 6i, so by the inductive assumption, the greedy algo- 
rithm is optimal for n — 6. Suppose that the greedy algorithm 
chooses k stamps for n — 6. In determining the postage for 
the amount n, the greedy algorithm will first choose a 6-cent 
stamp and then stamps for n — 6 for a total of k + 1 stamps. 
These k + 1 stamps must be optimal or otherwise n — 6 would 
use less than k stamps. The Inductive Step is complete. 
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9.5 Binary Trees 


f 8 
Figure 9.5.1 A binary tree. 


Example 9.5.2 


Example 9.5.3 


Theorem 9.5.4 


Example 9.5.5 


Binary trees are among the most important special types of rooted trees. Every vertex 
in a binary tree has at most two children (see Figure 9.5.1). Moreover, each child is 
designated as either a left child or a right child. When a binary tree is drawn, a left 
child is drawn to the left and a right child is drawn to the right. The formal definition 
follows. 


Definition 9.5.1 » A binary tree is a rooted tree in which each vertex has either 
no children, one child, or two children. If a vertex has one child, that child is designated 
as either a left child or a right child (but not both). If a vertex has two children, one child 
is designated a left child and the other child is designated a right child. < 


In the binary tree of Figure 9.5.1, vertex b is the left child of vertex a and vertex c is the 
right child of vertex a. Vertex d is the right child of vertex b; vertex b has no left child. 
Vertex e is the left child of vertex c; vertex c has no right child. < 


A tree that defines a Huffman code is a binary tree. For example, in the Huffman coding 
tree of Figure 9.1.10, moving from a vertex to a left child corresponds to using the bit 1, 
and moving from a vertex to a right child corresponds to using the bit 0. < 


A full binary tree is a binary tree in which each vertex has either two children 
or zero children. A fundamental result about full binary trees is our next theorem. 


If T is a full binary tree with i internal vertices, then T has i+ | terminal vertices and 
2i + 1 total vertices. 


Proof The vertices of T consist of the vertices that are children (of some parent) and 
the vertices that are not children (of any parent). There is one nonchild—the root. Since 
there are 7 internal vertices, each having two children, there are 2i children. Thus the 
total number of vertices of T is 2i + 1, and the number of terminal vertices is 


(2i+1) -i=i+l. < 


A single-elimination tournament is a tournament in which a contestant is eliminated 
after one loss. The graph of a single-elimination tournament is a full binary tree (see 
Figure 9.5.2). The contestants’ names are listed on the left. Winners progress to the 
right. Eventually, there is a single winner at the root. If the number of contestants is 
not a power of 2, some contestants receive byes. In Figure 9.5.2, contestant 7 has a 
first-round bye. 

Show that if there are n contestants in a single-elimination tournament, a total of 
n — | matches are played. 


SOLUTION The number of contestants is the same as the number of terminal vertices, 
and the number of matches 7 is the same as the number of internal vertices. Thus, by 
Theorem 9.5.4, 


n+i=2i+1, 


so thati=n— 1. 
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Theorem 9.5.6 


Contestant 1 
Contestant 2 
Contestant 3 
Contestant 4 
Contestant 5 ‘om 4 : 
Winner 
Contestant 6 
Contestant 7 
Contestant m —1 a 
Contestant m 


Figure 9.5.2 The graph (full binary tree) of a single-elimination 
tournament. < 


Our next result about binary trees relates the number of terminal vertices to the 
height. 


If a binary tree of height h has ¢ terminal vertices, then 


Igt<h. (9.5.1) 


Proof We will prove the equivalent inequality 
eae (9.5.2) 


by induction on h. Inequality (9.5.1) is obtained from (9.5.2) by taking the logarithm to 
the base 2 of both sides of (9.5.2). 

If h = 0, the binary tree consists of a single vertex. In this case, t = 1 and thus 
(9.5.2) is true. 

Assume that the result holds for a binary tree whose height is less than h. Let T be 
a binary tree of height > 0 with f terminal vertices. Suppose first that the root of T has 
only one child. If we eliminate the root and the edge incident on the root, the resulting 
tree has height h — 1 and the same number of terminals as 7. By induction, t < 2’~!. 
Since 2'—! < 2", (9.5.2) is established for this case. 

Now suppose that the root of T has children v; and v2. Let 7; be the subtree rooted 
at v; and suppose that 7; has height h; and ¢; terminal vertices, i = 1, 2. By induction, 


t <2", i=1,2. (9.5.3) 


Example 9.5.7 


Example 9.5.9 
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The terminal vertices of T consist of the terminal vertices of T; and T>. Hence 
t=t +h. (9.5.4) 
Combining (9.5.3) and (9.5.4), we obtain 
t=tt+h <2 42% <2 14 or lao 


The inductive step has been verified and the proof is complete. < 


The binary tree in Figure 9.5.3 has height / = 3 and the number of terminals t = 8. For 
this tree, the inequality (9.5.1) becomes an equality. 


Figure 9.5.3 A binary tree of height h = 3 with 
t = 8 terminals. For this binary tree, lgt = h. 


< 


Suppose that we have a set S whose elements can be ordered. For example, if S 
consists of numbers, we can use ordinary ordering defined on numbers, and if S consists 
of strings of alphabetic characters, we can use lexicographic order. Binary trees are used 
extensively in computer science to store elements from an ordered set such as a set of 
numbers or a set of strings. If data item d(v) is stored in vertex v and data item d(w) is 
stored in vertex w, then if v is a left child (or right child) of w, some ordering relation- 
ship will be guaranteed to exist between d(v) and d(w). One example is a binary search 
tree. 


Definition 9.5.8 > A binary search tree is a binary tree T in which data are 
associated with the vertices. The data are arranged so that, for each vertex v in T, each 
data item in the left subtree of v is less than the data item in v, and each data item in the 
right subtree of v is greater than the data item in v. < 


The words 


OLD PROGRAMMERS NEVER DIE 
THEY JUST LOSE THEIR MEMORIES (9.5.5) 


may be placed in a binary search tree as shown in Figure 9.5.4. Notice that for any ver- 
tex v, each data item in the left subtree of v is less than (i.e., precedes alphabetically) 
the data item in v and each data item in the right subtree of v is greater than the data 
item in v. 
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OLD ] 
NEVER ] PROGRAMMERS ] 
DIE | THEY ] 
JUST ] THEIR ] 
LOSE ] T 
MEMORIES ] 
Figure 9.5.4 A binary search tree. < 


In general, there will be many ways to place data into a binary search tree. 
Figure 9.5.5 shows another binary search tree that stores the words (9.5.5). 

The binary search tree T of Figure 9.5.4 was constructed in the following way. 
We inspect each of the words (9.5.5) in the order in which they appear, OLD first, then 
PROGRAMMERS, then NEVER, and so on. To start, we create a vertex and place the first 
word OLD in this vertex. We designate this vertex the root. Thereafter, given a word in 
the list (9.5.5), we add a vertex v and an edge to the tree and place the word in the vertex 
v. To decide where to add the vertex and edge, we begin at the root. If the word to be 
added is less than (using lexicographic order) the word at the root, we move to the left 
child; if the word to be added is greater than the word at the root, we move to the right 
child. If there is no child, we create one, put in an edge incident on the root and new 
vertex, and place the word in the new vertex. If there is a child v, we repeat this process. 
That is, we compare the word to be added with the word at v and move to the left child 
of v if the word to be added is less than the word at v; otherwise, we move to the right 
child of v. If there is no child to move to, we create one, put in an edge incident on v 
and the new vertex, and place the word in the new vertex. If there is a child to move 
to, we repeat this process. Eventually, we place the word in the tree. We then get the 


NEVER ] 
JUST ] PROGRAMMERS ] 
DIE | LOSE ] OLD ] THEIR ] 
MEMORIES ] THEY | 


Figure 9.5.5 Another binary search tree that stores the same words as the tree in 
Figure 9.5.4, 
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next word in the list, compare it with the root, move left or right, compare it with the 
new vertex, move left or right, and so on, and eventually store it in the tree. In this way, 
we store all of the words in the tree and thus create a binary search tree. We formally 
state this method of constructing a binary search tree as Algorithm 9.5.10. 


This algorithm constructs a binary search tree. The input is read in the order submit- 
ted. After each word is read, it is inserted into the tree. 


Input: A sequence w1,...,W,» of distinct words and the length n of the 
sequence 


Output: A binary search tree T 


make_bin_search_tree(w, n) { 
let T be the tree with one vertex, root 
store w; in root 
fone — atom) 
v = root 
search = true // find spot for w; 
while (search) { 
Ss = word inv 
if (w; < s) 
if (v has no left child) { 
add a left child / to v 
store w; in | 
search = false // end search 
} 
else 
v = left child of v 
else // w; > 5 
if (v has no right child) { 
add a right child r to v 
store w; inr 
search = false // end search 


} 


else 
v = right child of v 
} // end while 
} // end for 


return T 


Binary search trees are useful for locating data. That is, given a data item D, we 
can easily determine if D is in a binary search tree and, if it is present, where it is located. 
To determine if a data item D is in a binary search tree, we would begin at the root. We 
would then repeatedly compare D with the data item at the current vertex. If D is equal 
to the data item at the current vertex, we have found D, so we stop. If D is less than the 
data item at the current vertex v, we move to v’s left child and repeat this process. If D 
is greater than the data item at the current vertex v, we move to v’s right child and repeat 
this process. If at any point the child to move to is missing, we conclude that D is not in 
the tree. (Exercise 6 asks for a formal statement of this process.) 
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2k 


T 


Figure 9.5.6 Expanding a binary 
search tree to a full binary tree. 


The time spent searching for an item in a binary search tree is longest when the 
item is not present and we follow a longest path from the root. Thus the maximum time 
to search for an item in a binary search tree is approximately proportional to the height 
of the tree. Therefore, if the height of a binary search tree is small, searching the tree 
will always be very fast (see Exercise 25). Many ways are known to minimize the height 
of a binary search tree (see, e.g., [Cormen]). 

We make more precise statements about worst-case searching in a binary search 
tree. Let T be a binary search tree with n vertices and let T* be the full binary tree obtained 
from T by adding left and right children to existing vertices in T wherever possible. In 
Figure 9.5.6, we show the full binary tree that results from modifying the binary search 
tree of Figure 9.5.4. The added vertices are drawn as boxes. An unsuccessful search 
in T corresponds to arriving at an added (box) vertex in T*. Let us define the worst- 
case time needed to execute the search procedure as the height h of the tree 7*. By 
Theorem 9.5.6, lgt < h, where t is the number of terminal vertices in 7*. The full 
binary tree T* has n internal vertices, so by Theorem 9.5.4, t = n+ 1. Thus in the 
worst case, the time will be equal to at least lg t = Ig(n + 1). Exercise 7 shows that if the 
height of T is minimized, the worst case requires time equal to [lg(n+ 1)]. For example, 
since 


[1g(2,000,000 + 1)] = 21, 


it is possible to store 2 million items in a binary search tree and find an item, or determine 
that it is not present, in at most 21 steps. 


9.5 Review Exercises 


_ 


nan & WwW WN 


. Define binary tree. 


minal vertices does T have? 


6. If T is a full binary tree with i internal vertices, how many total 
vertices does T have? 


. What is a left child in a binary tree? 
; . ated : 7. How is the height of a binary tree related to the number of its 
. What is a right child in a binary tree? tarminal veitices? 
- What is a full binary tree? 8. What is a binary search tree? 
. If T is a full binary tree with 7 internal vertices, how many ter- 9. Give an example of a binary search tree. 


10. Give an algorithm to construct a binary search tree. 


Exercises 1-4 concern n teams that play a single-elimination Scientists 
tournament. 
Whales 
1. After the teams are assigned, in how many ways can the tour- 
Whales 


9.5 Exercises 


nament unfold? For example, if there are three teams, Scien- 


tists, Whales, Pilots, assigned as 


Scientists 


Whales 


Pilots 


» Whales 
Pilots 


There are three other ways that the tournament can unfold: 
(a) Whales defeat Scientists; Pilots defeat Whales. 

(b) Scientist defeat Whales; Scientists defeat Pilots. 

(c) Scientist defeat Whales; Pilots defeat Scientists. 


Thus, if three teams play a single-elimination tournament, af- 
ter the teams are assigned, the tournament can unfold in four 


one way the tournament can unfold is ways. 


2. After so-called play-in games, the NCAA men’s basketball 
tournament is a 64-team single-elimination tournament. After 
the teams are assigned, in how many ways can the tourna- 
ment unfold? How many (base-10) digits does this number 
have? 


3. Suppose that after the teams are assigned in the NCAA men’s 
basketball tournament, someone randomly guesses how the 
tournament will unfold. What is the probability that the guess 
is correct? 


4. Is the value in Exercise 3 a good estimate of the chance that 
someone knowledgeable about basketball will successfully 
predict how the tournament will unfold? 


5. Place the words FOUR SCORE AND SEVEN YEARS AGO 
OUR FOREFATHERS BROUGHT FORTH, in the order in 
which they appear, in a binary search tree. 


6. Write a formal algorithm for searching in a binary search tree. 


7. Write an algorithm that stores n distinct words in a binary 
search tree T of minimal height. Show that the derived tree 
T*, as described in the text, has height [lg(n + 1)]. 


8. True or false? Let T be a binary tree. If for every vertex v 
in T the data item in v is greater than the data item in the 
left child of v and the data item in v is less than the data 
item in the right child of v, then T is a binary search tree. 
Explain. 


In Exercises 9-11, draw a graph having the given properties or 
explain why no such graph exists. 


9. Full binary tree; four internal vertices; five terminal vertices 
10. Full binary tree; height = 4; nine terminal vertices 
11. Full binary tree; height = 3; nine terminal vertices 


12. A full m-ary tree is a rooted tree such that every parent has m 
ordered children. If T is a full m-ary tree with i internal ver- 
tices, how many vertices does T have? How many terminal 
vertices does T have? Prove your results. 


13. Give an algorithm for constructing a full binary tree withn > 1 
terminal vertices. 


14. Give a recursive algorithm to insert a word in a binary search 
tree. 


15. Find the maximum height of a full binary tree having ¢ termi- 
nal vertices. 
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16. Write an algorithm that tests whether a binary tree in which 
data are stored in the vertices is a binary search tree. 


17. Let T be a full binary tree. Let J be the sum of the lengths of 
the simple paths from the root to the internal vertices. We call J 
the internal path length. Let E be the sum of the lengths of the 
simple paths from the root to the terminal vertices. We call E 
the external path length. Prove that if T has n internal vertices, 
then E = 1 + 2n. 


A binary tree T is balanced if for every vertex v in T, the heights of 
the left and right subtrees of v differ by at most 1. (Here the height 
of a “missing subtree” is defined to be —1.) 

State whether each tree in Exercises 18-21 is balanced or not. 


18. 19. 


d 


20. 21. 


g h 


In Exercises 22-24, Ny, is defined as the minimum number of ver- 
tices in a balanced binary tree of height h and f\, fr, ... denotes 
the Fibonacci sequence. 


22. Show that No = 1, N; = 2, and No = 4. 
23. Show that NV, = 1 + Nj; + Np_2, for h > 0. 
24. Show that NV, = f,+3 — 1, forh > 0. 


*25. Show that the height h of an n-vertex balanced binary tree 
satisfies h = O(lgn). This result shows that the worst-case 
time to search in an n-vertex balanced binary search tree is 
O(lgn). 

Prove that if a binary tree of height h has n > | vertices, then 
Ign < h+1. This result, together with Exercise 25, shows that 
the worst-case time to search in an n-vertex balanced binary 
search tree is O(lgn). 


x26. 


Tree Traversals 


Breadth-first search and depth-first search provide ways to “walk” a tree, that is, to 
traverse a tree in a systematic way so that each vertex is visited exactly once. In this 
section we consider three additional tree-traversal methods. We define these traversals 


recursively. 
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Algorithm 9.6.1 


Example 9.6.2 


Preorder Traversal 


This recursive algorithm processes the vertices of a binary tree using preorder 
traversal. 


Input: PT, the root of a binary tree, or the special value null to indicate 
that no tree is input 


Output: Dependent on how “process” is interpreted in line 3 


preorder(PT) { 

bt C2 == Hf) 
return 

process PT 
1 = left child of PT 
preorder‘(l) 
r = right child of PT 
preorder(r) 


Se ge Si 


Let us examine Algorithm 9.6.1 for some simple cases. If no tree is input (i.e., 
PT equals null), nothing is processed since, in this case, the algorithm simply returns 
at line 2. 

Suppose that the input consists of a tree with a single vertex. We set PT to the 
root and call preorder(PT). Since PT is not equal to null, we proceed to line 3, where 
we process the root. At line 5, we call preorder with PT equal to null since there is 
no left child. However, we just saw that when no tree is input to preorder, nothing is 
processed. Similarly at line 7, when no tree is input to preorder, again nothing is pro- 
cessed. Thus when the input consists of a tree with a single vertex, we process the root 
and return. 

Now suppose that the input is the tree of Figure 9.6.1. We set PT to the root and 
call preorder(PT). Since PT is not equal to null, we proceed to line 3, where we pro- 
cess the root. At line 5 we call preorder with PT equal to the left child of the root (see 
Figure 9.6.2). We just saw that if the tree input to preorder consists of a single vertex, 
preorder processes that vertex. Thus we next process vertex B. Similarly, at line 7, we 
process vertex C. Thus the vertices are processed in the order ABC. 


(A) PT 
@ © of 


Figure 9.6.1 Figure 9.6.2 
Input for At line 5 of 
Algorithm 9.6.1. Algorithm 9.6.1, 
where the input is 
the tree of 
Figure 9.6.1. 


In what order are the vertices of the tree of Figure 9.6.3 processed if preorder traversal 
is used? 


SOLUTION Following lines 3-7 (root/left/right) of Algorithm 9.6.1, the traversal pro- 
ceeds as shown in Figure 9.6.4. Thus the order of processing is ABCDEFGHIJ. 
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Root Left Right 
a (B) 6) 
Figure 9.6.3 A binary tree. Root Left Right Root Left Right 
Preorder is ABCDEFGHLJ. 
Inorder is CBDEAFIHJG. B s (©) a 
Postorder is CEDBIJHGFA. °B) 
Root Left Right 
Root Left Right 
A B Cc D E F G 
Root Left Right 
A B Cc D E F G H I J 
Figure 9.6.4 Preorder traversal of the tree in Figure 9.6.3. < 


Inorder traversal and postorder traversal are obtained by changing the position of 
line 3 (root) in Algorithm 9.6.1. “Pre,” “in,” and “post” refer to the position of the root 
in the traversal; that is, “preorder” means root first, “inorder” means root second, and 
“postorder” means root last. 


Algorithm 9.6.3 —_Inorder Traversal 
This recursive algorithm processes the vertices of a binary tree using inorder traversal. 


Input: PT, the root of a binary tree, or the special value null to indicate 
that no tree is input 


Output: Dependent on how “process” is interpreted in line 5 
inorder(PT) { 


il; iin (2ik—— 7D) 
2) return 
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Example 9.6.4 


Algorithm 9.6.5 


Example 9.6.6 


3h 1 = left child of PT 
4. inorder(1) 

aD process PT 

6. r = right child of PT 
ile inorder(r) 


In what order are the vertices of the binary tree of Figure 9.6.3 processed if inorder 
traversal is used? 


SOLUTION Following lines 3-7 (left/root/right) of Algorithm 9.6.3, we obtain the 
inorder listing CBDEAFIHJG. < 


Postorder Traversal 


This recursive algorithm processes the vertices of a binary tree using postorder 
traversal. 


Input: PT, the root of a binary tree, or the special value null to indicate 
that no tree is input 


Output: Dependent on how “process” is interpreted in line 7 


postorder(PT) { 
1 iin ee ——srn 11) 

2 return 

3), 1 = left child of PT 
4. postorder‘(1) 

5) r = right child of PT 
6 postorder(r) 

7 process PT 


In what order are the vertices of the binary tree of Figure 9.6.3 processed if postorder 
traversal is used? 


SOLUTION Following lines 3-7 (left/right/root) of Algorithm 9.6.5, we obtain the 
postorder listing CEDBIJHGFA. < 


Notice that preorder traversal may be obtained by following the route shown in 
Figure 9.6.5, and that reverse postorder traversal may be obtained by following the route 
shown in Figure 9.6.6. 

If data are stored in a binary search tree, as described in Section 9.5, inorder traver- 
sal will process the data in order, since the sequence left/root/right agrees with the or- 
dering of the data in the tree. 

In the remainder of this section we consider binary tree representations of arith- 
metic expressions. Such representations facilitate the computer evaluation of expressions. 

We will restrict our operators to +, —, *, and /. An example of an expression 
involving these operators is 


(A+B) *C—D/E. (9.6.1) 


A B 


Figure 9.6.7 The 
binary tree 
representation of the 
expression 

(A+ B)*C—D/E. 


Go Online 

For more on reverse 
Polish notation, see 
goo.gl/oc4uly 
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START END 


Figure 9.6.5 Preorder traversal. Figure 9.6.6 Reverse postorder 
traversal. 


This standard way of representing expressions is called the infix form of an expression. 
The variables A, B, C, D, and E are referred to as operands. The operators +, —, *, 
and / operate on pairs of operands or expressions. In the infix form of an expression, an 


operator appears between its operands. 
An expression such as (9.6.1) can be represented as a binary tree. The terminal 


vertices correspond to the operands, and the internal vertices correspond to the operators. 
The expression (9.6.1) would be represented as shown in Figure 9.6.7. In the binary tree 
representation of an expression, an operator operates on its left and right subtrees. For 
example, in the subtree whose root is / in Figure 9.6.7, the divide operator operates on 
the operands D and E; that is, D is to be divided by E. In the subtree whose root is « 
in Figure 9.6.7, the multiplication operator operates on the subtree headed by +, which 
itself represents an expression, and C. 

In a binary tree we distinguish the left and right subtrees of a vertex. The left and 
right subtrees of a vertex correspond to the left and right operands or expressions. This 
left/right distinction is important in expressions. For example, 4-6 and 6—4 are different. 

If we traverse the binary tree of Figure 9.6.7 using inorder, and insert a pair of 
parentheses for each operation, we obtain (((A + B) * C) — (D/E)). This form of an 
expression is called the fully parenthesized form of the expression. In this form we 
do not need to specify which operations (such as multiplication) are to be performed 
before others (such as addition), since the parentheses unambiguously dictate the order 
of operations. 

If we traverse the tree of Figure 9.6.7 using postorder, we obtain AB+ C*DE/—. 
This form of the expression is called the postfix form of the expression (or reverse 
Polish notation). In postfix, the operator follows its operands. For example, the first 
three symbols AB-+ indicate that A and B are to be added. Advantages of the postfix 
form over the infix form are that in postfix no parentheses are needed and no conventions 
are necessary regarding the order of operations. The expression will be unambiguously 
evaluated. For these reasons and others, many compilers translate infix expressions to 
postfix form. Also, some calculators require expressions to be entered in postfix form. 

A third form of an expression can be obtained by applying preorder traversal 
to a binary tree representation of an expression. In this case, the result is called the 
prefix form of the expression (or Polish notation). As in postfix, no parentheses are 
needed and no conventions are necessary regarding the order of operations. The prefix 
form of (9.6.1), obtained by applying preorder traversal to the tree of Figure 9.6.7, is 
— * +ABC/DE. 
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9.6 Review Exercises 


. What is preorder traversal? 

. Give an algorithm to execute a preorder traversal. 
. What is inorder traversal? 

. Give an algorithm to execute an inorder traversal. 
. What is postorder traversal? 

. Give an algorithm to execute a postorder traversal. 


. What is the prefix form of an expression? 


Conran & WN = 


. What is an alternative name of the prefix form of an 
expression? 


9.6 Exercises 


9. What is the infix form of an expression? 
10. What is the postfix form of an expression? 


11. What is an alternative name of the postfix form of an 
expression? 


12. What advantages do prefix and postfix forms of expressions 
have over the infix form? 


13. Explain how a tree can be used to represent an expression. 


In Exercises 1-5, list the order in which the vertices are processed 
using preorder, inorder, and postorder traversal. 


1. 2. 


B) 
©) 


In Exercises 6-10, represent the expression as a binary tree and 
write the prefix and postfix forms of the expression. 

6. (A+B) *(C—D) 

7. (AxB+C xD) — (A/B-—(D+E)) 

8. (A—C)*D)/(A+ (B+D)) 

9. (A+B) * C+D) *£E) —((A+B) *C—D) 
10. (AxB-C/D+E£)+(A-B-C-—D#D)/A+B4+0 
In Exercises 11-15, represent the postfix expression as a binary 
tree and write the prefix form, the usual infix form, and the fully 
parenthesized infix form of the expression. 
11. AB+C— 
13. ABC+— 
15. AB+CDxEF /——Ax 


12. ABCD+x«/E— 
14. ABC**CDE+/— 


In Exercises 16-21, find the value of the postfix expression if A = 1, 
B=2,C =3,.and D=4, 


16. ABC+— 17. AB+CDxAA/——Bx 
18. AB+C— 19. ABCx*ABC++— 
20. ADBCDx—+x 21. ABABx+xDx« 


22. Show, by example, that distinct binary trees with vertices A, 
B, and C can have the same preorder listing ABC. 

23. Show that there is a unique binary tree with six vertices whose 
preorder vertex listing is ABCEFD and whose inorder vertex 
listing is ACFEBD. 


Write an algorithm that reconstructs the binary tree given its 
preorder and inorder vertex orderings. 


«24, 


25. Give examples of distinct binary trees, Bj and Bz, each with 
two vertices, with the preorder vertex listing of B; equal to the 
preorder listing of Bz and the postorder vertex listing of By 
equal to the postorder listing of Bo. 

26. Let P; and P2 be permutations of ABCDEF. Is there a binary 
tree with vertices A, B, C, D, E, and F whose preorder listing 
is P; and whose inorder listing is P2? Explain. 

27. Write a recursive algorithm that prints the contents of the ter- 
minal vertices of a binary tree from left to right. 


28. 


29. 


30. 


31. 
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Write a recursive algorithm that interchanges all left and right 
children of a binary tree. 


Write an algorithm that returns the number of terminal nodes 
in a binary tree. 

Write a recursive algorithm that initializes each vertex of a 
binary tree to the number of its descendants. 


Prove that the algorithm 


funnyorder(PT) { 

if (PT == null) 
return 

process PT 
r = right child of PT 
funnyorder(r) 
1 = left child of PT 
funnyorder(l) 

} 


visits the nodes in the reverse order of postorder. 


In Exercises 32 and 33, every expression involves only the operands 
A, B,...,Z and the operators +, —, *, /. 


«32. 


33. 


34. 


Give a necessary and sufficient condition for a string of sym- 
bols to be a valid postfix expression. 

Write an algorithm that, given the binary tree representation 
of an expression, outputs the fully parenthesized infix form of 
the expression. 

Write an algorithm that prints the characters and their codes 
given a Huffman coding tree (see Example 9.1.8). Assume that 
each terminal vertex stores a character and its frequency. 


9.7 


35. 


36. 


37. 


x38. 


39. 


40. 


41. 
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Use the following definitions in Exercises 35-40. 

Let G = (V,E) be a simple undirected graph. A ver- 
tex cover of G is a subset V' of V such that for each edge 
(v, w) € E, either v € V' orw € V’. The size of a vertex cover 
V’ is the number of vertices in V'. An optimal vertex cover is 
a vertex cover of minimum size. 

An edge disjoint set for G is a subset E’ of E such that 
for every pair of distinct edges ej = (v1,W1) and ey = 
(v2, W2) in E’, we have {vy, wi} N {v2, w2} = ©. 

Prove that for every n, there is a connected graph with n ver- 
tices that has a vertex cover of size 1. 


Could the size of an optimal vertex cover of a graph with n 
vertices equal n? Explain. 


Show that the size of an optimal vertex cover of the complete 
graph on n vertices isn — 1. 


Write an algorithm that finds an optimal vertex cover of a tree 
T = (V, E) whose worst-case time is ©(|E]). 


Show that if V’ is any vertex cover of a graph G and E’ is any 
edge disjoint set for G, then |E’| < |V’|. 

Give an example of a connected graph in which, for every ver- 
tex cover V’ and every edge disjoint set E’, we have |E’| < 
|V’|. Prove that your example has the required property. 


Show how a binary tree with n edges can be encoded as a 
string of n + | ones and n + | zeros where, reading from 
left to right, the number of zeros never exceeds the number 
of ones. Show that each such string represents a binary tree. 
Hint: Consider a preorder traversal of the binary tree in which 
a one means that an edge is present, and a zero means that an 
edge is absent. Add an extra one to the beginning of the string, 
and delete the last zero. 


Decision Trees and the Minimum Time 


for Sorting 


The binary tree of Figure 9.7.1 gives an algorithm for choosing a restaurant. Each internal 
vertex asks a question. If we begin at the root, answer each question, and follow the 
appropriate edge, we will eventually arrive at a terminal vertex that chooses a restaurant. 
Such a tree is called a decision tree. In this section we use decision trees to specify 
algorithms and to obtain lower bounds on the worst-case time for sorting as well as 
solving certain coin puzzles. We begin with coin puzzles. 


Example 9.7.1 


Five-Coins Puzzle Five coins are identical in appearance, but one coin is either heavier 
or lighter than the others, which all weigh the same. The problem is to identify the bad 


coin and determine whether it is heavier or lighter than the others using only a pan 
balance (see Figure 9.7.2), which compares the weights of two sets of coins. 

An algorithm to solve the puzzle is given in Figure 9.7.3 as a decision tree. The 
coins are labeled C;, Cr, C3, C4, Cs. As shown, we begin at the root and place coin C; 
in the left pan and coin C> in the right pan. An edge labeled “” means that the left side 
of the pan balance is heavier than the right side. Similarly, an edge labeled ‘*\ means 
that the right side of the pan balance is heavier than the left side, and an edge labeled -~- 
means that the two sides balance. For example, at the root when we compare C, with 
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Helmand Aurelio's Senkowski 
Pizza Home 


No Yes Bakery 
<= : te ; On Cafe @ 
ss ——— the Ba-Ba-Reeba! KAO 
Figure 9.7.2 A pan balance for baad 
comparing weights of coins. Figure 9.7.1 A decision tree. 


C2, if the left side is heavier than the right side, we know that either C, is the heavy coin 
or C» is the light coin. In this case, as shown in the decision tree, we next compare C| 
with Cs (which is known to be a good coin) and immediately determine whether the bad 
coin is C; or Cy and whether it is heavy or light. The terminal vertices give the solution. 
For example, when we compare C; with Cs and the pans balance, we follow the edge 
to the terminal vertex labeled Cy, L, which tells us that the bad coin is C and that it is 
lighter than the others. 


Coie $Cy.%: Ce, Cs,H  CyH C3L 


Figure 9.7.3 An algorithm to solve the five-coins puzzle. < 


If we define the worst-case time to solve a coin-weighing problem to be the number 
of weighings required in the worst case, it is easy to determine the worst-case time from 
the decision tree; the worst-case time is equal to the height of the tree. For example, the 
height of the decision tree of Figure 9.7.3 is 3, so the worst-case time for this algorithm 
is equal to 3. 


9.7 ® Decision Trees and the Minimum Time for Sorting 479 


We can use decision trees to show that the algorithm given in Figure 9.7.3 to solve 
the five-coins puzzle is optimal, that is, that no algorithm that solves the five-coins puzzle 
has a worst-case time less than 3. 

We argue by contradiction to show that no algorithm that solves the five-coins 
puzzle has a worst-case time less than 3. Suppose that there is an algorithm that solves 
the five-coins puzzle in the worst case in two or fewer weighings. The algorithm can be 
described by a decision tree, and since the worst-case time is 2 or less, the height of the 
decision tree is 2 or less. Since each internal vertex has at most three children, such a 
tree can have at most nine terminal vertices (see Figure 9.7.4). Now the terminal vertices 
correspond to possible outcomes. Thus a decision tree of height 2 or less can account 
for at most nine outcomes. But the five-coins puzzle has 10 outcomes: 


C,,L, C,H, C2, L, C,H, C3,L, 
C3, H, Gad; CyH, Cs, L, Cs, HH. 


This is a contradiction. Therefore, no algorithm that solves the five-coins puzzle has a 
worst-case time less than 3, and the algorithm of Figure 9.7.3 is optimal. 


9 outcomes 


Figure 9.7.4 A five-coins puzzle algorithm that uses at most two weighings. 


We have seen how a decision tree can be used to give a lower bound for the worst- 
case time to solve a problem. Sometimes, the lower bound is unattainable. 

Consider the four-coins puzzle (all the rules are the same as for the five-coins 
puzzle except that the number of coins is reduced by one). Since there are now eight 
outcomes rather than 10, we can conclude that any algorithm to solve the four-coins 
puzzle requires at least two weighings in the worst case. (This time we cannot conclude 
that at least three weighings are required in the worst case.) However, closer inspection 
shows that, in fact, three weighings are required. 

The first weighing either compares two coins against two coins or one coin against 
one coin. Figure 9.7.5 shows that if we begin by comparing two coins against two coins, 
the decision tree can account for at most six outcomes. Since there are eight outcomes, 
no algorithm that begins by comparing two coins against two coins can solve the problem 
in two weighings or less in the worst case. Similarly, Figure 9.7.6 shows that if we begin 
by comparing one coin against one coin and the coins balance, the decision tree can 
account for only three outcomes. Since four outcomes are possible after identifying two 
good coins, no algorithm that begins by comparing one coin against one coin can solve 
the problem in two weighings or less in the worst case. Therefore, any algorithm that 
solves the four-coins puzzle requires at least three weighings in the worst case. 

If we modify the four-coins puzzle by requiring only that we identify the bad coin 
(without determining whether it is heavy or light), we can solve the puzzle in two weigh- 
ings in the worst case (see Exercise 1). 

We turn now to sorting. We can use decision trees to estimate the worst-case time 
to sort. 
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3 outcomes 


Figure 9.7.6 A four-coins 
puzzle algorithm that begins by 
comparing one coin against one 
coin. 


Example 9.7.2 


6 outcomes 


Figure 9.7.5 A four-coins puzzle algorithm that begins by comparing two coins against 
two coins. 


The sorting problem is easily described: Given n items x1, ..., X», arrange them in 
nondecreasing (or nonincreasing) order. We restrict our attention to sorting algorithms 
that repeatedly compare two elements and, based on the result of the comparison, modify 
the original list. 


An algorithm to sort a1, a2, a3 is given by the decision tree of Figure 9.7.7. Each edge is 
labeled with the arrangement of the list based on the answer to the question at an internal 
vertex. The terminal vertices give the sorted order. 


Yes 


41, 42, a3 a7, a4, 43 


Yes 


No No 


a1, 43, a2 42, 43, ay 


a1, 42, a3 42,41, 43 
Yes 


a1, 43, a2 43, 41, a2 a2, 43, a) 43,42, ay 


Figure 9.7.7 An algorithm to sort a, ay, a3. 


Let us define the worst-case time to sort to be the number of comparisons in the 
worst case. Just as in the case of the decision trees that solve coin puzzle problems, the 
height of a decision tree that solves a sorting problem is equal to the worst-case time. For 
example, the worst-case time for the algorithm given by the decision tree of Figure 9.7.7 
is equal to 3. We show that this algorithm is optimal, that is, that no algorithm that sorts 
three items has a worst-case time less than 3. 

We argue by contradiction to show that no algorithm that sorts three items has 
a worst-case time less than 3. Suppose that there is an algorithm that sorts three items 
in the worst case in two or fewer comparisons. The algorithm can be described by a 
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4 outcomes 


Figure 9.7.8 A sorting algorithm that makes at most two comparisons. 


decision tree, and since the worst-case time is 2 or less, the height of the decision tree 
is 2 or less. Since each internal vertex has at most two children, such a tree can have at 
most four terminal vertices (see Figure 9.7.8). Now the terminal vertices correspond to 
possible outcomes. Thus a decision tree of height 2 or less can account for at most four 
outcomes. But the problem of sorting three items has six possible outcomes (when the 
items are distinct), corresponding to the 3! = 6 ways that three items can be arranged: 


51, 52,53, $1,893,582,  $2,81,83, $2,83,81, 53,581,582, 53,52, 51. 


This is a contradiction. Therefore, no algorithm that sorts three items has a worst-case 
time less than 3, and the algorithm of Figure 9.7.7 is optimal. < 


Since 4! = 24, there are 24 possible outcomes to the problem of sorting four items 
(when the items are distinct). To accommodate 24 terminal vertices, we must have a tree 
of height at least 5 (see Figure 9.7.9). Therefore, any algorithm that sorts four items 
requires at least five comparisons in the worst case. Exercise 9 is to give an algorithm 
that sorts four items using five comparisons in the worst case. 


Figure 9.7.9 Level compared with the maximum number of vertices in that level in a binary tree. 


Theorem 9.7.3 


The method of Example 9.7.2 can be used to give a lower bound on the number 
of comparisons required in the worst case to sort an arbitrary number of items. 


If f() is the number of comparisons needed to sort n items in the worst case by a 
sorting algorithm, then f(n) = Q(nlgn). 


Proof Let 7 be the decision tree that represents the algorithm for input of size n and 
let h denote the height of 7. Then the algorithm requires h comparisons in the worst 
case, SO 
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1. 
2. 
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The tree T has at least n! terminal vertices, so by Theorem 9.5.6, 


h=fi(n). (9.7.1) 
Ign! <h. (9.7.2) 

Example 4.3.9 shows that lgn! = O(nlgn); thus, for some positive constant C, 
Cn Ign < Ign! (9.7.3) 


for all but finitely many integers n. Combining (9.7.1) through (9.7.3), we obtain 


Cn Ign < f(n) 


for all but finitely many integers n. Therefore, 


f(@) = Q(nlgn). <4 


Theorem 7.3.10 states that merge sort (Algorithm 7.3.8) uses ©(n lg) compar- 
isons in the worst case and is, by Theorem 9.7.3, optimal. Many other sorting algorithms 
are known that also attain the optimal number © (n lg n) of comparisons; one, tournament 
sort, is described before Exercise 14. 


9.7 Review Exercises 


What is a decision tree? 


How is the height of a decision tree that represents an algorithm 
related to the worst-case time of the algorithm? 


ae 


Use decision trees to explain why worst-case sorting requires at 
least Q(nlgn) comparisons. 


1, 


Four coins are identical in appearance, but one coin is either 
heavier or lighter than the others, which all weigh the same. 
Draw a decision tree that gives an algorithm that identifies in 
at most two weighings the bad coin (but not necessarily de- 
termines whether it is heavier or lighter than the others) using 
only a pan balance. 


. Show that at least two weighings are required to solve the 


problem of Exercise 1. 


. Eight coins are identical in appearance, but one coin is either 


heavier or lighter than the others, which all weigh the same. 
Draw a decision tree that gives an algorithm that identifies in 
at most three weighings the bad coin and determines whether 
it is heavier or lighter than the others using only a pan balance. 


. Twelve coins are identical in appearance, but one coin is ei- 


ther heavier or lighter than the others, which all weigh the 
same. Draw a decision tree that gives an algorithm that iden- 
tifies in at most three weighings the bad coin and determines 
whether it is heavier or lighter than the others using only a pan 
balance. 


. What is wrong with the following argument, which suppos- 


edly shows that the twelve-coins puzzle requires at least four 
weighings in the worst case if we begin by weighing four coins 
against four coins? 


If we weigh four coins against four coins and they bal- 
ance, we must then determine the bad coin from the remain- 
ing four coins. But the discussion in this section showed that 
determining the bad coin from among four coins requires at 
least three weighings in the worst case. Therefore, in the worst 
case, if we begin by weighing four coins against four coins, the 
twelve-coins puzzle requires at least four weighings. 


. Thirteen coins are identical in appearance, but one coin is ei- 


ther heavier or lighter than the others, which all weigh the 
same. How many weighings in the worst case are required to 
find the bad coin and determine whether it is heavier or lighter 
than the others using only a pan balance? Prove your answer. 


. Solve Exercise 6 for the fourteen-coins puzzle. 
A 


3)/2, n => 2, coins are identical in appearance, but one 
coin is either heavier or lighter than the others, which all weigh 
the same. [Kurosaka] gave an algorithm to find the bad coin 
and determine whether it is heavier or lighter than the others 
using only a pan balance in n weighings in the worst case. 
Prove that the coin cannot be found and identified as heavy or 
light in fewer than n weighings. 


Exercises 9 and 10 concern the following variant of the coin- 
weighing problem. We are givenn coins, some of which are bad, but 
are otherwise identical in appearance. All of the good coins have 


the same weight. All of the bad coins also have the same weight, 
but they are lighter than the good coins. We assume that there is at 
least one bad coin and at least one good coin among the n coins. 
The task is to determine the number of bad coins. 


9. 


10. 


11. 


12. 


13. 


Show that at least log3(n — 1) weighings are necessary to de- 
termine the number of bad coins. 


Show how to determine the number of bad coins in at most 
n — | weighings. 
Give an algorithm that sorts four items using five comparisons 
in the worst case. 


Use decision trees to find a lower bound on the number of 
comparisons required to sort five items in the worst case. Give 
an algorithm that uses this number of comparisons to sort five 
items in the worst case. 


Use decision trees to find a lower bound on the number of 
comparisons required to sort six items in the worst case. Give 
an algorithm that uses this number of comparisons to sort six 
items in the worst case. 


Exercises 14 —20 refer to tournament sort. 


Tournament Sort. We are given a sequence s\,.. 
in nondecreasing order. 

We will build a binary tree with terminal vertices la- 
beled s\,..., 89x. An example is shown. 


60) 


., 59k to sort 


(40) (50) 
G) 4) (@) 69) 


30 1 12 40 3 9 35 50 


Working left to right, create a parent for each pair and 
label it with the maximum of the children. Continue in this way 


14. 
15. 


16. 


17. 
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until you reach the root. At this point, the largest value, m, has 
been found. 

To find the second-largest value, first pick a value v less 
than all the items in the sequence. Replace the terminal vertex 
w containing m with v. Relabel the vertices by following the 
path from w to the root, as shown. At this point, the second- 
largest value is found. Continue until the sequence is ordered. 


30 1 12 40 3 935 vDO 


Why is the name “tournament” appropriate? 


Draw the two trees that would be created after the preceding 
tree when tournament sort is applied. 

How many comparisons does tournament sort require to find 
the largest element? 


Show that any algorithm that finds the largest value among n 
items requires at least n — 1 comparisons. 


. How many comparisons does tournament sort require to find 


the second-largest element? 


. Write tournament sort as a formal algorithm. 


. Show that if n is a power of 2, tournament sort requires 


@(nlgn) comparisons. 


. Give an example of a real situation (like that of Figure 9.7.1) 


that can be modeled as a decision tree. Draw the decision tree. 


. Draw a decision tree that can be used to determine who must 


file a federal tax return. 


. Draw a decision tree that gives a reasonable strategy for play- 


ing blackjack (see, e.g., [Ainslie]). 


9.8 Isomorphisms of Trees 


In Section 8.6 we defined what it means for two graphs to be isomorphic. (You might 
want to review Section 8.6 before continuing.) In this section we discuss isomorphic 
trees, isomorphic rooted trees, and isomorphic binary trees. 

Corollary 8.6.5 states that simple graphs G; and G2 are isomorphic if and only if 
there is a one-to-one, onto function f from the vertex set of G; to the vertex set of G2 
that preserves the adjacency relation in the sense that vertices v; and v; are adjacent in 
G, if and only if the vertices f(v;) and f(v;) are adjacent in G. Since a (free) tree is a 
simple graph, trees 7; and 7 are isomorphic if and only if there is a one-to-one, onto 
function f from the vertex set of 7; to the vertex set of 7> that preserves the adjacency 
relation; that is, vertices v; and v; are adjacent in 7, if and only if the vertices f(v;) and 


(vj) are adjacent in 7). 
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Example 9.8.1 The function f from the vertex set of the tree 7; shown in Figure 9.8.1 to the vertex set 
——jr—_ of the tree 7, shown in Figure 9.8.2 defined by 


fO=1, fO=3, fO=2, fO™O=4, fe)=5 


is a one-to-one, onto function that preserves the adjacency relation. Thus the trees 7; 
and 7> are isomorphic. 


c 1 2 
a y d 3 
e 4 5 
T T2 
Figure 9.8.1 A tree. Figure 9.8.2 A tree 
isomorphic to the tree 
in Figure 9.8.1. < 


As in the case of graphs, we can show that two trees are not isomorphic if we can 
exhibit an invariant that the trees do not share. 


Example 9.8.2 The trees T, and T> of Figure 9.8.3 are not isomorphic because T> has a vertex (x) of 
—————= degree 3, but 7; does not have a vertex of degree 3. 


T; T2 


Figure 9.8.3 Nonisomorphic 
trees. T> has a vertex of degree 3, 
but T> does not. < 


We can show that there are three nonisomorphic trees with five vertices. The three 
nonisomorphic trees are shown in Figures 9.8.1 and 9.8.3. 


Theorem 9.8.3 __ There are three nonisomorphic trees with five vertices. 


Proof Weill give an argument to show that any tree with five vertices is isomorphic 
to one of the trees in Figure 9.8.1 or 9.8.3. 

If T is a tree with five vertices, by Theorem 9.2.3 T has four edges. If T had a vertex 
v of degree greater than 4, v would be incident on more than four edges. It follows that 
each vertex in T has degree at most 4. 

We will first find all nonisomorphic trees with five vertices in which the maximum 
vertex degree that occurs is 4. We will next find all nonisomorphic trees with five vertices 
in which the maximum vertex degree that occurs is 3, and so on. 

Let T be a tree with five vertices and suppose that T has a vertex v of degree 4. 
Then there are four edges incident on v and, because of Theorem 9.2.3, these are all the 
edges. It follows that in this case T is isomorphic to the tree in Figure 9.8.1. 

Suppose that T is a tree with five vertices and the maximum vertex degree that 
occurs is 3. Let v be a vertex of degree 3. Then v is incident on three edges, as shown in 
Figure 9.8.4. The fourth edge cannot be incident on v since then v would have degree 4. 
Thus the fourth edge is incident on one of v1, v2, or v3. Adding an edge incident on any 
of v1, V2, Or v3 gives a tree isomorphic to the tree 7) of Figure 9.8.3. 


Example 9.8.5 


Example 9.8.6 
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U2 v3 

v 
v1 VI v v2 Wi w2 
Figure 9.8.4 Vertex Figure 9.8.5 Vertex Figure 9.8.6 Adding 
v has degree 3. v has degree 2. a third edge to the 


graph of Figure 9.8.5. 


Now suppose that T is a tree with five vertices and the maximum vertex degree 
that occurs is 2. Let v be a vertex of degree 2. Then v is incident on two edges, as shown 
in Figure 9.8.5. A third edge cannot be incident on v; thus it must be incident on either 
Vv, Or v2. Adding the third edge gives the graph of Figure 9.8.6. For the same reason, the 
fourth edge cannot be incident on either of the vertices w; or w2 of Figure 9.8.6. Adding 
the last edge gives a tree isomorphic to the tree T; of Figure 9.8.3. 

Since a tree with five vertices must have a vertex of degree 2, we have found all 
nonisomorphic trees with five vertices. < 


For two rooted trees T; and T> to be isomorphic, there must be a one-to-one, onto 
function f from 7; to 72 that preserves the adjacency relation and that preserves the 
root. The latter condition means that f(root of T,;) = root of 7). The formal definition 
follows. 


Definition 9.8.4 >» Let 7, be a rooted tree with root r; and let T> be a rooted 
tree with root 72. The rooted trees T; and 7 are isomorphic if there is a one-to-one, onto 
function f from the vertex set of T; to the vertex set of T> satisfying the following: 


(a) Vertices v; and v; are adjacent in 7, if and only if the vertices f(v,;) and f(v;) are 
adjacent in 7>. 


(b) f(r) = re. 


We call the function f an isomorphism. < 


The rooted trees T; and 7> in Figure 9.8.7 are isomorphic. An isomorphism is 


fy) =, f (v2) = w3, £03) = Wa, f(v4) = Wo, 
f(vs) = wy, (V6) = We, S (v7) = ws. 


Uy WI 
U3, v4 W2 W4 

U2 W3 
U5 U6 U7 W5 W6 W7 

T) T2 


Figure 9.8.7 Isomorphic rooted 
trees. < 


The isomorphism of Example 9.8.5 is not unique. Can you find another isomor- 
phism of the rooted trees of Figure 9.8.7? 


The rooted trees 7, and 7 of Figure 9.8.8 are not isomorphic since the root of 7; has 
degree 3 but the root of 7 has degree 2. These trees are isomorphic as free trees. Each 
is isomorphic to the tree T> of Figure 9.8.3. 
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V1 wt 


v2 W3 
U3 v4 W2 


US W4 Ws5 
T) T2 


Figure 9.8.8 Nonisomorphic 
rooted trees. (The trees are 
isomorphic as free trees.) < 


Arguing as in the proof of Theorem 9.8.3, we can show that there are four non- 
isomorphic rooted trees with four vertices. 


There are four nonisomorphic rooted trees with four vertices. These four rooted trees 
are shown in Figure 9.8.9. 


(a) (b) (c) (d) 


Figure 9.8.9 The four nonisomorphic 
rooted trees with four vertices. 


Proof We first find all nonisomorphic rooted trees with four vertices in which the root 
has degree 3; we then find all nonisomorphic rooted trees with four vertices in which the 
root has degree 2; and so on. We note that the root of a rooted tree with four vertices 
cannot have degree greater than 3. 

A rooted tree with four vertices in which the root has degree 3 must be isomorphic 
to the tree in Figure 9.8.9(a). 

A rooted tree with four vertices in which the root has degree 2 must be isomorphic 
to the tree in Figure 9.8.9(b). 

Let T be a rooted tree with four vertices in which the root has degree 1. Then 
the root is incident on one edge. The two remaining edges may be added in one of two 
ways [see Figure 9.8.9(c) and (d)]. Therefore, all nonisomorphic rooted trees with four 
vertices are shown in Figure 9.8.9. < 


Binary trees are special kinds of rooted trees; thus an isomorphism of binary trees 
must preserve the adjacency relation and must preserve the roots. However, in binary 
trees a child is designated a left child or a right child. We require that an isomorphism 
of binary trees preserve the left and right children. The formal definition follows. 


Definition 9.8.8 » Let T, be a binary tree with root r, and let T> be a binary 
tree with root r2. The binary trees 7; and 7) are isomorphic if there is a one-to-one, onto 
function f from the vertex set of T; to the vertex set of T> satisfying the following: 


(a) Vertices v; and v; are adjacent in 7; if and only if the vertices f(v;) and f(v;) are 
adjacent in 7>. 


(b) f(r) = re. 
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(c) vis a left child of w in 7, if and only if f(v) is a left child of f(w) in T>. 
(d) vis aright child of w in 7; if and only if f(y) is a right child of f(w) in 7). 


We call the function f an isomorphism. < 


Example 9.8.9 The binary trees T; and T> in Figure 9.8.10 are isomorphic. The isomorphism is f(v;) = 
SS “fort = 13.4054: 


v1 Ww 
v2 w2 
V3 U4 W3 W4 
T) T2 
Figure 9.8.10 Isomorphic binary trees. 4 


Example 9.8.10 The binary trees 7; and T> in Figure 9.8.11 are not isomorphic. The root v; in 7; has a 
——e’’'——=_ right child, but the root w, in T> has no right child. 


ail Wi 
u2 w2 
V3 U4 W3 W4 
T) 1g 


Figure 9.8.11 Nonisomorphic binary trees. 
(The trees are isomorphic as rooted trees and 
as free trees.) < 


The trees 7; and T> in Figure 9.8.11 are isomorphic as rooted trees and as free 
trees. As rooted trees, either of the trees of Figure 9.8.11 is isomorphic to the rooted tree 
T of Figure 9.8.9(c). 

Arguing as in the proofs of Theorems 9.8.3 and 9.8.7, we can show that there are 
five nonisomorphic binary trees with three vertices. 


Theorem 9.8.11 There are five nonisomorphic binary trees with three vertices. These five binary trees 
are shown in Figure 9.8.12. 


(a) (b) (c) (d) (e) 


Figure 9.8.12 The five nonisomorphic binary trees 
with three vertices. 


Proof We first find all nonisomorphic binary trees with three vertices in which the 
root has degree 2. We then find all nonisomorphic binary trees with three vertices in 
which the root has degree 1. We note that the root of any binary tree cannot have degree 
greater than 2. 

A binary tree with three vertices in which the root has degree 2 must be isomorphic 
to the tree in Figure 9.8.12(a). In a binary tree with three vertices in which the root has 
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fo \ 


Figure 9.8.13 The two 
nonisomorphic binary 
trees with two vertices. 


degree 1, the root either has a left child and no right child or has a right child and no 
left child. If the root has a left child, the child itself has either a left or a right child. We 
obtain the two binary trees in Figure 9.8.12(b) and (c). Similarly, if the root has a right 
child, the child itself has either a left or a right child. We obtain the two binary trees in 
Figure 9.8.12(d) and (e). Therefore, all nonisomorphic binary trees with three vertices 
are shown in Figure 9.8.12. < 


If S is a set of trees of a particular type (e.g., S is a set of free trees or S is a set 
of rooted trees or S is a set of binary trees) and we define a relation R on S by the rule 
T,RT> if T; and Tz are isomorphic, then R is an equivalence relation. Each equivalence 
class consists of a set of mutually isomorphic trees. 

In Theorem 9.8.3 we showed that there are three nonisomorphic free trees having 
five vertices. In Theorem 9.8.7 we showed that there are four nonisomorphic rooted trees 
having four vertices. In Theorem 9.8.11 we showed that there are five nonisomorphic bi- 
nary trees having three vertices. You might have wondered if there are formulas for the 
number of nonisomorphic n-vertex trees of a particular type. There are formulas for the 
number of nonisomorphic n-vertex free trees, for the number of nonisomorphic n-vertex 
rooted trees, and for the number of nonisomorphic n-vertex binary trees. The formulas 
for the number of nonisomorphic free trees and for the number of nonisomorphic rooted 
trees with n vertices are quite complicated. Furthermore, the derivations of these for- 
mulas require techniques beyond those that we develop in this book. The formulas and 
proofs appear in [Deo, Sec. 10-3]. We derive a formula for the number of binary trees 
with n vertices. 


There are C,, nonisomorphic binary trees with n vertices where C,, = C(2n, n)/(n+1) 
is the nth Catalan number. 


Proof Let a, denote the number of binary trees with n vertices. For example, 
ay = 1 since there is one binary tree having no vertices; a; =1 since there is one bi- 
nary tree having one vertex; a2 = 2 since there are two binary trees having two vertices 
(see Figure 9.8.13); and a3; = 5 since there are five binary trees having three vertices 
(see Figure 9.8.12). 

We derive a recurrence relation for the sequence do, a), .... Consider the con- 
struction of a binary tree with n vertices, n > 0. One vertex must be the root. Since there 
are n — | vertices remaining, if the left subtree has k vertices, the right subtree must 
have n — k — 1 vertices. We construct an n-vertex binary tree whose left subtree has k 
vertices and whose right subtree has n — k — 1 vertices by a two-step process: Construct 
the left subtree, construct the right subtree. (Figure 9.8.14 shows this construction for 
n = 6andk = 2.) By the Multiplication Principle, this construction can be carried 
out in aga,_~-1 ways. Different values of k give distinct n-vertex binary trees, so by the 
Addition Principle, the total number of n-vertex binary trees is 


n—1 


S AkAn—-k-1+ 


k=0 
We obtain the recurrence relation 


n—-l 


an = S AKAn—k-1 n > 1. 
k=0 


But this recurrence relation and initial condition ag = 1 define the sequence of Cata- 
lan numbers (see Examples 6.2.22 and 7.1.7). Thus a, is equal to the Catalan number 
C(2n, n)/(n+ 1). < 
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Laven 


Two vertices 
in left subtree (d) (e) (f) (g) 


Three vertices in right subtree 


LOL LY ES 
MLM OSS 


Figure 9.8.14 The proof of Theorem 9.8.12 for the case n = 6 vertices and k = 2 vertices in the left subtree. 


When discussing graph isomorphisms in Section 8.6, we remarked that there is 
no efficient method known to decide whether two arbitrary graphs are isomorphic. The 
situation is different for trees. It is possible to determine in polynomial time whether 
two arbitrary trees are isomorphic. As a special case, we give a linear-time algorithm to 
determine whether two binary trees T; and T> are isomorphic. The algorithm is based on 
preorder traversal (see Section 9.6). We first check that each of 7, and T> is nonempty, 
after which we check that the left subtrees of T, and T> are isomorphic and that the right 
subtrees of 7, and 7 are isomorphic. 


Algorithm 9.8.13 — Testing Whether Two Binary Trees Are Isomorphic 


Input: The roots r; and rz of two binary trees. (If the first tree is empty, 7 


Go Online has the special value null. If the second tree is empty, rz has the 
For a C program special value null.) 
implementing this Output: true, if the trees are isomorphic 


algorithm, see 


; false, if the trees are not isomorphic 
goo.gl/Q1icMOj 


bin_tree_isom(ry, r2) { 


il, it ( —— null Ni ——n al) 
2, return true 
// now one or both of 7; or rz is not null 
3 1 qr —— Ul Varo —— nad) 
4, return false 


// now neither of r; or 72 is null 
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Ic_r; = left child of r; 

Ic_ry = left child of rz 

rc_r, = right child of r; 

rc_ry = right child of ry 

return bin_tree_isom(Ic_r,, lc_r2) A bin_tree_isom(rc_r|, rc_r2) 


SS Ed =) Cx Ya 


As a measure of the time required by Algorithm 9.8.13, we count the number of 
comparisons with null in lines 1 and 3. We show that Algorithm 9.8.13 is a linear-time 
algorithm in the worst case. 


The worst-case time of Algorithm 9.8.13 is ©(n), where n is the total number of 
vertices in the two trees. 


Proof Leta, denote the number of comparisons with null in the worst case required 
by Algorithm 9.8.13, where n is the total number of vertices in the trees input. We use 
mathematical induction to prove that 


a, < 3n+2 forn > 0. 


Basis Step (n=0) 

If n = 0, the trees input to Algorithm 9.8.13 are both empty. In this case, there are two 
comparisons with null at line 1, after which the procedure returns. Thus aj = 2 and the 
inequality holds when n = 0. 


Inductive Step 


Assume that a, < 3k + 2 when k < n. We must show that a, < 3n+ 2. 

We first find an upper bound for the number of comparisons in the worst case when 
the total number of vertices in the trees input to the procedure is n > 0 and neither tree 
is empty. In this case, there are four comparisons at lines 1 and 3. Let L denote the sum 
of the numbers of vertices in the two left subtrees of the trees input and let R denote 
the sum of the numbers of vertices in the two right subtrees of the trees input. Then at 
line 9 there are at most az + dg additional comparisons. Therefore, at most 4+ az + dp 
comparisons are required in the worst case. By the inductive assumption, 


ay < 3L + 2 and aR < 3R + 2. (9.8.1) 


Now 


2+L+R=n (9.8.2) 


because the vertices comprise the two roots, the vertices in the left subtrees, and the 
vertices in the right subtrees. Combining (9.8.1) and (9.8.2), we obtain 


4+a,p+ar<4+4+ BL42)4+ 3R42) =3(224+L4R)4+2=3n4+2. 


If either tree is empty, four comparisons are required at lines 1 and 3, after which 
the procedure returns. Thus, whether one of the trees is empty or not, at most 3n + 2 
comparisons are required in the worst case. Therefore, a, < 3n + 2, and the Inductive 
Step is complete. We conclude that the worst-case time of Algorithm 9.8.13 
is O(n). 
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If n is even, say n = 2k, one can use induction to show (see Exercise 24) that 
when two k-vertex isomorphic binary trees are input to Algorithm 9.8.13, the number of 
comparisons is equal to 3n + 2. Using this result, one can show (see Exercise 25) that 
if n is odd, say n = 2k + 1, when the two binary trees shown in Figure 9.8.15 are input 
to Algorithm 9.8.13, the number of comparisons is equal to 37 + 1. Thus the worst-case 
time of Algorithm 9.8.13 is Q(n). 


7 


k vertices k+1 vertices 


Figure 9.8.15 Two binary trees that 
give worst-case run time 3n + 1 for 
Algorithm 9.8.13 when n = 2k + 1 is 
odd. 


Since the worst-case time is O(n) and Q(n), the worst-case time of Algorithm 


9.8.13 is O(n). 


< 


[Aho] gives an algorithm whose worst-case time is linear in the number of ver- 
tices that determines whether two arbitrary (not necessarily binary) rooted trees are 


isomorphic. 


9.8 Review Exercises 


1. What does it mean for two free trees to be isomorphic? 
2. What does it mean for two rooted trees to be isomorphic? 


3. What does it mean for two binary trees to be isomorphic? 


4. How many n-vertex, nonisomorphic binary trees are there? 


5. Describe a linear-time algorithm to test whether two binary 
trees are isomorphic. 


9.8 Exercises 


In Exercises 1-6, determine whether each pair of free trees is iso- 
morphic. If the pair is isomorphic, specify an isomorphism. If the 
pair is not isomorphic, give an invariant that one tree satisfies but 
the other does not. 


1. 
Lal Wi W2 
U4 
v2 US W3 W4 
U3 
V6 W5 W6 


Ty T2 


2. 3. T as in Exercise 1 
v1 Wy 
wi ow 
v4 W> 1 W2 
V2 V3 
W3 W3 Ws 
U5 V6 wa W4 
W5 W6 W6 
T T2 T2 
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4. 
U1 Wi 
v2 V3 V4 US U6 W2 W3 W4 W5 W6 
T) T2 
5. 
Wil 
Vi. VY12 Wwi12 
T; Tz 
6. 
U1 Wi 
v2 U5 vw wW2 W6 
V3 U4 W3 W4 W5 
U7 Vil W7 Wil 
vg ug V10 wg W9 W10 
V12 W12 
T\ Tz 


In Exercises 7-9, determine whether each pair of rooted trees is 
isomorphic. If the pair is isomorphic, specify an isomorphism. If 
the pair is not isomorphic, give an invariant that one tree satis- 
fies but the other does not. Also, determine whether the trees are 
isomorphic as free trees. 


ts 
Vy W1 
U2 U4 W2 W4 
V3 W3 
Us W6 
V6 Ws 
U7 Vg W7 Ws 
T T2 
8. 
v1 Wt 
v2 w2 
U4 W4 
U3 W3 
US W5 
T, T2 


9. 


T, and T> as in Exercise 2 


In Exercises 10-12, determine whether each pair of binary trees 
is isomorphic. If the pair is isomorphic, specify an isomorphism. 
If the pair is not isomorphic, give an invariant that one tree satis- 
fies but the other does not. Also, determine whether the trees are 
isomorphic as free trees or as rooted trees. 


10. 
11. 


12. 


13. 
14. 
15. 
16. 
17. 
18. 
19. 
20. 


21. 


22. 


23. 


24. 


25. 


T, and T> as in Exercise 8 
V1 
Wi 
v2 
U4 W3 
U3 w2 
US W4 
U6 Ws W6 
Ty T2 
Vv’ Wi 
U3 W3 
v2 Ww2 
U4 W4 
US V6 W5 W6 
T1 T2 


Draw all nonisomorphic free trees having three vertices. 
Draw all nonisomorphic free trees having four vertices. 
Draw all nonisomorphic free trees having six vertices. 
Draw all nonisomorphic rooted trees having three vertices. 
Draw all nonisomorphic rooted trees having five vertices. 
Draw all nonisomorphic binary trees having two vertices. 
Draw all nonisomorphic binary trees having four vertices. 


Draw all nonisomorphic full binary trees having seven ver- 
tices. (A full binary tree is a binary tree in which each internal 
vertex has two children.) 


Draw all nonisomorphic full binary trees having nine vertices. 


Find a formula for the number of nonisomorphic n-vertex full 
binary trees. 


Find all nonisomorphic (as free trees and not as rooted trees) 
spanning trees for each graph in Exercises 7—9, Section 9.3. 


Use induction to show that when two k-vertex isomorphic bi- 
nary trees are input to Algorithm 9.8.13, the number of com- 
parisons with null is equal to 6k + 2. 


Show that when the two binary trees shown in Figure 9.8.15 
are input to Algorithm 9.8.13, the number of comparisons with 
null is equal to 6k + 4. 


26. Write an algorithm to generate an n-vertex random binary tree. 


In Exercises 27-33, Ci, C2, ... denotes the sequence of Catalan 
numbers. Let X, denote the set of nonisomorphic full binary trees 
having n terminal vertices, n > 2, and let Xz denote the set of 
nonisomorphic full binary trees having n + 1 terminal vertices, 
n > 1, with one terminal vertex designated as “marked.” 


27. Given an (n—1)-vertex binary tree T,n > 2, construct a binary 
tree from T by adding a left child to every vertex in T that does 
not have a left child, and adding a right child to every vertex in 
T that does not have a right child. (A terminal vertex will add 
both a left and right child.) Show that this mapping is one-to- 
one and onto from the set of all nonisomorphic (n — 1)-vertex 
binary trees to X;. Conclude that |X;| = C,—, for all n > 2. 


28. Show that |Xo| = (n + 1)C,, for alln > 1. 


Given a tree T € X,, for each vertex v in T, we construct two trees 
in X2 as follows. One tree in Xz is obtained by inserting two new 
children of v—one is a new left child, which is marked, and the 
other is the root of the original subtree in T rooted at v. The other 
tree in X> is obtained by inserting two new children of v—one is a 
new right child, which is marked, and the other is the root of the 
original subtree in T rooted at v. Let Xr denote the set of all such 
trees constructed. This construction is due to Ira Gessel and was 
forwarded to the author by Arthur Benjamin. 


29. Show that |X7| = 2(2n — 1) for all T € X1. 


9.9 Game Trees 


T 


30. 


31. 
32. 


33. 


34. 


35. 
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Show that if 7; and 72 are distinct trees in X,, then X7,) 
Xr, = ©. 


Show that Urex, Xp =X). 


Use Exercises 29-31 to show that (n+ 1)C, = 2(2n—1)Cy_1 
for all n > 2. Exercise 26, Section 7.1, asked for a proof of 
this identity using the explicit formula for C,,. These exercises 
show a way to prove the identity without using the explicit 
formula for C;,. 

Use Exercise 32 to give another derivation of the explicit for- 
mula for the nth Catalan number C,, = C(2n, n)/(n+ 1). (See 
also Example 6.2.22.) 


An ordered tree is a tree in which the order of the children is 
taken into account. For example, the ordered trees 


are not isomorphic. Show that the number of nonisomorphic 
ordered trees with n edges is equal to C,,, the nth Catalan num- 
ber. Hint: Consider a preorder traversal of an ordered tree in 
which | means down and 0 means up. 


[Project] Report on the formulas for the number of nonisomor- 
phic free trees and for the number of nonisomorphic rooted 
trees with n vertices (see [Deo]). 


Trees are useful in the analysis of games such as tic-tac-toe, chess, and checkers, in which 


Go Online 


For more on game trees, see 


goo.gl/oc4uly computers. 


players alternate moves. In this section we show how trees can be used to develop game- 
playing strategies. This kind of approach is used in the development of many computer 
programs that allow human beings to play against computers or even computers against 


As an example of the general approach, consider a version of the game of nim. 
Initially, there are n piles, each containing a number of identical tokens. Players alternate 
moves. A move consists of removing one or more tokens from any one pile. The player 
who removes the last token loses. As a specific case, consider an initial distribution 
consisting of two piles: one containing three tokens and one containing two tokens. All 
possible move sequences can be listed in a game tree (see Figure 9.9.1). The first player 
is represented by a box and the second player is represented by a circle. Each vertex 
shows a particular position in the game. In our game, the initial position is shown as 
eae A path represents a sequence of moves. If a position is shown in a square, it is the 
first player’s move; if a position is shown in a circle, it is the second player’s move. A 
terminal vertex represents the end of the game. In nim, if the terminal vertex is a circle, 
the first player removed the last token and lost the game. If the terminal vertex is a box, 


the second player lost. 


The analysis begins with the terminal vertices. We label each terminal vertex with 
the value of the position to the first player. If the terminal vertex is a circle, since the 
first player lost, this position is worthless to the first player and we assign it the value 0 
(see Figure 9.9.2). If the terminal vertex is a box, since the first player won, this position 


iThis section can be omitted without loss of continuity. 
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Figure 9.9.2 The game tree of Figure 9.9.1 showing the values of all vertices. 


Example 9.9.1 
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is valuable to the first player and we label it with a value greater than 0, say 1 (see 
Figure 9.9.2). At this point, all terminal vertices have been assigned values. 

Now, consider the problem of assigning values to the internal vertices. Suppose, 
for example, that we have an internal box, all of whose children have been assigned 
a value. For example, if we have the situation shown in Figure 9.9.3, the first player 
(box) should move to the position represented by vertex B, since this position is the 
most valuable. In other words, box moves to a position represented by a child with the 
maximum value. We assign this maximum value to the box vertex. 


bd 
A/B Cc 
QOOO® 


Figure 9.9.3 The first player Figure 9.9.4 The second player 
(box) should move to position B (circle) should move to position C 
since it is most valuable. This since it is least valuable (to box). This 
maximum value (1) is assigned to minimum value (0) is assigned to the 
the box. circle. 


Consider the situation from the second (circle) player’s point of view. Suppose 
that we have the situation shown in Figure 9.9.4. Circle should move to the position 
represented by vertex C, since this position is least valuable to box and therefore most 
valuable to circle. In other words, circle moves to a position represented by a child with 
the minimum value. We assign this minimum value to the circle vertex. The process 
by which circle seeks the minimum of its children and box seeks the maximum of its 
children is called the minimax procedure. 

Working upward from the terminal vertices and using the minimax procedure, we 
can assign values to all of the vertices in the game tree (see Figure 9.9.2). These numbers 
represent the value of the game, at any position, to the first player. Notice that the root 
in Figure 9.9.2, which represents the original position, has a value of 1. This means 
that the first player can always win the game by using an optimal strategy. This optimal 
strategy is contained in the game tree: The first player always moves to a position that 
maximizes the value of the children. No matter what the second player does, the first 
player can always move to a vertex having value 1. Ultimately, a terminal vertex having 
value | is reached where the first player wins the game. 

Many interesting games, such as chess, have game trees so large that it is not 
feasible to use a computer to generate the entire tree. Nevertheless, the concept of a 
game tree is still useful for analyzing such games. 

When using a game tree we should use a depth-first search. If the game tree is 
so large that it is not feasible to reach a terminal vertex, we limit the level to which 
depth-first search is carried out. The search is said to be an n-level search if we limit the 
search to n levels below the given vertex. Since the vertices at the lowest level may not 
be terminal vertices, some method must be found to assign them a value. This is where 
the specifics of the game must be dealt with. An evaluation function E is constructed 
that assigns each possible game position P the value E(P) of the position to the first 
player. After the vertices at the lowest level are assigned values by using the function F, 
the minimax procedure can be applied to generate the values of the other vertices. We 
illustrate these concepts with an example. 


Apply the minimax procedure to find the value of the root in tic-tac-toe using a two- 
level, depth-first minimax search. Use the evaluation function E, which assigns a position 
the value VX — NO where NX (respectively, NO) is the number of rows, columns, or 
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diagonals containing an X (respectively, O) that X (respectively, O) might complete. For 
example, position P of Figure 9.9.5 has NX = 2, since X might complete the column 
or the diagonal, and NO = 1, since O can complete only a column. Therefore, E(P) = 
2-1=1. 


P SOLUTION In Figure 9.9.6, we have drawn the game tree for tic-tac-toe to level 2. We 
have omitted symmetric positions. We first assign to the vertices at level 2 the values 
Figure 9.9.5 The value of given by E (see Figure 9.9.7). Next, we compute circle’s values by minimizing over the 
position P is children. Finally, we compute the value of the root by maximizing over the children. 
E(P) =NX-NO=2-1=1. Using this analysis, the first move by the first player would be to the center square. 


Ee) 


[e) Oo oO 


Figure 9.9.6 The game tree for tic-tac-toe to level 2 with symmetric positions omitted. 
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Figure 9.9.7 The game tree of Figure 9.9.6 with the values 
of all vertices shown. 4 


Evaluation of a game tree, or even a part of a game tree, can be a time-consuming 
task, so any technique that reduces the effort is welcomed. The most general technique 
is called alpha-beta pruning. In general, alpha-beta pruning allows us to bypass many 
vertices in a game tree yet still find the value of a vertex. The value obtained is the same 
as if we had evaluated all the vertices. 

As an example, consider the game tree in Figure 9.9.8. Suppose that we want to 
evaluate vertex A using a two-level, depth-first search. We evaluate children left to right. 
We begin at the lower left by evaluating the vertices E, F, and G. The values shown are 
obtained from an evaluation function. Vertex B is 2, the minimum of its children. At 
this point, we know that the value x of A must be at least 2, since the value of A is the 
maximum of its children; that is, 


x>2. (9.9.1) 
This lower bound for A is called an alpha value of A. The next vertices to be evaluated 


are H,/, and J. When J evaluates to 1, we know that the value y of C cannot exceed 1, 
since the value of C is the minimum of its children; that is 


y<l. (9.9.2) 


Example 9.9.2 
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Figure 9.9.8 Evaluating vertex A using a 
two-level, depth-first search with alpha-beta 
pruning. An alpha cutoff occurs at vertex C 
when vertex J is evaluated since /’s value (1) is 
less than or equal to the current lower-bound 
estimate (2) for vertex A. 


It follows from (9.9.1) and (9.9.2) that whatever the value of y is, it will not affect the 
value of x; thus we need not concern ourselves further with the subtree rooted at vertex 
C. We say that an alpha cutoff occurs. We next evaluate the children of D and then D 
itself. Finally, we find that the value of A is 3. 

To summarize, an alpha cutoff occurs at a box vertex v when a grandchild w of v 
has a value less than or equal to the alpha value of v. The subtree whose root is the parent 
of w may be deleted (pruned). This deletion will not affect the value of v. An alpha value 
for a vertex v is only a lower bound for the value of v. The alpha value of a vertex is 
dependent on the current state of the search and changes as the search progresses. 

Similarly, a beta value of a circle vertex is an upper bound for v. A beta cutoff 
occurs at a circle vertex v when a grandchild w of v has a value greater than or equal 
to the beta value of v. The subtree whose root is the parent of w may be pruned. This 
deletion will not affect the value of v. A beta value for a vertex v is only an upper bound 
for the value of v. The beta value of a vertex is dependent on the current state of the 
search and changes as the search progresses. 


Evaluate the root of the tree of Figure 9.9.9 using depth-first search with alpha-beta 
pruning. Assume that children are evaluated left to right. For each vertex whose value is 
computed, write the value in the vertex. Place a check by the root of each subtree that is 
pruned. The value of each terminal vertex is written under the vertex. 


SOLUTION We begin by evaluating vertices A, B, C, and D (see Figure 9.9.10). Next, 
we find that the value of E is 6. This results in a beta value of 6 for F’. Next, we evaluate 
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Figure 9.9.9 The game tree for Example 9.9.2. 
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Figure 9.9.10 Evaluating the root of the game tree of Figure 9.9.9 
using depth-first search with alpha-beta pruning. Checked vertices 
are roots of subtrees that are pruned. The values of vertices that are 
evaluated are written inside the vertices. 


vertex G. Since its value is 8 and 8 exceeds the beta value of F, we obtain a beta cutoff 
and prune the subtree with root H. The value of F is 6. This results in an alpha value of 
6 for J. Next, we evaluate vertices J and K. Since the value 3 of K is less than the alpha 
value 6 of J, an alpha cutoff occurs and the subtree with root L may be pruned. Next, we 
evaluate M, N, O, P, Q, R, and S. No further pruning is possible. Finally, we determine 
that the root / has value 8. < 


It has been shown (see [Pearl]) that for game trees in which every parent has n 
children and in which the terminal values are randomly ordered, for a given amount of 
time, the alpha-beta procedure permits a search depth 4/3 greater than the pure minimax 
procedure, which evaluates every vertex. [Pearl] also shows that for such game trees, the 
alpha-beta procedure is optimal. 

Other techniques have been combined with alpha-beta pruning to facilitate the 
search of a game tree. One idea is to order the children of the vertices to be evaluated so 
that the most promising moves are examined first (see Exercises 23-26). Another idea 
is to allow a variable-depth search in which the search backtracks when it reaches an 
unpromising position as measured by some function. 

Some game-playing programs have been incredibly successful. The best chess, 
Go, backgammon, and checkers programs play at a level comparable to the best human 
players. The world checkers champion is a program named Chinook developed by a 
team from the University of Alberta. In 2007, they proved that Chinook can never lose. 
The best an opponent can hope for is a draw. In 1997 the IBM chess program, Deep 
Blue, defeated Garry Kasparov, who had been world champion since 1985, in a six- 
game match. Deep Blue won two games, drew three, and lost one. In 2016, AlphaGo, a 
program created by Google’s Deep Mind, defeated Lee Sedol, one of the best Go players 
in the world, in a five-game match, four to one. 


9.9 Review Exercises 


7 ao 


. What is a game tree? 


. What is the minimax procedure? 


What is an n-level search? 


What is an evaluation function? 


. What is an alpha value? 
. What is an alpha cutoff? 
. What is a beta value? 


6 
i 
8 
9. What is a beta cutoff? 


. Explain how alpha-beta pruning works. 
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9.9 Exercises 


. Draw the complete game tree for a version of nim in which the 


initial position consists of one pile of six tokens and a turn con- 
sists of taking one, two, or three tokens. Assign values to all 
vertices so that the resulting tree is analogous to Figure 9.9.2. 
Assume that the last player to take a token loses. Will the first 
or second player, playing an optimal strategy, always win? De- 
scribe an optimal strategy for the winning player. 


. Draw the complete game tree for nim in which the initial posi- 


tion consists of two piles of three tokens each. Omit symmetric 
positions. Assume that the last player to take a token loses. As- 
sign values to all vertices so that the resulting tree is analogous 
to Figure 9.9.2. Will the first or second player, playing an op- 
timal strategy, always win? Describe an optimal strategy for 
the winning player. 


. Draw the complete game tree for nim in which the initial po- 


sition consists of two piles, one containing three tokens and 
the other containing two tokens. Assume that the last player 
to take a token wins. Assign values to all vertices so that the 
resulting tree is analogous to Figure 9.9.2. Will the first or sec- 
ond player, playing an optimal strategy, always win? Describe 
an optimal strategy for the winning player. 


. Draw the complete game tree for nim in which the initial posi- 


tion consists of two piles of three tokens each. Omit symmet- 
ric positions. Assume that the last player to take a token wins. 
Assign values to all vertices so that the resulting tree is anal- 
ogous to Figure 9.9.2. Will the first or second player, playing 
an optimal strategy, always win? Describe an optimal strategy 
for the winning player. 


. Draw the complete game tree for the version of nim described 


in Exercise |. Assume that the last person to take a token wins. 
Assign values to all vertices so that the resulting tree is anal- 
ogous to Figure 9.9.2. Will the first or second player, playing 
an optimal strategy, always win? Describe an optimal strategy 
for the winning player. 


. Give an example of a (possibly hypothetical) complete game 


tree in which a terminal vertex is | if the first player won and 
0 if the first player lost having the following properties: There 
are more 0’s than 1’s among the terminal vertices, but the first 
player can always win by playing an optimal strategy. 


Exercises 7 and 8 refer to nim and nim’. Nim is the game using n 
piles of tokens as described in this section in which the last player 
to move loses. Nim’ is the game using n piles of tokens as described 
in this section except that the last player to move wins. We fix n piles 
each with a fixed number of tokens. We assume that at least one pile 
has at least two tokens. 


x7. 


8. 


Show that the first player can always win nim if and only if the 
first player can always win nim’. 


Given a winning strategy for a particular player for nim, de- 
scribe a winning strategy for this player for nim’. 


Evaluate each vertex in each game tree. The values of the terminal 


vertices are given. 


9. 


10. 


12 8 9 1 10 11 14 2 3 13 1 6 
11. 
4 15 1 5 9 13 
12. 
6 
2 9 6 
645 6 8 3 1 28 123 
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13. 


12 


14. 
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4 9 3 6 3 7 3 6 8 10 6 2 9 
Evaluate the root of each of the trees of Exercises 9-13 using a 
depth-first search with alpha-beta pruning. Assume that chil- 
dren are evaluated left to right. For each vertex whose value is 
computed, write the value in the vertex. Place a check by the 
root of each subtree that is pruned. The value of each terminal 
vertex is written under the vertex. 


In Exercises 15-18, determine the value of the tic-tac-toe position 
using the evaluation function of Example 9.9.1. 


15. 


16. 


O 


O|xX 


17. 
O 
xX|O 
x 
18. 
O 
x 
x 
19. Assume that the first player moves to the center square in 


x20. 


21. 


*22, 


tic-tac-toe. Draw a two-level game tree, with the root having 
an X in the center square. Omit symmetric positions. Evaluate 
all the vertices using the evaluation function of Example 9.9.1. 
Where will O move? 


Would a two-level search program based on the evaluation 
function E of Example 9.9.1 play a perfect game of tic-tac- 
toe? If not, can you alter E so that a two-level search program 
will play a perfect game of tic-tac-toe? 


Write an algorithm that evaluates vertices of a game tree to 
level n using depth-first search. Assume the existence of an 
evaluation function E. 


Write an algorithm that evaluates the root of a game tree using 
an n-level, depth-first search with alpha-beta pruning. Assume 
the existence of an evaluation function E. 


The following approach often leads to more pruning than pure alpha-beta minimax. First, perform a two-level search. Evaluate children 
from left to right. At this point, all the children of the root will have values. Next, order the children of the root with the most promising 
moves to the left. Now, use an n-level, depth-first search with alpha-beta pruning. Evaluate children from left to right. 

Carry out this procedure for n = 4 for each game tree of Exercises 23-25. Place a check by the root of each subtree that is pruned. 
The value of each vertex, as given by the evaluation function, is given under the vertex. 


23. 


9133 19 716 4 4 #169 14 6 


8 8 16 12 13 20 


24. 
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30. = 10 70 35 


42 56 9 


25. 


9 22 2110 6 41 4020 3 43133756 3 3435 1355 6 33 445251 


9 25 30 50 


9133 19 716 4 4 #16 9 14 6 


26. Write an algorithm to carry out the procedure described before 
Exercise 23. 


Consider the following simplified version of Jai Alai (see [Moser]) 
in which there are n players numbered 1,2, ...,n. When the game 
begins, the players are aligned in order 1|,...,n. The game be- 
gins with the first player (number 1) playing the next player (num- 
ber 2). When a player wins, that player plays the next player in 
line. (So if player | beats player 2, player | next plays player 3.) 
A losing player goes to the end of the line. (So, again, if player 
1 beats player 2, player 2 goes to the end of the line and the line 
becomes 1,3,4,...,n, 2.) One point is given to each winner in 
the first n — 1 rounds, and two points are given to each winner 
in the subsequent rounds. The first player to score at least n — 1 
points wins the game. An extensive analysis of Jai Alai is given in 
[Skiena]. 


27. Draw the complete game tree for n = 3. 


28. Draw the complete game tree for n = 4. 


*29. Prove that the height of the game tree is n(n — 1)/2. 


Mu Torere is a two-person game played by the Maoris (see [Bell]). 
The board is an eight-pointed star with a circular area in the center 
known as the putahi. 


8 8 16 12 13 20 


The first player has four black tokens and the second player has 
four white tokens. The initial position is shown. A player who can- 
not make a move loses. Players alternate moves. At most one token 
can occupy a point of the star or the putahi. A move consists of 


(a) Moving to an adjacent point 
(b) Moving from the putahi to a point 
(c) Moving from a point to the putahi provided that one or 
both of the adjacent points contain the opponent's pieces 
«30. 
x31. 


Develop an evaluation function for Mu Torere. 


Combine the evaluation function in Exercise 30 with a two- 
level search of the game tree to obtain a game-playing algo- 
rithm for Mu Torere. Evaluate the game-playing ability of this 
algorithm. 
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. Can the first player always win in Mu Torere? 
. Can the first player always tie in Mu Torere? 


. [Project] According to [Nilsson], the complete game tree for 


chess has over 10! vertices. Report on how this estimate was 
obtained. 


x35. [Project] Develop an evaluation function for Kalah. (See 
[Ainslie] for the rules.) 


x36. Develop a game-playing algorithm for Kalah based on the 
evaluation function of Exercise 35. Evaluate the game-playing 
ability of this algorithm. 


Chapter 9 Notes 


The following are recommended references on trees: [Berge; Bondy; Deo; Even, 1979; 
Gibbons; Harary; Knuth, 1997; Liu, 1985; and Ore]. 

See [Date] for the use of trees in hierarchical databases. 

[Johnsonbaugh] has additional information on Huffman codes and a proof that 
Algorithm 9.1.9 constructs an optimal Huffman tree. 

[Golomb, 1965] describes backtracking and contains several examples and 


Aun kwWYN = 


applications. 


Minimal spanning tree algorithms and their implementation can be found in [Tarjan]. 
[Johnsonbaugh] discusses the minimal time for sorting as well as lower bounds for 


other problems. 


Classical sorting algorithms are thoroughly covered in [Knuth, 1998b]. See [Ak]; 
Leighton; Lester; Lewis; Miller; and Quinn] for sorting using parallel machines. 

Good references on game trees are [Nievergelt; Nilsson; and Slagle]. In [Frey], the 
minimax procedure is applied to a simple game. Various methods to speed up the search of 
the game tree are discussed and compared. Computer programs are given. [Berlekamp, 2001, 
2003] contains a general theory of games as well as analyses of many specific games. 


Chapter 9 Review 


Section 9.1 


. Free tree 

. Rooted tree 

. Level of a vertex in a rooted tree 
. Height of a rooted tree 

. Hierarchical definition tree 

. Huffman code 


Section 9.2 


. Parent 

. Ancestor 

. Child 

. Descendant 

. Sibling 

. Terminal vertex 

. Internal vertex 

. Subtree 

. Acyclic graph 

. Alternative characterizations of trees (Theorem 9.2.3) 


Section 9.3 


. Spanning tree 

. A graph has a spanning tree if and only if it is connected. 
. Breadth-first search 

. Depth-first search 

. Backtracking 


Section 9.4 


22. Minimal spanning tree 
23. Prim’s Algorithm to find a minimal spanning tree 
24. Greedy algorithm 


Section 9.5 

25. Binary tree 

26. Left child in a binary tree 
27. Right child in a binary tree 
28. Full binary tree 


29. If T is full binary tree with 7 internal vertices, then T has 
i+ 1 terminal vertices and 27 + 1 total vertices. 

30. If a binary tree of height h has ¢ terminal vertices, then 
Igt<h. 

31. Binary search tree 

32. Algorithm to construct a binary search tree 


Section 9.6 

33. Preorder traversal 

34. Inorder traversal 

35. Postorder traversal 

36. Prefix form of an expression (Polish notation) 

37. Infix form of an expression 

38. Postfix form of an expression (reverse Polish notation) 
39. Tree representation of an expression 


Section 9.7 


47. 
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Linear-time algorithm (Algorithm 9.8.13) to test whether 
two binary trees are isomorphic 


40. Decision tree 
41. The height of a decision tree that represents an algorithm is 

proportional to the worst-case time of the algorithm. Section 9.9 
42. Any sorting algorithm requires at least Q(nlgn) compar- 48. Game tree 

isons in the worst case to sort n items. 49. Minimax procedure 

50. n-level search 

Section 9.8 51. Evaluation function 
43. Isomorphic free trees 52. Alpha-beta pruning 
44. Isomorphic rooted trees 53. Alpha value 
45. Isomorphic binary trees 54. Alpha cutoff 
46. The Catalan number C(2n, n)/(n+1) is equal to the number 55. Beta value 

of nonisomorphic binary trees with n vertices. 56. Beta cutoff 

Chapter 9 Self-Test 

Exercises 1-4 refer to the following tree. 8. List the order in which the vertices are processed using in- 
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. Draw the tree as a rooted tree with root c. 
. Find the level of every vertex in the tree rooted at c. 
. Find the height of the tree rooted at c. 


. Draw the tree as a rooted tree with root f. Find 


(a) The parent of a. 

(b) The children of b. 

(c) The terminal vertices. 
(d) The subtree rooted at e. 


. Draw a binary tree with exactly two left children and one 


right child. 


. A full binary tree has 15 internal vertices. How many ter- 


minal vertices does it have? 


Exercises 7-9 refer to the following binary tree. 


V 


List the order in which the vertices are processed using pre- 
order traversal. 


9. 


10. 


11. 


12. 


13. 


14. 


order traversal. 


List the order in which the vertices are processed using post- 
order traversal. 


Find a minimal spanning tree for the following graph. 


1 8 2 14 3 


F 24 : 26 |, 
28 

4 2 6 

16 22 4 


In what order are the edges added by Prim’s Algorithm for 
the graph of Exercise 10 if the initial vertex is 1? 


In what order are the edges added by Prim’s Algorithm for 
the graph of Exercise 10 if the initial vertex is 6? 

Give an example of the use of the greedy method that does 
not lead to an optimal algorithm. 


Construct an optimal Huffman code for the set of letters in 
the table. 


Letter Frequency 


5 
8 


aAmoQn> 
Du 


Answer true or false in Exercises 15-17 and explain your 
answer. 


15. 


If T is a tree with six vertices, T must have five edges. 
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16. If T is a rooted tree with six vertices, the height of T is at 
most 5. 

17. An acyclic graph with eight vertices has seven edges. 

18. Represent the prefix expression — * E/BD — CA as a binary 


tree. Also write the postfix form and the fully parenthesized 
infix form of the expression. 


Answer true or false in Exercises 19 and 20 and explain your 
answer. 


19. If T; and T> are isomorphic as rooted trees, then 7; and T> 
are isomorphic as free trees. 


20. IfT) and T> are rooted trees that are isomorphic as free trees, 


then 7; and T> are isomorphic as rooted trees. 
21. Place the words 
WORD PROCESSING PRODUCES CLEAN MANUSCRIPTS 
BUT NOT NECESSARILY CLEAR PROSE 


in the order in which they appear, in a binary search tree. 


22. Explain how we would look for MORE in the binary search 


tree of Exercise 21. 

23. Use breadth-first search (Algorithm 9.3.6) with the vertex 
ordering eachgbdfi to find a spanning tree for the following 
graph. 


a b Cc 


g h i 


24. Use depth-first search (Algorithm 9.3.7) with the vertex or- 
dering eachgbdfi to find a spanning tree for the graph of 
Exercise 23. 

25. Use breadth-first search (Algorithm 9.3.6) with the vertex 

ordering fdehaghci to find a spanning tree for the graph of 

Exercise 23. 

Use depth-first search (Algorithm 9.3.7) with the vertex or- 

dering fdehaghbci to find a spanning tree for the graph of 

Exercise 23. 


26 


27. Determine whether the free trees are isomorphic. If the trees 
are isomorphic, give an isomorphism. If the trees are not 


isomorphic, give an invariant the trees do not share. 


vi v2 w4 
U3 
U4 U7 Ww2 ° W4 
U5 V6 W3 
Us W5 W6 W7 Weg 


Ty 14 


28. 


29. 


30. 


31 


32. 


33. 


34 


35 


Determine whether the rooted trees are isomorphic. If the 
trees are isomorphic, give an isomorphism. If the trees 
are not isomorphic, give an invariant the trees do not 
share. 


v1 WI 
v v v, W 
2 3 4 2 wz W4 
V6 Ws 
U5 
U7 W6 W7 
T; TT) 


Six coins are identical in appearance, but one coin is either 
heavier or lighter than the others, which all weigh the same. 
Prove that at least three weighings are required in the worst 
case to identify the bad coin and determine whether it is 
heavy or light using only a pan balance. 


Draw a decision tree that gives an algorithm to solve the 
coin puzzle of Exercise 29 in at most three weighings in the 
worst case. 


Professor E. Sabic claims to have discovered an algorithm 
that uses at most 100” comparisons in the worst case to sort 
nitems, for all n > 1. The professor’s algorithm repeatedly 
compares two elements and, based on the result of the com- 
parison, modifies the original list. Give an argument that 
shows that the professor must be mistaken. 


The binary insertion sort algorithm sorts an array of size n 
as follows. If n = 1, 2, or 3, the algorithm uses an optimal 
sort. Ifn > 3, the algorithm sorts s;, ..., 5, in the following 
way. First, s;,..., S,—, 1s recursively sorted. Then binary 
search is used to determine the correct position for s,, after 
which s, is inserted in its correct position. Determine the 
number of comparisons used by binary insertion sort in the 
worst case for n = 4, 5, 6. Does any algorithm use fewer 
comparisons for n = 4, 5, 6? 


Find the value of the tic-tac-toe position using the evalua- 
tion function of Example 9.9.1. 


X/O 


Give an evaluation function for a tic-tac-toe position dif- 
ferent from that of Example 9.9.1. Attempt to discriminate 
more among the positions than does the evaluation function 
of Example 9.9.1. 


Evaluate each vertex in the following game tree. The values 
of the terminal vertices are given. 


36 
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Chapter 9 Computer Exercises 


. Evaluate the root of the tree of Exercise 35 using the min- 
imax procedure with alpha-beta pruning. Assume that the 
children are evaluated left to right. For each vertex whose 
value is computed, write the value in the vertex. Place a 
check by the root of each subtree that is pruned. 


Chapter 9 Computer Exercises 


1. Write a program that tests if a graph is a tree. 


. Write a program that, given the adjacency matrix of a tree 


and a vertex v, draws the tree rooted at v using a computer 
graphics display. 


. Write a program that, given a frequency table for characters, 


constructs an optimal Huffman code. 


. Write a program that encodes and decodes text given a Huff- 


man code. 


. Compute a table of characters and frequencies by sampling 


some text. Use your program of Exercise 3 to generate an 
optimal Huffman code. Use your program of Exercise 4 to 
encode some sample text. Compare the number of bits used 
to encode the text using the Huffman code with the number 
of bits used to encode the text in ASCII. 


. Write a program that, given a tree T, computes the ec- 


centricity of each vertex in 7 and finds the center(s) 
of T. 


. Write a program that, given a rooted tree and a vertex v, 


(a) Finds the parent of v. 

(b) Finds the ancestors of v. 
(c) Finds the children of v. 

(d) Finds the descendants of v. 
(e) Finds the siblings of v. 


(f) Determines whether v is a terminal vertex. 


8. Write a program that finds a spanning tree in a graph. 


9. Write a program that determines whether a graph is con- 


10. 
11. 
12. 


nected. 
Write a program that finds the components of a graph. 
Write a program to solve the n-queens problem. 


Write a backtracking program to determine whether two 
graphs are isomorphic. 
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14. 


15. 


16. 
17. 


18. 


19. 
20. 


21. 


22. 
23. 


24. 


25. 


26. 


27. 


28. 
29. 


. Write a backtracking program that determines whether a 
graph can be colored with n colors and, if it can be colored 
with n colors, produces a coloring. 


Write a backtracking program that determines whether a 
graph has a Hamiltonian cycle and, if there is a Hamilto- 
nian cycle, finds it. 


Write a program that, given a graph G and a spanning tree 
for G, computes the fundamental cycle matrix of G. 


Implement Prim’s Algorithm as a program. 


Implement Kruskal’s Algorithm (given before Exercise 20, 
Section 9.4) as a program. 


Write a program that accepts strings and puts them into a 
binary search tree. 


Write a program that constructs all n-vertex binary trees. 


Write a program that generates a random n-vertex binary 
tree. 


Implement preorder, postorder, and inorder tree traversals 
as programs. 


Implement tournament sort as a program. 


Implement Algorithm 9.8.13, which tests whether two bi- 
nary trees are isomorphic, as a program. 


Write a program to generate the complete game tree for nim 
in which the initial position consists of two piles of four to- 
kens each. Assume that the last player to take a token loses. 
Implement the minimax procedure as a program. 
Implement the minimax procedure with alpha-beta pruning 
as a program. 

Implement the method of playing tic-tac-toe in Exam- 
ple 9.9.1 as a program. 

Write a program that plays a perfect game of tic-tac-toe. 
[Project] Develop a computer program to play a game that 


has relatively simple rules. Suggested games are Cribbage, 
Othello, The Mill, Battleship, and Kalah. 
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10.1 


Chapter 10 


NETWORK MODELS 


In this chapter we discuss network models, which make use of directed graphs. The 
major portion of the chapter is devoted to the problem of maximizing the flow through 
a network. The network might be a transportation network through which commodities 
flow, a pipeline network through which oil flows, a computer network through which 
data flow, or any number of other possibilities. In each case the problem is to find a 
maximal flow. Many other problems, which on the surface seem not to be flow problems, 
can, in fact, be modeled as network flow problems. 

Maximizing the flow in a network is a problem that belongs both to graph theory 
and to operations research. The traveling salesperson problem furnishes another example 
of a problem in graph theory and operations research. Operations research studies the 
very broad category of problems of optimizing the performance of a system. Typical 
problems studied in operations research are network problems, allocation of resources 
problems, and personnel assignment problems. 


Introduction 


Consider the directed graph in Figure 10.1.1, which represents an oil pipeline network. 
Oil is unloaded at the dock a and pumped through the network to the refinery z. The 
vertices b, c, d, and e represent intermediate pumping stations. The directed edges rep- 
resent subpipelines of the system and show the direction the oil can flow. The labels on 
the edges show the capacities of the subpipelines. The problem is to find a way to max- 
imize the flow from the dock to the refinery and to compute the value of this maximum 
flow. Figure 10.1.1 provides an example of a transport network. 


a= dock z = refinery 


Figure 10.1.1 A transport network. 


Definition 10.1.1 > — A transport network (or more simply network) is a simple, 
weighted, directed graph satisfying: 


Example 10.1.2 


Example 10.1.4 
be “2,2, 


d 2,2 e 


Figure 10.1.2 Flow ina 
network. Edges are labeled 
x, y to indicate capacity x 
and flow y. 
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(a) A designated vertex, the source, has no incoming edges. 
(b) A designated vertex, the sink, has no outgoing edges. 


(c) The weight Cj; of the directed edge (i, j), called the capacity of (i, j), is a non- 
negative number. 


The graph of Figure 10.1.1 is a transport network. The source is vertex a and the sink 
is vertex z. The capacity of edge (a, b), Cap, is 3 and the capacity of edge (b,c), Cyc, 
is 2. < 


Throughout this chapter, if G is a network, we will denote the source by a and the 
sink by z. 

A flow in a network assigns a flow in each directed edge that does not exceed the 
capacity of that edge. Moreover, it is assumed that the flow into a vertex v, which is 
neither the source nor the sink, is equal to the flow out of v. The next definition makes 
these ideas precise. 


Definition 10.1.3 > Let Gbe a transport network. Let C;; denote the capacity 
of the directed edge (i,j). A flow F in G assigns each directed edge (i, j) a nonnegative 
number F7; such that: 

(a) F; ix Cj. 


(b) For each vertex j, which is neither the source nor the sink, 
SRS Fe (10.1.1) 


{In a sum such as (10.1.1), unless specified otherwise, the sum is assumed to be taken 
over all vertices i. Also, if (i, j) is not an edge, we set Fj = 0.] 
We call Fj; the flow in edge (i, j). For any vertex j, we call 


Di 
the flow into j and we call 
Di 
the flow out of j. < 
The property expressed by equation (10.1.1) is called conservation of flow. In the 


oil-pumping example of Figure 10.1.1, conservation of flow means that oil is neither 
used nor supplied at pumping stations b, c, d, and e. 


The assignments, 
Foy =2, Foe = 2, Fee = 3, Faa = 3, 
Fac =1, Fae = 2, F.z = 2, 


define a flow for the network of Figure 10.1.1. For example, the flow into vertex d, 
Faq = 3, is the same as the flow out of vertex d, Fae + Fae = 1+2 = 3. < 


In Figure 10.1.2 we have redrawn the network of Figure 10.1.1 to show the flow 
of Example 10.1.4. An edge e is labeled “‘x, y” if the capacity of e is x and the flow in e 
is y. This notation will be used throughout this chapter. 
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Theorem 10.1.5 


Example 10.1.7 


Notice that in Example 10.1.4, the flow out of the source a, Fy, + Faa, 1s the same 
as the flow into the sink z, F., + F.-; both values are 5. The next theorem shows that it 
is always true that the flow out of the source equals the flow into the sink. 


Given a flow F in a network, the flow out of the source a equals the flow into the sink 


z; that is, 
ri = pags 
i i 


Proof Let V be the set of vertices. We have 
> (ea] -5 (EH), 
jeV \ieV jeV \iev 


since each double sum is 


oF 


ecE 


where E is the set of edges. Now 


0-0 (S4-EA) 


JEeV \ieVv ieV 
= (SA r.) + (See Ee) 
ieV ieV ieV ieV 
+O (ea-ra) 
hie ieV ieV 
= Ske _ ee 
ieV ieV 


since F,; = 0 = Fig, for all i € V, and (Definition 10.1.3b) 


Fy — > Fi =0 if j ¢ V — {a, z}. < 


ieV ieV 


In light of Theorem 10.1.5, we can state the following definition. 


Definition 10.1.6 >» Let F be a flow in a network G. The value 
a Ha = > Fiz 
is called the value of the flow F. < 


The value of the flow in the network of Figure 10.1.2 is 5. < 


The problem for a transport network G may be stated: Find a maximal flow in G; 
that is, among all possible flows in G, find a flow F so that the value of F is a maximum. 
In the next section we give an algorithm that efficiently solves this problem. We conclude 
this section by giving additional examples. 
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Example 10.1.8 A Pumping Network Figure 10.1.3 represents a pumping network in which water for 


Example 10.1.9 


two cities, A and B, is delivered from three wells, w;, w2, and w3. The capacities of the 
intermediate systems are shown on the edges. Vertices b, c, and d represent intermediate 
pumping stations. Model this system as a transport network. 


Figure 10.1.3 A pumping 
network. Water for cities A and B is 
delivered from wells w;, w2, and 
w3. Capacities are shown on the 
edges. 


SOLUTION To obtain a designated source and sink, we can obtain an equivalent trans- 
port network by tying together the sources into a supersource and tying together the 
sinks into a supersink (see Figure 10.1.4). In Figure 10.1.4, oo represents an unlimited 
capacity. 


Wi 6 b 4 A 


3 d B 


Figure 10.1.4 The network of Figure 10.1.3 with a designated 
source and sink. < 


A Traffic Flow Network It is possible to go from city A to city C directly or by going 
through city B. During the period 6:00 PM. to 7:00 PM., the average trip times are 


AtoB 15 minutes 
BtoC 30 minutes 
AtoC 30 minutes. 


The maximum capacities of the routes are 


AtoB 3000 vehicles 
BtoC 2000 vehicles 
AtoC 4000 vehicles. 


Represent the flow of traffic from A to C during the period 6:00 PM. to 7:00 PM. as a 
network. 


SOLUTION A vertex will represent a city at a particular time (see Figure 10.1.5). An 
edge connects X, ty to Y, f2 if we can leave city X at t; P.M. and arrive at city Y at f2 PM. 
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The capacity of an edge is the capacity of the route. Edges of infinite capacity connect 
A, ti, to A, tz, and B, ft; to B, fz to indicate that any number of cars can wait at city A or 
city B. Finally, we introduce a supersource and supersink. 


4000 


4000 


Figure 10.1.5 A network that represents the flow of traffic from city 
A to city C during the period 6:00 P.M. to 7:00 P.M. ST 


Variants of the network flow problem have been used in the design of efficient com- 
puter networks (see [Jones; Kleinrock]). In a model of a computer network, a vertex is a 
message or switching center, an edge represents a channel on which data can be transmit- 
ted between vertices, a flow is the average number of bits per second being transmitted 
on a channel, and the capacity of an edge is the capacity of the corresponding channel. 


10.1 Review Exercises 


1. What is a network? 8. What is a flow out of a vertex? 

2. What is a source in a network? 9. What is conservation of flow? 

3. What is a sink in a network? 10. Given a flow in a network, what is the relation between the 
: ae oe flow out of the source and the flow into the sink? 

4. What is a capacity in a network? 
: ; 11. What is a supersource? 

5. What is a flow in a network? P 
: . 12. What is a supersink? 

6. What is a flow in an edge? P 

7. What is a flow into a vertex? 


10.1 Exercises 


In Exercises 1-3, fill in the missing edge flows so that the result is 2. 
a flow in the given network. Determine the values of the flows. 


i, 
b 6, c aye 


f 5,2 8g 


. The following graph represents a pumping network in which 
oil for three refineries, A, B, and C, is delivered from three 
wells, wi, w2, and w3. The capacities of the intermediate 
systems are shown on the edges. Vertices b, c, d, e, and f rep- 
resent intermediate pumping stations. Model this system as a 
network. 


oS: i 2 f 6 


. Model the system of Exercise 4 as a network assuming that 
well w; can pump at most 2 units, well w2 at most 4 units, and 
well w3 at most 7 units. 


. Model the system of Exercise 5 as a network assuming, in 
addition to the limitations on the wells, that city A requires 4 
units, city B requires 3 units, and city C requires 4 units. 


. Model the system of Exercise 6 as a network assuming, in ad- 
dition to the limitations on the wells and the requirements by 
the cities, that the intermediate pumping station d can pump 
at most 6 units. 


10.2 


10. 


11. 
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. There are two routes from city A to city D. One route passes 


through city B and the other route passes through city C. Dur- 
ing the period 7:00 A.M. to 8:00 A.M., the average trip times are 


AtoB 30 minutes 
AtoC 15 minutes 
BtoD 15 minutes 
CtoD 15 minutes. 


The maximum capacities of the routes are 


AtoB 1000 vehicles 
AtoC 3000 vehicles 
BtoD 4000 vehicles 
CtoD 2000 vehicles. 


Represent the flow of traffic from A to D during the period 
7:00 A.M. to 8:00 A.M. as a network. 


. In the system shown, we want to maximize the flow from 


a to z. The capacities are shown on the edges. The flow be- 
tween two vertices, neither of which is a or z, can be in either 
direction. Model this system as a network. 


Give an example of a network with exactly two maximal flows, 
where each F’ is a nonnegative integer. 


What is the maximum number of edges that an n-vertex net- 
work can have? 


A Maximal Flow Algorithm 


If G is a transport network, a maximal flow in G is a flow with maximum value. In 
general, there will be several flows having the same maximum value. In this section we 
give an algorithm for finding a maximal flow. The basic idea is simple—start with some 
initial flow and iteratively increase the value of the flow until no more improvement is 
possible. The resulting flow will then be a maximal flow. 

We can take the initial flow to be the one in which the flow in each edge is zero. 
To increase the value of a given flow, we must find a path from the source to the sink 
and increase the flow along this path. 

It is helpful at this point to introduce some terminology. Throughout this section, 
G denotes a network with source a, sink z, and capacity C. Momentarily, consider the 
edges of G to be undirected and let 


P=(v0,V1,-- 


oy Vn)s vo = 4, Vn = %; 
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Example 10.2.1 


be a path from a to z in this undirected graph. (All paths in this section are with reference 
to the underlying undirected graph.) If an edge e in P is directed from vj;_; to vj, we say 
that e is properly oriented (with respect to P); otherwise, we say that e is improperly 
oriented (with respect to P) (see Figure 10.2.1). 


Vi-1 
ee Un =Z 
v2 eae x wee 
V1 Un-1 
aa Properly 
a= U0 oriented vj Improperly 


oriented 


Path P 


Figure 10.2.1 Properly and improperly oriented edges. Edge (vj_1, vj) is 
properly oriented because it is oriented in the a-to-z direction. Edge (v;, vj+1) is 
improperly oriented because it is not in the a-to-z direction. 


If we can find a path P from the source to the sink in which every edge in P is 
properly oriented and the flow in each edge is less than the capacity of the edge, it is 
possible to increase the value of the flow. 


Consider the path from a to z in Figure 10.2.2. All the edges in P are properly oriented. 
The value of the flow in this network can be increased by 1, as shown in Figure 10.2.3. 


3,2 2,2 
b 4,2 
Zz 
a c 
Figure 10.2.2 A path all of whose Figure 10.2.3 After increasing 
edges are properly oriented. the flow of Figure 10.2.2 by 1. < 


It is also possible to increase the flow in certain paths from the source to the sink 
in which we have properly and improperly oriented edges. Let P be a path from a to 
z and let x be a vertex in P that is neither a nor z (see Figure 10.2.4). There are four 
possibilities for the orientations of the edges e; and e> incident on x. In case (a), both 
edges are properly oriented. In this case, if we increase the flow in each edge by A, the 
flow into x will still equal the flow out of x. In case (b), if we increase the flow in e 
by A, we must decrease the flow in e; by A so that the flow into x will still equal the 
flow out of x. Case (c) is similar to case (b), except that we increase the flow in e; by 
A and decrease the flow in e, by A. In case (d), we decrease the flow in both edges 
by A. In every case, the resulting edge assignments give a flow. Of course, to carry out 


e x e 

— a ee - ° - e sa0) Seg (a) 
e eo 

OC ——+ HH, ws % . - my ee ere (b) 
e (=) 

a .-—__+__.. . = ° < ° —-—-z (c) 
e e2 

a . — ~ os +e oe. Zz (dd) 


Figure 10.2.4 The four possible orientations of the edges incident on x. 
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these alterations, we must have flow less than capacity in a properly oriented edge and 
a nonzero flow in an improperly oriented edge. 


Consider the path from a to z in Figure 10.2.5. Edges (a, b), (c, d), and (d, z) are prop- 
erly oriented and edge (c, b) is improperly oriented. We decrease the flow by 1 in the 
improperly oriented edge (c, b) and increase the flow by 1 in the properly oriented edges 
(a, b), (c, d), and (d, z) (see Figure 10.2.6). The value of the new flow is | greater than 
that of the original flow. 


a d a d 
Figure 10.2.5 A path with an improperly Figure 10.2.6 After increasing the flow of 
oriented edge: (c, b). Figure 10.2.5 by 1. < 


Theorem 10.2.3 


Go Online 

For more on the maximal 
flow algorithm, see 
goo.gl/Zs13xb 


We summarize the method of Examples 10.2.1 and 10.2.2 as a theorem. 


Let P be a path from a to z in a network G satisfying the following conditions: 
(a) For each properly oriented edge (i, j) in P, 
Fy = Gz. 
(b) For each improperly oriented edge (i, j) in P, 
O) << he 
Let 
A=minXx, 


where X consists of the numbers Cj; — Fi, for properly oriented edges (i, j) in P, and 
F;;, for improperly oriented edges (i, j) in P. Define 
Fj if (7, j) is not in P 
Fj = 4 Fij+A if Gj) is properly oriented in P 
F,,— A _ if Gj) is not properly oriented in P. 


Then F* is a flow whose value is A greater than the value of F. 


Proof (See Figures 10.2.2, 10.2.3, 10.2.5, and 10.2.6.) The argument that F™ is a flow 
is given just before Example 10.2.2. Since the edge (a, v) in P is increased by A, the value 
of F* is A greater than the value of F. < 


In the next section we show that if there are no paths satisfying the conditions of 
Theorem 10.2.3, the flow is maximal. Thus it is possible to construct an algorithm based 
on Theorem 10.2.3. The outline is as follows: 


1. Start with a flow (e.g., the flow in which the flow in each edge is 0). 


2. Search for a path satisfying the conditions of Theorem 10.2.3. If no such path 
exists, stop; the flow is maximal. 
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3. Increase the flow through the path by A, where A is defined as in Theorem 10.2.3, 


and go to line 2. 


In the formal algorithm, we search for a path satisfying the conditions of Theo- 


rem 10.2.3 while simultaneously keeping track of the quantities Cj — Fi, Fi. 


This algorithm finds a maximal flow in a network. The capacity of each edge is a 
nonnegative integer. 


eel 


Input: A network with source a, sink z, capacity C, vertices a = vo,..., 
Vn = Z, andn 


Output: A maximal flow F 


max_flow(a, z, C, v, n) { 
// v’s label is (predecessor(v), val(v)) 
// start with zero flow 
for each edge (i, j) 
while (true) { 
// remove all labels 
fori=Oton{ 
predecessor(v;) = null 
val(v;) = null 
} 
// label a 
predecessor(a) = — 
val(a) = oo 
// U is the set of unexamined, labeled vertices 
U = {a} 
// continue until z is labeled 
while (val(z) == null) { 
if (U == @) // flow is maximal 
return F 
choose v in U 
U=U — {vy} 
A = val(v) 
for each edge (v, w) with val(w) == null 
I Clim << Cope) 
predecessor(w) = v 
val(w) = min{A, Cy — Fry} 


U=UU{w} 
} 
for each edge (w, v) with val(w) == null 
if (Fy > 0) { 


predecessor(w) =v 
val(w) = min{A, F,,,} 
(O/ {UND in} 
} 
} // end while (val(z) == null) loop 
// find path P from a to z on which to revise flow 
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30. Wo =Z 
Bila k=O 
B28 while (wp-= a) { 
3)3}, We+1 = predecessor(wx) 
34. kes lear Ih 
3B, } 
36. P= (Wet, Wks --- W1, Wo) 
Sip A = val(z) 
38. fori=1tok+1{ 
39. e= (wij, W;-1) 
40. if (e is properly oriented in P) 
41. F,=F.+A 
42. else 
43. F,=F.—A 
44. } 
45. } // end while (true) loop 
} 


A proof that Algorithm 10.2.4 terminates is left as an exercise (Exercise 19). If the 
capacities are allowed to be nonnegative rational numbers, the algorithm also terminates; 
however, if arbitrary nonnegative real capacities are allowed and we permit the edges 
in line 17 to be examined in any order, the algorithm may not terminate (see [Ford, 
pp. 21-22]). 

Algorithm 10.2.4 is often referred to as the labeling procedure. We will illustrate 
the algorithm with two examples. 


In this discussion, if vertex v satisfies 


predecessor(v) = p and val(v) = t, 


we show v’s label on the graph as (p, f). 

At lines | and 2, we initialize the flow to 0 in each edge (see Figure 10.2.7). Next, 
at lines 4—7 we set all labels to null. Then, at lines 8 and 9 we label vertex a as (—, 00). 
At line 10 we set U = {a}. We then enter the while loop (line 11). 


b 2,0 Cc 


d 2,0 e 


Figure 10.2.7 After the first labeling. Vertex 
v is labeled (predecessor(v), val(v)). 


Since z is not labeled and U is not empty, we move to line 14, where we choose 
vertex a in U and remove it from U at line 15. At this point, U = ©. We set A to 
oo [= val(a)] at line 16. At line 17 we examine the edges (a, b) and (a, d) since neither 
b nor d is labeled. For edge (a, b) we have F,, = 0 < Cy, = 3. At lines 19 and 20, we 
label vertex b as (a, 3) since predecessor(b) = a and 


val(b) = min{A, 3 — 0} = min{oo, 3 — 0} = 3. 
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At line 21, we add b to U. Similarly, we label vertex d as (a, 5) and add d to U. At this 
point, U = {b, d}. 

We then return to the top of the while loop (line 11). Since z is not labeled and U is 
not empty, we move to line 14, where we choose a vertex in U. Suppose that we choose b. 
We remove b from U at line 15. We set A to 3 [= val(b)] at line 16. At line 17 we examine 
edge (b, c). At lines 19 and 20 we label vertex c as (b, 2) since predecessor(c) = b and 


val(c) = min{A, 2 — 0} = min{3, 2 — 0} = 2. 


At line 21 we add c to U. At this point, U = {c, d}. 

We then return to the top of the while loop (line 11). Since z is not labeled and U is 
not empty, we move to line 14, where we choose a vertex in U. Suppose that we choose 
c. We remove c from U at line 15. We set A to 2 [= val(c)] at line 16. At line 17 we 
examine edge (c, z). At lines 19 and 20 we label vertex z as (c, 2). At line 21, we add z 
to U. At this point, U = {d, z}. 

We then return to the top of the while loop (line 11). Since z is labeled, we proceed 
to line 30. At lines 30-36, by following predecessors from z, we find the path P = 
(a, b, c, z) from a to z. At line 37 we set A to 2. Since each edge in P is properly oriented, 
at line 41 we increase the flow in each edge in P by A = 2 to obtain Figure 10.2.8. 

We then return to the top of the while loop (line 3). Next, at lines 4-7 we set all 
labels to null. Then, at lines 8 and 9 we label vertex a as (—, 00) (see Figure 10.2.8). At 
line 10 we set U = {a}. We then enter the while loop (line 11). 


b 2,2 Cc 


(a, 5) (d, 2) 
d 2,0 e 


Figure 10.2.8 After increasing the flow 
on path (a, b, c, z) by 2 and the second 
labeling. 


Since z is not labeled and U is not empty, we move to line 14, where we choose 
vertex a in U and remove it from U at line 15. At lines 19 and 20 we label vertex b as 
(a, 1), and we label vertex d as (a, 5). We add b and d to U so that U = {b, d}. 

We then return to the top of the while loop (line 11). Since z is not labeled and U is 
not empty, we move to line 14, where we choose a vertex in U. Suppose that we choose 
b. We remove b from U at line 15. At line 17 we examine edge (b, c). Since Fye = Cre, 
we do not label vertex c at this point. Now U = {d}. 

We then return to the top of the while loop (line 11). Since z is not labeled and U 
is not empty, we move to line 14, where we choose vertex d in U and remove it from U 
at line 15. At lines 19 and 20 we label vertex c as (d, 2) and we label vertex e as (d, 2). 
We add c and e to U so that U = {c, e}. 

We then return to the top of the while loop (line 11). Since z is not labeled and U is 
not empty, we move to line 14, where we choose a vertex in U. Suppose that we choose 
cin U and remove it from U at line 15. At lines 19 and 20 we label vertex z as (c, 2). 
We add z to U so that U = {z, e}. 

We then return to the top of the while loop (line 11). Since z is labeled, we proceed 
to line 30. At line 36 we find that P = (a, d, c, z). Since each edge in P is properly ori- 
ented, at line 41 we increase the flow in each edge in P by A = 2 to obtain Figure 10.2.9. 
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¢€ b 2s. c 


e d 2,2 e 


Figure 10.2.9 After increasing the flow Figure 10.2.10 After increasing the 


on path (a, d, c, z) by 2 and the third 


labeling. 


Example 10.2.6 


flow on path (a, d, e, z) by 2 and the 
fourth and final labeling. The flow is 
maximal. 


You should check that the next iteration of the algorithm produces the labeling 
shown in Figure 10.2.9. Increasing the flow by A = 2 produces Figure 10.2.10. 

We then return to the top of the while loop (line 3). Next, at lines 4-7 we set all 
labels to null. Then, at lines 8 and 9 we label vertex a as (—, oo) (see Figure 10.2.10). 
At line 10 we set U = {a}. We then enter the while loop (line 11). 

Since z is not labeled and U is not empty, we move to line 14, where we choose 
vertex a in U and remove it from U at line 15. At lines 19 and 20 we label vertex b as 
(a, 1) and we label vertex d as (a, 1). We add b and d to U so that U = {b, d}. 

We then return to the top of the while loop (line 11). Since z is not labeled and 
U is not empty, we move to line 14, where we choose a vertex in U. Suppose that we 
choose b. We remove b from U at line 15. At line 17, we examine edge (b, c). Since 
Fy. = Cyc, we do not label vertex c. Now U = {d}. 

We then return to the top of the while loop (line 11). Since z is not labeled and U is 
not empty, we move to line 14, where we choose vertex d in U and remove it from U at 
line 15. At line 17 we examine edges (d, c) and (d, e). Since Fae = Cac and Fae = Cae, 
we do not label either vertex c or vertex e. Now U = ©. 

We then return to the top of the while loop (line 11). Since z is not labeled, we 
move to line 12. Since U is empty, the algorithm terminates. The flow of Figure 10.2.10 
is maximal. < 


Our last example shows how to modify Algorithm 10.2.4 to generate a maximal 
flow from a given flow. 


Replace the zero flow in lines 1 and 2 of Algorithm 10.2.4 with the flow of Figure 10.2.11 
and then find a maximal flow. 


e 2,0 f 


Figure 10.2.11 After labeling. 


SOLUTION After initializing the given flow, we move to lines 4-7, where we set all 
labels to null. Then, at lines 8 and 9 we label vertex a as (—, oo) (see Figure 10.2.11). 
At line 10 we set U = {a}. We then enter the while loop (line 11). 
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Since z is not labeled and U is not empty, we move to line 14, where we choose 
vertex a in U and remove it from U at line 15. At lines 19 and 20, we label vertex b as 
(a, 1) and we label vertex d as (a, 1). We add b and d to U so that U = {b, d}. 

We then return to the top of the while loop (line 11). Since z is not labeled and U is 
not empty, we move to line 14, where we choose a vertex in U. Suppose that we choose 
b. We remove b from U at line 15. At line 17 we examine edges (b, c) and (e, b). Since 
Fye = Che, we do not label vertex c. At lines 25 and 26, vertex e is labeled (b, 1) since 


val(e) = min{val(b), F.5} = min{1, 2} = 1. 


We then return to the top of the while loop (line 11). We ultimately label z (see 
Figure 10.2.11), and at line 36 we find the path P = (a, b, e, f, z). Edges (a, b), (e, f), 
and (f, z) are properly oriented, so the flow in each is increased by 1. Since edge (e, b) 
is improperly oriented, its flow is decreased by 1. We obtain the flow of Figure 10.2.12. 

Another iteration of the algorithm produces the maximal flow shown in 
Figure 10.2.13. 


e 2,1 f 


Figure 10.2.12 After increasing the 
flow on path (a, b, e, f, z) by 1. Notice 
that edge (e, b) is improperly oriented 
and so has its flow decreased by 1. 


e 2,2 f 


Figure 10.2.13 After increasing the 
flow on path (a, d, e, f, z) by 1. The 
flow is maximal. 


A 


10.2 Review Exercises 


— 


. What is a maximal flow? 


- BB NY 


10.2 Exercises 


. What is an improperly oriented edge with respect to a path? 


5. Explain how to increase the flow under the conditions of 
Exercise 4. 


. What is a properly oriented edge with respect to a path? 


6. Explain how to find a maximal flow in a network. 


. When can we increase the flow in a path from the source to the 
sink? 


In Exercises 1-3, a path from the source a to the sink z in a network 2. c 
is given. Find the maximum possible increase in the flow obtain- 5.2 3.2 6.3 < 
able by altering the flows in the edges in the path. 5.1 : : é 
1. a b d 
40 3. 
4 & 6,2 4,2 
6,1 4 om 43) LI f 8,1 
a 
a d 
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In Exercises 4—12, use Algorithm 10.2.4 to find a maximal flow in In Exercises 13-18, find a maximal flow in each network starting 
each network. with the flow given. 
4. Figure 10.1.4 13. Figure 10.1.2 14. Exercise 1, Section 10.1 
5. Figure 10.1.5 15. Exercise 2, Section 10.1 16. Exercise 3, Section 10.1 
6. 17. Figure 10.1.4 with flows 
Faw, = 2, Fw,,b =2; Fpa = 0, Fea = 0, 
Faz = 0, Fa,w> = 0, Fy,,b =0, Foe =2, 
Fea = 4, Fez =4, Faw; _ 2, Fyw,.d = 2, 
Fue=2. 
18. Figure 10.1.4 with flows 
Faw, _ if Fyw,.b =1, Foa = 4, 
Fra = 2, Faz = 6, Faw; = 3, 
Fy,,b = 35 Foe = 0, Fp =1, 
7. Exercise 5, Section 10.1 Fn =, Faw, = 3, Fund = 3, 
Fa@e=3. 


8. Exercise 6, Section 10.1 
9. Exercise 7, Section 10.1 
10. Exercise 8, Section 10.1 
11. Exercise 9, Section 10.1 


19. Show that Algorithm 10.2.4 terminates. 


10.3 The Max Flow, Min Cut Theorem 


In this section we show that at the termination of Algorithm 10.2.4, the flow in the net- 
work is maximal. Along the way we will define and discuss cuts in networks. 

Let G be a network and consider the flow F at the termination of Algorithm 10.2.4. 
Some vertices are labeled and some are unlabeled. Let P (P) denote the set of labeled 
(unlabeled) vertices. (Recall that P denotes the complement of P.) Then the source a is 
in P and the sink z is in P. The set S of edges (v, w), with v € P and w € P, is called 
a cut, and the sum of the capacities of the edges in S is called the capacity of the cut. 
We will see that this cut has a minimum capacity and, since a minimal cut corresponds 
to a maximal flow (Theorem 10.3.9), the flow F is maximal. We begin with the formal 
definition of cut. 

Throughout this section, G is a network with source a and sink z. The capacity of 
edge (i, j) is Cj. 
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Definition 10.3.1 > A cut (P,P) in G consists of a set P of vertices and the 
complement P of P, witha € Pandze P. < 


Example 10.3.2 Consider the network G of Figure 10.3.1. If we let P = {a, b, d}, then P = {c, e, f, z} 
and (P, P) is a cut in G. As shown, we sometimes indicate a cut by drawing a dashed 
line to partition the vertices. 


“e¢ 21 f 


Figure 10.3.1 A cut in a network. The 

dashed line divides the vertices into sets 

P = {a, b, d} and P = {c, e, f, z} 

producing the cut (P, P). < 


Example 10.3.3 Figure 10.2.10 shows the labeling at the termination of Algorithm 10.2.4 for a particular 
————— network. If we let P (P) denote the set of labeled (unlabeled) vertices, we obtain the cut 
shown in Figure 10.3.2. 


af 232 e 


Figure 10.3.2 A network at termination 

of Algorithm 10.2.4. The cut (P, P), 

P = {a, b, d}, is obtained by letting P be 

the set of labeled vertices. < 


We next define the capacity of a cut. 


Definition 10.3.4 >» The capacity of the cut (P, P) is the number 


Cer >. FG, 


ieP jeP < 
Example 10.3.5 The capacity of the cut of Figure 10.3.1 is Cye + Cae = 8. < 
Example 10.3.6 The capacity of the cut of Figure 10.3.2 is Cy. + Cue + Cue = 6. < 


The next theorem shows that the capacity of any cut is always greater than or equal 
to the value of any flow. 
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Theorem 10.3.7 — Let bea flow in Gand let (P, P) be acut in G. Then the capacity of (P, P) is greater 
than or equal to the value of F; that is, 


Se a (10.3.1) 


ieP jeP i 


(The notation }7; means the sum over all vertices i.) 


Proof Note that 


Syn=LYA, 


jeP ieP jeP ieP 


since either side of the equation is merely the sum of Fj over all i, j € P. 


Now 
De 2 POY 
i jeP i jeP i 
= Daath Fr DD 
jeP icP jeP icP jeP icP jeP icP 
=U - Fs Rs CG. < 
jeP icP jeP icP jeP icP jeP icP 
Example 10.3.8 In Figure 10.3.1, the value 5 of the flow is less than the capacity 8 of the cut. < 


A minimal cut is a cut having minimum capacity. 


Theorem 10.3.9 Max Flow, Min Cut Theorem 


Let F be a flow in G and let (P, P) be a cut in G. If equality holds in (10.3.1), then the 
flow is maximal and the cut is minimal. Moreover, equality holds in (10.3.1) if and 


only if 
(@) fy = Cy foe! € IP, IP 
and 


(b) Fj =0 forie P,jeP. 


Proof The first statement follows immediately. 
The proof of Theorem 10.3.7 shows that equality holds precisely when 


DV Fi=9 and YD = DDG 


jeP icP jeP icP jeP icP 
thus the last statement is also true. < 


Example 10.3.10 In Figure 10.3.2, the value of the flow and the capacity of the cut are both 6; therefore, 
the flow is maximal and the cut is minimal. < 


We can use Theorem 10.3.9 to show that Algorithm 10.2.4 produces a maximal 
flow. 
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Theorem 10.3.11 


At termination, Algorithm 10.2.4 produces a maximal flow. Moreover, if P (respec- 


tively, P) is the set of labeled (respectively, unlabeled) vertices at the termination of 
Algorithm 10.2.4, the cut (P, P) is minimal. 


Proof Let P (P) be the set of labeled (unlabeled) vertices of G at the termination of 
Algorithm 10.2.4. Consider an edge (i, 7), where i € P, j € P. Since iis labeled, we must 


have 


Fy = Cy; 


otherwise, we would have labeled j at lines 19 and 20. Now consider an edge (j, i), where 
J €P,ieé P. Since iis labeled, we must have 


Fj = 0; 


otherwise, we would have labeled j at lines 25 and 26. By Theorem 10.3.9, the flow at the 
termination of Algorithm 10.2.4 is maximal and the cut (P, P) is minimal. < 


10.3 Review Exercises 


— 


. What is a cut in a network? 


Nn 


. What is the capacity of a cut? 


7s) 


. What is the relation between the capacity of any cut and the 
value of any flow? 


Ps 


. What is a minimal cut? 


5. State the max flow, min cut theorem. 


6. Explain how the max flow, min cut theorem proves that the al- 
gorithm of Section 10.2 correctly finds a maximal flow in a net- 
work. 


10.3 Exercises 


In Exercises 1-3, find the capacity of the cut (P, P). Also, deter- 
mine whether the cut is minimal. 

1. P = {a, d} for Exercise 1, Section 10.1 

2. P = {a, d, e} for Exercise 2, Section 10.1 

3. P = {a, b, c, d} for Exercise 3, Section 10.1 


In Exercises 4-16, find a minimal cut in each network. 


4. Figure 10.1.1 5. Figure 10.1.4 
6. Figure 10.1.5 7. Exercise 1, Section 10.1 
8. Exercise 2, Section 10.1 9. Exercise 3, Section 10.1 
10. Exercise 4, Section 10.1 11. Exercise 5, Section 10.1 
12. Exercise 6, Section 10.1 13. Exercise 7, Section 10.1 
14. Exercise 8, Section 10.1 15. Exercise 9, Section 10.1 
16. Exercise 12, Section 10.2 
Exercises 17-22 refer to a network G that, in addition to having 


nonnegative integer capacities Cj, has nonnegative integer mini- 
mal edge flow requirements mj. That is, a flow F must satisfy 


mi < Fj S Cy 
for all edges (i, j). 


17. Give an example of a network G, in which mj < Cj for all 
edges (i, 7), for which no flow exists. 


Define 
Cer = Sy Gs 
ieP jeP 
m(P,P) => So my, m(P, P) = S> > my. 
ieP jeP ieP jeP 


18. Show that the value V of any flow satisfies 
m(P, P) — C(P, P) < V < C(P, P) — m@?, P) 
for any cut (P, P). 
19. Show that if a flow exists in G, a maximal flow exists in G with 
value 
min{C(P, P) — m(P, P) | (P, P) is a cut in G}. 
20. Assume that G has a flow F. Develop an algorithm for finding 
a maximal flow in G. 
21. Show that if a flow exists in G, a minimal flow exists in G with 
value 
max{m(P, P) — C(P, P) | (P, P) is a cut in G}. 
22. Assume that G has a flow F. Develop an algorithm for finding 
a minimal flow in G. 


23. True or false? If F is a flow in a network G and (P, P) is a cut 
in G and the capacity of (P, P) exceeds the value of the flow, 
F, then the cut (P, P) is not minimal and the flow F is not 
maximal. If true, prove it; otherwise, give a counterexample. 
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10.4 Matching 


In this section we consider the problem of matching elements in one set to elements in 
another set. We will see that this problem can be reduced to finding a maximal flow ina 
network. We begin with an example. 


Example 10.4.1 Suppose that four persons A, B, C, and D apply for five jobs J), Jz, J3, J4, and Js. Sup- 
———————=_ pose that applicant A is qualified for jobs Jz and Js; applicant B is qualified for jobs J2, 
and Js; applicant C is qualified for jobs J), J3, Ja, and Js; and applicant D is qualified for 

jobs Jz and Js. Is it possible to find a job for each applicant? 

The situation can be modeled by the graph of Figure 10.4.1. The vertices represent 
the applicants and the jobs. An edge connects an applicant to a job for which the applicant 
is qualified. We can show that it is not possible to match a job to each applicant by 
considering applicants A, B, and D, who are qualified for jobs Jz and Js. If A and B 
are assigned a job, none remains for D. Therefore, no assignments exist for A, B, C, 
and D. 


Ji 
A = 
Jy 
B —— 
0 J3 
Cc 
ety 
D - 
Js 


Figure 10.4.1 Applicants (A, B, C, D) and jobs 

(J), Jo, J3, J4, Js). An edge connects an applicant to a 

job for which the applicant is qualified. The black 

lines show a maximal matching (i.e., the maximum 

number of applicants have jobs). < 


In Example 10.4.1 a matching consists of finding jobs for qualified persons. A 
maximal matching finds jobs for the maximum number of persons. A maximal matching 
for the graph of Figure 10.4.1 is shown with black lines. A complete matching finds jobs 
for everyone. We showed that the graph of Figure 10.4.1 has no complete matching. The 
formal definitions follow. 


Definition 10.4.2 > Let G be a directed, bipartite graph with disjoint vertex 
sets V and W in which the edges are directed from vertices in V to vertices in W. (Any 
vertex in G is either in V or in W.) A matching for G is a set of edges E with no vertices in 
common. A maximal matching for G is a matching EF in which EF contains the maximum 
number of edges. A complete matching for G is a matching E having the property that 
if v € V, then (v, w) € E for some w € W. < 


Example 10.4.3 The matching for the graph of Figure 10.4.2, shown with black lines, is a maximal match- 
—e—=-—_ ing and a complete matching. 
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Ww 
a = — X 
ie = Y 
. _ Z 


Figure 10.4.2 The black lines show a maximal 

matching (the maximum number of edges are used) 

and a complete matching (each of A, B, and C is 

matched). < 


In the next example we illustrate how the matching problem can be modeled as a 
network problem. 


Example 10.4.4 A Matching Network Model the matching problem of Example 10.4.1 as a network. 


SOLUTION We first assign each edge in the graph of Figure 10.4.1 capacity 1 (see 
Figure 10.4.3). Next we add a supersource a and edges of capacity | from a to each of 
A, B, C, and D. Finally, we introduce a supersink z and edges of capacity 1 from each 
of J), Jo, J3, J4, and Js to z. We call a network such as that of Figure 10.4.3 a matching 
network. 


Jy 


Figure 10.4.3 The matching problem (Figure 10.4.1) as a 
matching network. <4 


The next theorem relates matching networks and flows. 


Theorem 10.4.5 Let G be a directed, bipartite graph with disjoint vertex sets V and W in which the 
edges are directed from vertices in V to vertices in W. (Any vertex in G is either in V 
or in W.) 


(a) A flow in the matching network gives a matching in G. The vertex v € V is 
matched with the vertex w € W if and only if the flow in edge (v, w) is 1. 


(b) A maximal flow corresponds to a maximal matching. 


(c) A flow whose value is |V| corresponds to a complete matching. 


Proof Let a (z) represent the source (sink) in the matching network, and suppose 
that a flow is given. 


Example 10.4.6 


Go Online 

For more on the Marriage 
Problem, see 
goo.gl/Zs13xb 


Theorem 10.4.7 
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Suppose that the edge (v, w), v € V, w € W, has flow 1. The only edge into vertex 
v is (a, v). This edge must have flow 1; thus the flow into vertex v is 1. Since the flow 
out of v is also 1, the only edge of the form (v, x) having flow 1 is (v, w). Similarly, the 
only edge of the form (x, w) having flow | is (v, w). Therefore, if E is the set of edges 
of the form (v, w) having flow 1, the members of F have no vertices in common; thus E 
is a matching for G. 

Parts (b) and (c) follow from the fact that the number of vertices in V matched is 
equal to the value of the corresponding flow. < 


Since a maximal flow gives a maximal matching, Algorithm 10.2.4 applied to a 
matching network produces a maximal matching. In practice, the implementation of 
Algorithm 10.2.4 can be simplified by using the adjacency matrix of the graph (see 
Exercise 11). 


The matching of Figure 10.4.1 is represented as a flow in Figure 10.4.3. Since the flow 
is maximal, the matching is maximal. < 


Next, we turn to the existence of a complete matching in a directed, bipartite graph 
G with vertex sets V and W. If S C V, we let 


R(S) = {w € W | v € Sand (v, w) is an edge in G}. 


Suppose that G has a complete matching. If S C V, we must have |$| < |R(S)|. It turns 
out that if |S| < |R(S)| for all subsets S of V, then G has a complete matching. This 
result was first given by the English mathematician Philip Hall and is known as Hall’s 
Marriage Theorem, since if V is a set of men and W is a set of women and edges exist 
from v € V to w € W if v and w are compatible, the theorem gives a condition under 
which each man can marry a compatible woman. 


Hall's Marriage Theorem 


Let G be a directed, bipartite graph with disjoint vertex sets V and W in which the 
edges are directed from vertices in V to vertices in W. (Any vertex in G is either in V 
or in W.) There exists a complete matching in G if and only if 


IS] <|R(S)| forall SCV. (10.4.1) 


Proof We have already pointed out that if there is a complete matching in G, condi- 
tion (10.4.1) holds. 

Suppose that condition (10.4.1) holds. Let n = |V| and let (P, P) be a minimal cut 
in the matching network. If we can show that the capacity of this cut is n, a maximal 
flow would have value n. The matching corresponding to this maximal flow would be a 
complete matching. 

The argument is by contradiction. Assume that the capacity of the minimal cut 
(P, P) is less than n. The capacity of this cut is the number of edges in the set 


E={(x,y)|xeP,yeP} 
(see Figure 10.4.4). A member of E is one of the three types: 


Typel: (a,v),veV; 
Type ll: (v,w),veV,we W; 
Type Il: (w,z),w e W. 


We will estimate the number of edges of each type. 
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Figure 10.4.5 The proof of 
Theorem 10.4.7 for n = 3. If 
V CP, as shown the capacity 
of the cut is n. Since we are 
assuming that the capacity of 
the cut is less than n, this case 
cannot occur. Therefore, VM P 
is nonempty. 


Example 10.4.8 


Example 10.4.9 


V* 


Figure 10.4.4 The proof of Theorem 10.4.7. 

V = (1, V2, V3, v4, V5}s 2 = |V] = 5; W = {w1, Wo, ws}; the 
cut is (P, P), where P = {a, v3, v4, Vs, W3}; 

V* = VOP = (v3, va, v5}; R(V*) = {wo, w3}; 

W, = R(V*) NP = {ws}; W2 = R(V*) OP = {w9}; 

E = {(a, v1), (a, v2), (V3, W2), (w3, z)}. The capacity of the cut 
is |E| = 4 <n. The type I edges are (a, v,) and (a, v2). Edge 
(v3, W2) is the only type II edge, and edge (w3, z) is the only 
type III edge. 


If V C P, the capacity of the cut is n (see Figure 10.4.5); thus V* = VN P is 
nonempty. It follows that there are n — |V*| edges in EF of type I. 
We partition R(V*) into the sets 


Wi =R(V*) NP and W2 = R(V*) OP. 
Then there are at least |W1| edges in E of type III. Thus there are less than 
n—(n—|V*|)—|Wi] = |V*| — [Wi 
edges of type II in E. Since each member of W2 contributes at most one type II edge, 
|W2| < |V*| —|W\]. 
Thus 
IR(V*)| = |Wi| + [Wa] < |V"1, 


which contradicts (10.4.1). Therefore, a complete matching exists. < 


For the graph in Figure 10.4.1, if S = {A, B, D}, we have R(S) = {J2, Js} and 
|S] =3 >2 = |R(S)I. 


By Theorem 10.4.7, there is not a complete matching for the graph of Figure 10.4.1. < 


There are n computers and n disk drives. Each computer is compatible with m > 0 disk 
drives and each disk drive is compatible with m computers. Is it possible to match each 
computer with a compatible disk drive? 


SOLUTION Let V be the set of computers and W be the set of disk drives. An edge 
exists from v € V tow € W if v and w are compatible. Notice that every vertex has 


degree m. Let S = {vy .. 


IfR(S) = {wi,... 
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. , Vz} be a subset of V. Then there are km edges from the set S. 
, w;}, then R(S) receives at most jm edges from S. Therefore, km < jm. 


Now |S| =k <j = |R(S)|. By Theorem 10.4.7 there is a complete matching. Thus it is 
possible to match each computer with a compatible disk drive. < 


10.4 Review Exercises 


1. What is a matching? 


2. What is a maximal matching? 


3. What is a complete matching? 


10.4 Exercises 


1. 


2. 


Show that the flow in Figure 10.4.3 is maximal by exhibiting 
a minimal cut whose capacity is 3. 


Find the flow that corresponds to the matching of Fig- 
ure 10.4.2. Show that this flow is maximal by exhibiting a min- 
imal cut whose capacity is 3. 


Exercises 3—7 refer to Figure 10.4.1, where we reverse the direction 
of the edges so that edges are directed from jobs to applicants. 


3. 


4 
5 
6 
7 


What does a matching represent? 


. What does a maximal matching represent? 
. Show a maximal matching. 
. What does a complete matching represent? 


. Is there acomplete matching? If there is a complete matching, 


show one. If there is not a complete matching, explain why 
none exists. 


. Applicant A is qualified for jobs J; and J4; B is qualified for 


jobs Jz, J3, and Je; C is qualified for jobs J;, J3, Js, and Jg; D 
is qualified for jobs J), J3, and J4; and E is qualified for jobs 
J,, J3, and Je. 


(a) Model this situation as a matching network. 
(b) Use Algorithm 10.2.4 to find a maximal matching. 


(c) Is there a complete matching? 


. Applicant A is qualified for jobs J), J2, J4, and Js; B is qualified 


for jobs J;, J4, and Js; C is qualified for jobs J, J4, and J5; D 
is qualified for jobs J; and Js; E is qualified for jobs J2, J3, and 
Js; and F is qualified for jobs J4 and Js. Answer parts (a)—(c) 
of Exercise 8 for this situation. 


. Applicant A is qualified for jobs J), J2, and J4; B is qualified 


for jobs J3, J4, Js, and Jg; C is qualified for jobs J; and J5; D 
is qualified for jobs J), J3, J4, and Jg; E is qualified for jobs 
J, J2, Ja, Jo, and Jg; F is qualified for jobs J4 and J6; and G 
is qualified for jobs J3, Js, and J7. Answer parts (a)—(c) of Ex- 
ercise 8 for this situation. 


. Five students, V, W,X, Y, and Z, are members of four com- 


mittees, C;, C2, C3, and Cy. The members of C are V, X, and 
Y; the members of Cz are X and Z; the members of C3 are 


4. What is the relation between flows and matchings? 


5. State Hall’s Marriage Theorem. 


V, Y, and Z; and the members of C4 are V, W, X, and Z. Each 
committee is to send a representative to the administration. No 
student can represent two committees. 


(a) Model this situation as a matching network. 

(b) What is the interpretation of a maximal matching? 
(c) What is the interpretation of a complete matching? 
(d) Use Algorithm 10.2.4 to find a maximal matching. 


(e) Is there a complete matching? 


. Show that by a suitable ordering of the vertices, the adjacency 


matrix of a bipartite graph can be written 


0 A 
Ar oy? 


where 0 is a matrix consisting only of 0’s and A? is the trans- 
pose of the matrix A. 


In Exercises 13-15, G is a bipartite graph, A is the matrix of Exer- 
cise 12, and F is a flow in the associated matching network. Label 
each entry in A that represents an edge with flow 1. 


. What kind of labeling corresponds to a matching? 
. What kind of labeling corresponds to a complete matching? 
. What kind of labeling corresponds to a maximal matching? 


. Restate Algorithm 10.2.4, applied to a matching network, in 


terms of operations on the matrix A of Exercise 12. 


Let G be a directed, bipartite graph with disjoint vertex sets V and 
W in which the edges are directed from vertices in V to vertices in 
W. (Any vertex in G is either in V or in W.) We define the deficiency 
of G as 


5(G) = max{|S| — |R(S)| | SC V}. 


. Show that G has a complete matching if and only if 5(G) = 0. 


. Show that the maximum number of vertices in V that can be 


matched with vertices in W is |V| — 5(G). 


. True or false? Any matching is contained in a maximal match- 


ing. If true, prove it; if false, give a counterexample. 
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Problem-Solving Corner 


Matching 


Problem 


Let G be a directed, bipartite graph with disjoint vertex 
sets V and W in which the edges are directed from ver- 
tices in V to vertices in W. (Any vertex in G is either in 
V or in W.) Let My denote the maximum degree that 
occurs among vertices in W, and let my denote the min- 
imum degree that occurs among vertices in V. Show 
that if O<My< my, then G has a complete matching. 


Attacking the Problem 


Hall’s Marriage Theorem (Theorem 10.4.7) says that a 
directed, bipartite graph with disjoint vertex sets V and 
W has a complete matching if and only if |S| < |R(S)| 
for all S C V. Thus a possible way to solve the problem 
is to show that the given condition My < my implies 
that |S| < |R(S)| for all S$ C V. 


Finding a Solution 


Our goal is to prove that if My < my, then |S| < |R(S)| 
for all S C V. Let’s start with an example graph G for 
which My < my; in fact, let’s arrange to have Mw = 2 
and my = 3: 


il 4 

5 

6 

2 7 

8 

3 9 
V Ww 


Consider an example subset §S = {1, 3} of V and 
the edges incident on vertices in S: 


4 
1 5 
6 
ei 
3 8 
9 

S R(S) 


The fact that the minimum degree of vertices in V is 3 
means that for any subset S of V, at least three edges 
are incident on each vertex in S. In general, there will 
be at least 3|S| = my|S| edges incident on vertices in S. 
In our example, 3|5| = 6, but there are actually seven 


edges incident on vertices in S. The expression my|S| is 
always a lower bound for the number of edges incident 
on vertices in S. 

The fact that the maximum degree of vertices in 
W is 2 means that for any subset S of V, at most two 
edges are incident on each vertex in R(S). In general, 
there will be at most 2|R(S)| = My|R(S)| edges inci- 
dent on vertices in R(S). In our example, 2|R(S)| = 12, 
but there are actually 10 edges incident on vertices in 
R(S). Since the edges incident on vertices in S are a 
subset of the edges incident on vertices in R(S), the 
expression My|R(S)| is always an upper bound for the 
number of edges incident on vertices in S. 

We have two ways of estimating the number of 
edges incident on vertices in S. The first way, using S, 
gives us a lower bound my|S| on the number of edges, 
and the second way, using R(S), gives us an upper 
bound My|R(S)| on the number of edges. Comparing 
these estimates gives us the inequality 


my|S| < Myw|R(S)|. 
We can’t deduce || < |R(S)|, but we haven’t used the 
hypothesis 
My < my 
yet! Combining the last two inequalities, we have 
my|S| < Mw|R(S)| < my|R(S)|. 


If we now cancel my from both ends of the inequality, 
we obtain 


|S| < |R(S)I, 


which is exactly the inequality we wanted to prove. 


Formal Solution 


Let § C V. Each vertex in S is incident on at least 
my|S| edges; thus there are at least my|S| edges inci- 
dent on vertices in S$. Each vertex in R(S) is incident 
on at most My edges; thus there are at most My|R(S)| 
edges incident on vertices in R(S). It follows that 
my|S| < My|R(S)|. Since My < my, |R(S)|Mw < 
|R(S)|my. Therefore, my|S| < my|R(S)|, and |S| < 
|R(S)|. By Theorem 10.4.7, G has a complete 
matching. 


Summary of Problem-Solving Techniques 


= Look at example graphs. 


= When looking at examples, it’s a good idea to 
assign distinct values to the parameters in the 
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problem so you can keep them straight. (In our way, an equality can sometimes be proved by count- 
example we set My = 2 and my = 3.) ing the number of elements in some set in two differ- 
m= Try to reduce given conditions to those in a use- ent ways. If one way of counting gives c and the other 
ful theorem. (We reduced the conditions given way of counting gives c2, it follows that c; = cz. These 
in this problem to the conditions given in Hall’s techniques are widely used and their usefulness can- 
Marriage Theorem.) not be overemphasized. For example, in Section 6.2 we 


derived a formula for C(n, r) by counting the number 


m= An inequality can sometimes be proved by es- ; 3 ; 
lane P y of r-permutations of an n-element set in two different 


timating the size of some set in two different 


ways. If one estimate gives an upper bound M eee 
and another gives a lower bound m, it follows Exercise 
that m < M. 1. Give an example of a bipartite graph G that has a 
complete matching but does not satisfy the condi- 
Comments 


tion M Ww <My. 
The last summarized problem-solving technique pro- 
vides a method of proving an inequality. In a similar 
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General references that contain sections on network models are [Berge; Deo; Liu, 1968, 
1985; and Tucker]. The classic work on networks is [Ford]; many of the results on networks, 
especially the early results, are due to Ford and Fulkerson, the authors of this book. [Tarjan] 
discusses network flow algorithms and implementation details. 

See [Bachelis] for a nice direct proof of Hall’s Marriage Theorem (Theorem 10.4.7) 
using mathematical induction. 

The problem of finding a maximal flow in a network G, with source a, sink z, and 
capacities Cj, may be rephrased as follows: 


maximize > Foj 
j 


subject to 


0<F; < Cj for all Ld 


So Fy = Fi for all j. 


Such a problem is an example of a linear programming problem. In a linear pro- 
gramming problem, we want to maximize (or minimize) a linear expression, such as 7; Fj, 
subject to linear inequality and equality constraints, such as 0 < Fy < Cj and $0; Fy = 
>); Fi. Although the simplex algorithm is normally an efficient way to solve a 
general linear programming problem, network transport problems are usually more efficiently 
solved using Algorithm 10.2.4. See [Hillier] for an exposition of the simplex 
algorithm. 

Suppose that for each edge (7, j) in a network G, cj; represents the cost of the flow of 
one unit through edge (i, /). Suppose that we want a maximal flow, with minimal cost 


>> Fy. 
ij 


This problem, called the transportation problem, is again a linear programming problem 
and, as with the maximal flow problem, a specific algorithm can be used to obtain a solution 
that is, in general, more efficient than the simplex algorithm (see [Hillier]). 


530 


Chapter 10 ® Network Models 


Chapter 10 Review 


Section 10.1 


= 
i) 


11. 
12. 


CHIDAM EWN 


. (Transport) network 
. Source 


Sink 


. Capacity 


Flow in a network 


. Flow in an edge 

. Flow into a vertex 

. Flow out of a vertex 

. Conservation of flow 

. Given a flow F in a network, the flow out of the source 


equals the flow into the sink. This common value is called 
the value of the flow F. 

Supersource 

Supersink 


Section 10.2 


13. 
14. 
15. 
16. 


Maximal flow 

Properly oriented edge with respect to a path 

Improperly oriented edge with respect to a path 

How to increase the flow in a path from the source to the 

sink when: 

(a) for each properly oriented edge the flow is less than the 
capacity and 


Chapter 10 Self-Test 


17. 


(b) each improperly oriented edge has positive flow (see 
Theorem 10.2.3) 


How to find a maximal flow in a network (Algorithm 10.2.4) 


Section 10.3 


18. 
19. 
20. 


21. 
22. 
23. 


Cut in a network 

Capacity of a cut 

The capacity of any cut is greater than or equal to the value 
of any flow (Theorem 10.3.7). 

Minimal cut 

Max flow, min cut theorem (Theorem 10.3.9) 

At the termination of the maximal flow algorithm, Algo- 
rithm 10.2.4, the set of labeled vertices defines a mini- 
mal cut. 


Section 10.4 


24. 
25. 
26. 
27. 
28. 


29. 


Matching 
Maximal matching 
Complete matching 


Matching network 

Relationship between flows and matchings (Theo- 
rem 10.4.5) 

Hall’s Marriage Theorem (Theorem 10.4.7) 


Exercises 1-4 refer to the following network. The capacities are 
shown on the edges. 


b 3 Cc 4 d 
6 5 
4 2 7 Zz 
3 7 
e 6 f 1 g 
. Explain why 
on Ep 25 Poce= > 
Foa=3, Faz=3, Fap=1, 


with all other F,, = 0, is a flow. 


. What is the flow into b? 
. What is the flow out of c? 
. What is the value of the flow F? 


5. 


For the flow of Exercise 1, find a path from a to z satisfying 
the following: (a) for each properly oriented edge the flow 
is less than the capacity and (b) each improperly oriented 
edge has positive flow. 


. By modifying only the flows in the edges of the path of 


Exercise 5, find a flow with a larger value than F. 


. Use Algorithm 10.2.4 to find a maximal flow in the network 


of Exercise 1 (beginning with the flow in which the flow in 
each edge is equal to zero). 


. Use Algorithm 10.2.4 to find a maximal flow in the follow- 
ing network (beginning with the flow in which the flow in 
each edge is equal to zero). 


9. In each of parts (a)—-(d), answer true if the statement is true 
for every network; otherwise, answer false. 
(a) If the capacity of a cut in a network is equal to Ca, then 
the value of any flow is less than or equal to Ca. 
(b) Ifthe capacity of a cut in a network is equal to Ca, then 
the value of any flow is greater than or equal to Ca. 
(c) If the capacity of a cut in a network is equal to Ca, then 
the value of some flow is greater than or equal to Ca. 
(d) Ifthe capacity of a cut in a network is equal to Ca, then 
the value of some flow is less than or equal to Ca. 
10. Find the capacity of the cut (P, P) in the network of Exer- 
cise 1, where P = {a, b, e, f}. 
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11. Is the cut (P,P), P = {a, b, e, f}, in the network of Exer- 
cise | minimal? Explain. 


12. Find a minimal cut in the network of Exercise 8. 

Exercises 13-16 refer to the following situation. Applicant A is 
qualified for jobs Jz, J4, and Js; applicant B is qualified for jobs 
J, and J3; applicant C is qualified for jobs J, J3, and Js; and 
applicant D is qualified for jobs J3 and Js. 

13. Model the situation as a matching network. 

14. Use Algorithm 10.2.4 to find a maximal matching. 

15. Is there a complete matching? 


16. Find a minimal cut in the matching network. 
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1. Write a program that accepts as input a network with a given 
flow and outputs all possible paths from the source to the sink 
on which the flow can be increased. 


2. Implement Algorithm 10.2.4 that finds a maximal flow in a 
network as a program. Have the program output the minimal 
cut as well as the maximal flow. 


3. Write a program that computes the deficiency of a network. 


11.1. Combinatorial Circuits 


11.2 Properties of 
Combinatorial Circuits 


11.3 Boolean Algebras 


11.4 Boolean Functions and 
Synthesis of Circuits 


11.5 Applications 


Go Online 
Biographies of Boole 
and Shannon are at 
goo.gl/5HWhOy 


11.1 


Go Online 


For more on combinatorial 
circuits, see 
goo.gl/5HWhOy 
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BOOLEAN ALGEBRAS 
AND COMBINATORIAL 
CIRCUITS 


Several definitions honor the nineteenth-century mathematician George Boole—Boolean 
algebra, Boolean function, Boolean expression, and Boolean ring—to name a few. Boole 
is one of the persons in a long historical chain who were concerned with formalizing and 
mechanizing the process of logical thinking. In fact, in 1854 Boole wrote a book entitled 
The Laws of Thought. One of Boole’s contributions was the development of a theory of 
logic using symbols instead of words. For a discussion of Boole’s work, see [Hailperin]. 

Almost a century after Boole’s work, it was observed, especially by C. E. Shannon 
in 1938 (see [Shannon]), that Boolean algebra could be used to analyze electrical cir- 
cuits. Thus Boolean algebra became an indispensable tool for the analysis and design of 
electronic computers in the succeeding decades. We explore the relationship of Boolean 
algebra to circuits throughout this chapter. 


Combinatorial Circuits 


In a digital computer, there are only two possibilities, written 0 and 1, for the smallest, 
indivisible object. All programs and data are ultimately reducible to combinations of bits. 
A variety of devices have been used throughout the years in digital computers to store 
bits. Electronic circuits allow these storage devices to communicate with each other. A 
bit in one part of a circuit is transmitted to another part of the circuit as a voltage. Thus 
two voltage levels are needed—for example, a high voltage can communicate | and a 
low voltage can communicate 0. 

In this section we discuss combinatorial circuits. The output of a combinatorial 
circuit is uniquely defined for every combination of inputs. A combinatorial circuit has 
no memory; previous inputs and the state of the system do not affect the output of a 
combinatorial circuit. Circuits for which the output is a function, not only of the inputs, 
but also of the state of the system, are called sequential circuits and are considered in 
Chapter 12. 

Combinatorial circuits can be constructed using solid-state devices, called gates, 
which are capable of switching voltage levels (bits). We will begin by discussing AND, 
OR, and NOT gates. 


»—\S-— 


Figure 11.1.3 NOT 


gate. 


Example 11.1.4 


Example 11.1.5 


11.1. @ Combinatorial Circuits 533 


Definition 11.1.1 > An AND gate receives inputs x; and x», where x, and x, 
are bits, and produces output denoted x; A x2, where 


1 ifx) =landxy=1 


xi Ax = { 7 
0 otherwise. 


An AND gate is drawn as shown in Figure 11.1.1. 


*1 
X, A X2 
x2 


Figure 11.1.1 AND gate. < 


Definition 11.1.2 >» AnOR gate receives inputs x, and x2, where x; and x are 
bits, and produces output denoted x; V x2, where 


1 ifx) =lor»m=1 


xX,V x2 = { ‘ 
0 otherwise. 


An OR gate is drawn as shown in Figure 11.1.2. 
—e— 
X1 V X2 
x2 
Figure 11.1.2 OR gate. < 


Definition 11.1.3 » A NOT gate (or inverter) receives input x, where x is a bit, 
and produces output denoted x, where 


at ifx=0 
~ 0 ake f, 
A NOT gate is drawn as shown in Figure 11.1.3. < 


The logic table of a combinatorial circuit lists all possible inputs together with the 
resulting outputs. 


Following are the logic tables for the basic AND, OR, and NOT circuits 
(Figures 11.1.1—11.1.3). 


xX, x2 xX, AX2 xy x2 x, V x2 x x 

1 1 1 1 1 1 1 0 
1 O 0 1 O 1 Oo. 1 
0 1 0 Oo 1 1 ——s 
0 0 0 0 0 0 


We note that performing the operation AND (OR) is the same as taking the minimum 
(maximum) of the two bits x; and x2. < 


The circuit of Figure 11.1.4 is an example of a combinatorial circuit since the output y 
is uniquely defined for each combination of inputs x;, x2, and x3. 
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Example 11.1.6 


Example 11.1.7 


x1 
x2 
X3 
Figure 11.1.4 A combinatorial circuit. 


The logic table for this combinatorial circuit follows. 


Xt XQ y 
1 1 1 0 
1 1 0 0 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0 1 0 1 
0 0 if 0 
0 0 0 1 


Notice that all possible combinations of values for the inputs x;, x2, and x3 are 
listed. For a given set of inputs, we can compute the value of the output y by tracing 
the flow through the circuit. For example, the fourth line of the table gives the value of 
the output y for the input values x, = 1, x. = 0, and x3 = 0. If x; = 1 and x. = 0, the 
output from the AND gate is 0 (see Figure 11.1.5). Since x; = 0, the inputs to the OR 
gate are both 0. Therefore, the output of the OR gate is 0. Since the input to the NOT 
gate is 0, it produces output y = 1. 


yaa 


Figure 11.1.5 The circuit of Figure 11.1.4 when x; = 1 and 
2 = = 0. < 


The circuit of Figure 11.1.6 is not a combinatorial circuit, because the output y is not 
uniquely defined for each combination of inputs x; and x2. For example, suppose that 
x; = 1 and x. = O. If the output of the AND gate is 0, then y = 0. On the other hand, 
if the output of the AND gate is 1, then y = 1. Such a circuit might be used to store 
one bit. 


x1 


x2 


Figure 11.1.6 A circuit that is not a 
combinatorial circuit. < 


Individual combinatorial circuits may be interconnected. The combinatorial circuits 
Ci, C2, and C3 of Figure 11.1.7 may be combined, as shown, to obtain the combina- 
torial circuit C. < 
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x1 


x3 


x4 yo CC 


X5 


X6 
y3 C3 


x1 y1 x5 
x2 
x3 yo X6 ve 


x4 


Figure 11.1.7 Combinatorial circuit C is obtained by interconnecting 
the combinatorial circuits C;, Cy, and C3. 


Example 11.1.8 A combinatorial circuit with one output, such as that in Figure 11.1.4, can be represented 
—_—=<——_ by an expression using the symbols A, V, and ~. We follow the flow of the circuit sym- 
bolically. First, x; and x2 are ANDed (see Figure 11.1.8), which produces output x; A x2. 
This output is then ORed with x3 to produce output (x; A x2) V x3. This output is then 

NOTed. Thus the output y may be 


y= | AX) V x3. (11.1.1) 


Expressions such as (11.1.1) are called Boolean expressions. 


X41 
(x1 A X2) V x3 


x2 


. y = (41 A X2) V x3 
3 


Figure 11.1.8 Representation of a combinatorial circuit by a Boolean 


expression. 
< 
Definition 11.1.9 » Boolean expressions in the symbols x1, .. . , X, ate defined 
recursively as follows. 
OW Mia caceae hp (11.1.2) 
are Boolean expression. If X; and X2 are Boolean expressions, then 
(a) (X%), — (b) Xi, (©) Mi VX, ) Xi AX (11:4.3) 


are Boolean expressions. 
If X is a Boolean expression in the symbols x;, ...,X,, we Sometimes write 


X =X O15 205 Bq) 


Either symbol x or X is called a literal. < 
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Example 11.1.10 


Example 11.1.11 


Example 11.1.12 


Example 11.1.13 


x2 


x3 


Figure 11.1.9 The combinatorial 


Use Definition 11.1.9 to show that the right side of (11.1.1) is a Boolean expression in 
X1,X2, and x3. 


SOLUTION By (11.1.2), x; and x2 are Boolean expressions. By (11.1.3d), x1 A x2 isa 
Boolean expression. By (11.1.3a), (x1 A x2) is a Boolean expression. By (11.1.2), x3 isa 
Boolean expression. Since (x; A x2) and x3 are Boolean expressions, by (11.1.3c), so is 
(x1 A X2) V x3 .. Finally, we may apply (11.1.3b) to conclude that (x; A x2) V x3 is a 


Boolean expression. < 

If X = X(1,..., X,) is a Boolean expression and x), ...,X, are assigned values 
a\,..-, Gy in {0, 1}, we may use Definitions 11.1.1—11.1.3 to compute a value for X. We 
denote this value X(q),..., G,) or X(x; = aj). 


For x; = 1,x. = 0, and x3; = 0, the Boolean expression X (x), X2,.x3) = (X11 A x2) V x3 
of (11.1.1) becomes 


X(1,0,0) = (1A0) V0 
=0v0 since 1 AO =0 
=0 sinceO VO =0 
= 1 since 0 = 1. 
We have again computed the fourth row of the table in Example 11.1.5. < 


In a Boolean expression in which parentheses are not used to specify the order of 
operations, we assume that / is evaluated before v. 


For x; = 0, x2 = 0, and x3 = 1, the value of the Boolean expression x; A x2 V x3 is 
xX A21V%3=OADVI=0OVI=1. < 
Example 11.1.8 showed how to represent a combinatorial circuit with one output 


as a Boolean expression. The following example shows how to construct a combinatorial 
circuit that represents a Boolean expression. 


Find the combinatorial circuit corresponding to the Boolean expression 
(x1 A (2 V x3) V X2 
and write the logic table for the circuit obtained. 


SOLUTION We begin with the expression X2 V x3 in the innermost parentheses. This 
expression is converted to a combinatorial circuit, as shown in Figure 11.1.9. 

The output of this circuit is ANDed with x; to produce the circuit drawn in 
Figure 11.1.10. Finally, the output of this circuit is ORed with x2 to give the desired 
circuit drawn in Figure 11.1.11. The logic table follows. 


XA (BV xX 
te Vs 1 A (¥2 V x3) 


circuit corresponding to the Boolean Figure 11.1.10 The combinatorial circuit corresponding to 


expression X2 V x3. 


the Boolean expression x; A (X2 V x3). 
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Figure 11.1.11 The combinatorial circuit corresponding to the Boolean 


expression (x; A (X2 V x3)) V x2. 


x1 A (x2 V x3) 


Xp X23 (x1 A (X2 V x3) V x2 
1 1 1 1 
1 1 0 1 
I 0 1 1 
1 (0) 0 1 
0 1 1 1 
0 1 0 1 
0 0 1 0 
0 0 0 0 
11.1 Review Exercises 
1. What is a combinatorial circuit? 6. What is an inverter? 
2. What is a sequential circuit? 7. What is a logic table of a combinatorial circuit? 
3. What is an AND gate? 8. What is a Boolean expression? 
4. What is an OR gate? 9. What is a literal? 
5. What is a NOT gate? 
11.1 Exercises 
In Exercises 1-6, write the Boolean expression that represents the o: 
combinatorial circuit, write the logic table, and write the output of 
each gate symbolically as in Figure 11.1.8. x1 
1. x2 
x1 
2. *4 
xy 
x2 6. The circuit at the bottom of Figure 11.1.7. 
x3 Exercises 7-9 refer to the circuit 


(x1 A (¥2 V x3)) V x2 


x 
3. y 


x2 


x3 


7. Show that this circuit is not a combinatorial circuit. 
8. Show that if x = 0, the output y is uniquely determined. 
9. Show that if x = 1, the output y is undetermined. 


In Exercises 10-14, find the value of the Boolean expressions for 


HH 1, mH, x7 = 0, x= 1. 


10. xp Ax2 11. (x) A X2) V (x V X3) 


538 


12. x1 V (2 A x3) 
13. (x1 A (x2 V (1 A X2))) V (1 AX2) V 1 AX3)) 
14. (((x1 Ax2) V (03 AX4)) V (G1 V3) A G2 V x3))) V 1 AX3) 


15. Using Definition 11.1.9, show that each expression in Exer- 
cises 10-14 is a Boolean expression. 


In Exercises 16-20, tell whether the given expression is a Boolean 
expression. If it is a Boolean expression, use Definition 11.1.9 to 
show that it is. 


16. x1 A (x2 V x3) 
17. (x1) 

18. x1 AX. V x3 
19. ((x1 Ax2) V X3 
20. (@1)) 


21. Find the combinatorial circuit corresponding to each Boolean 
expression in Exercises 10—14 and write the logic table. 


A switching circuit is an electrical network consisting of switches 
each of which is open or closed. An example is given in Fig- 
ure 11.1.12. If switch X is open (closed), we write X = 0(X = 1). 
Switches labeled with the same letter, such as B in Figure 11.1.12, 
are either all open or all closed. Switch X, such as A in Figure 
11.1.12, is open if and only if switch X, such as A, is closed. If cur- 
rent can flow between the extreme left and right ends of the circuit, 
we say that the output of the circuit is 1; otherwise, we say that 
the output of the circuit is 0. A switching table gives the output of 
the circuit for all values of the switches. The switching table for 
Figure 11.1.12 is as follows: 


A B Cc Circuit Output 
1 1 1 1 

1 1 0 1 

1 0 1 0 

1 0 0 0 

0 1 1 1 

0 1 0 1 

0 0 1 1 

0 0 0 1 


Figure 11.1.12 A switching circuit. 
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22. Draw a circuit with two switches A and B having the prop- 
erty that the circuit output is 1 precisely when both A and B 
are closed. This configuration is labeled A A B and is called a 
series circuit. 


23. Draw a circuit with two switches A and B having the prop- 
erty that the circuit output is 1 precisely when either A or B 
is closed. This configuration is labeled A V B and is called a 
parallel circuit. 


24. Show that the circuit of Figure 11.1.12 can be represented 
symbolically as 
(AAB)VAV (BAC). 
Represent each circuit in Exercises 25-29 symbolically and give 
its switching table. 


25. 
(nt 
c a .% 
26. 
27. 
B 
A 
C 
D 
28. 
B 
ro 24 ’ 
A B— 
A 
Cc D 
C 


Represent the expressions in Exercises 29-33 as switching circuits 
and write the switching tables. 

29. (AVB)AA 

30. AV (BAC) 

31. (AAB)V (CAA) 

32. AAA(BAC)V(BAC))) VAABAC) 

33. AA(BACAD)V (BAC) VD) V (BACAD)) A (BV D) 
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11.2 Properties of Combinatorial Circuits 


Theorem 11.2.1 


In the preceding section we defined two binary operators A and V on Z) = {0, 1} and 
a unary operator ~ on Z). (Throughout the remainder of this chapter we let Z) denote 
the set {0, 1}.) We saw that these operators could be implemented in circuits as gates. In 
this section we discuss some properties of the system consisting of Z) and the operators 
A, V, and~. 


If A, V, and ~ are as in Definitions 11.1.1—-11.1.3, then the following properties 
hold. 


(a) Associative laws: 

(WD) Vie SOV (OV) 

(IND) NG aN Ne) for all a,b,c € Z. 
(b) Commutative laws: 

aVb=bVa, CNS DAG for all a,b € Z. 

(c) Distributive laws: 

aN (bVc) =(anb)vV (anc) 

aV (bAc) = (aVb)A (avec) for all a,b,c € Z. 
(d) Identity laws: 

av O= cr, OK L=o forall a € Z. 

(e) Complement laws: 


aVa—Ae anal —10) forall ae ZH. 


Proof The proofs are straightforward verifications. We shall prove the first 
distributive law only and leave the other equations as exercises (see Exercises 16 
and 17). 

We must show that 


aN(bVc)=(anb)Vv (adc) for all a, b,c € Z. (11.2.1) 


We simply evaluate both sides of (11.2.1) for all possible values of a, b, and c in Z, and 
verify that in each case we obtain the same result. The table gives the details. 


a bee aN(bVc) (adab)V (aac) 
1 1 1 1 1 
I. 2 0 1 1 
1 0 1 1 1 
1 0 0O 0 0 
0 1 1 0 0 
0 1 0 0 0 
0 oO 1 0 0 
0 0 0 0 0 


540 Chapter 11 Boolean Algebras and Combinatorial Circuits 


Example 11.2.2 


Example 11.2.4 


By using Theorem 11.2.1, show that the combinatorial circuits of Figure 11.2.1 have 
identical outputs for given identical inputs. 


xy 
x4 X2 a s ) 
2 = p 
X3 X3 
(a) (b) 


Figure 11.2.1 The combinatorial circuits (a) and (b) have identical outputs for 
given identical inputs and are said to be equivalent. 


SOLUTION The Boolean expressions representing the circuits are, respectively, 


XV (x2 AXx3), (x1 V x2) A (1 V x3). 
By Theorem 11.2.1(c), 
aV(bAc)=(avb)A (avec) for all a,b,c € Z. (11.2.2) 


But (11.2.2) says that the combinatorial circuits of Figure 11.2.1 have identical outputs 
for given identical inputs. < 


Arbitrary Boolean expressions are defined to be equal if they have the same values 
for all possible assignments of bits to the literals. 


Definition 11.2.3 >» Let 
X, = Xi (%,-.-, Xn) and Xq = X7(X1, ..- Xn) 
be Boolean expressions. We define X; to be equal to Xz and write X, = X2 if 


X1 (a1, .--, An) = X2(ay,---, Q) for all a; € Z. 4 


Show that 
(XV y) =XAY. (11.2.3) 
SOLUTION According to Definition 11.2.3, equation (11.2.3) holds if the equation is 


true for all choices of x and y in Z). Thus we may simply construct a table listing all 
possibilities to verify (11.2.3). 


x y (Vy) XAY 
1 1 0 0 
1 0 0) 0 
0 1 0 0 
0 0 1 1 
< 


If we define a relation R on a set of Boolean expressions by the rule X; R X2 if 
X, = Xp, Ris an equivalence relation. Each equivalence class consists of a set of Boolean 
expressions any one of which is equal to any other. 
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Because of the associative laws, Theorem 11.2.1(a), we can unambiguously write 
ay V a2 V+++V an (11.2.4) 
or 
a, Ady A+++ Ady (11.2.5) 
for a;j€Z). The combinatorial circuit corresponding to (11.2.4) is drawn as in 


Figure 11.2.2, and the combinatorial circuit corresponding to (11.2.5) is drawn as in 
Figure 11.2.3. 


ay ay 

—— el 
a2 . a2 ° 
an : ————— 


Figure 11.2.2 An n-input OR gate. Figure 11.2.3 Ann-input AND gate. 


Example 11.2.6 


The properties listed in Theorem 11.2.1 hold for a variety of systems. Any system 
satisfying these properties is called a Boolean algebra. Abstract Boolean algebras are 
examined in Section 11.3. 

Having defined equality of Boolean expressions, we define equivalence of combi- 
natorial circuits. 


Definition 11.2.5 >» We say that two combinatorial circuits, each having inputs 
X1,...,X, and a single output, are equivalent if, whenever the circuits receive the same 
inputs, they produce the same outputs. < 


The combinatorial circuits of Figures 11.2.4 and 11.2.5 are equivalent since, as shown, 
they have identical logic tables. 


y1 
a b 
a b|yi 
1 1 0 — 
a b|y 
1 0 0 —— 
1 1 0 
0 1 0 
1 0 0 
0 0 1 
SS 0 1 0 
Figure 11.2.4 A combinatorial 0 O 1 


circuit and its logic table. 


Figure 11.2.5 A combinatorial 

circuit and its logic table, which 

is identical to the logic table of 

Figure 11.2.4. The circuits of 

Figures 11.2.4 and 11.2.5 are 

said to be equivalent because 

they have identical logic tables. < 


If we define a relation R on a set of combinatorial circuits by the rule C; RC) if 
C; and C> are equivalent (in the sense of Definition 11.2.5), R is an equivalence relation. 
Each equivalence class consists of a set of mutually equivalent combinatorial circuits. 
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Theorem 11.2.7 


Example 11.2.8 


Example 11.2.6 shows that equivalent circuits may not have the same number of 
gates. In general, it is desirable to use as few gates as possible to minimize the cost of 
the components. 

It follows immediately from the definitions that combinatorial circuits are equiv- 
alent if and only if the Boolean expressions that represent them are equal. 


Let C; and C2 be combinatorial circuits represented, respectively, by the Boolean 
expressions X; = X\(%11,..., Xn) and X2 = X2(x,..., Xn). Then C; and C> are equiv- 
alent if and only if X; = Xp. 


Proof The value X;(a), ..., dn) [respectively, X2(a,, ..., dn)] for a; € Zp is the out- 
put for circuit C; (respectively, C2) for inputs a), ..., dy. 

According to Definition 11.2.5, circuits C; and C2 are equivalent if and only if 
they have the same outputs X)(qa),...,d,) and X2(a),...,@,) for all possible inputs 
a,..., 4. Thus circuits C; and C2 are equivalent if and only if 

Xi (1, .--, An) = X2(a1,..., Qn) for all values a; € Zp. (11.2.6) 
But by Definition 11.2.3, equation (11.2.6) holds if and only if X; = X2. < 


In Example 11.2.4 we showed that 


(XV y) =XAY. 
By Theorem 11.2.7, the combinatorial circuits (Figures 11.2.4 and 11.2.5) corresponding 
to these expressions are equivalent. < 


11.2 Review Exercises 


1. State the associative laws for A and v. 6. When are two Boolean expressions equal? 

2. State the commutative laws for A and v. 7. What are equivalent combinatorial expressions? 

3. State the distributive laws for A and v. 8. What is the relation between combinatorial expressions and the 
dh: Sse heatdeacey ian tee Nanay: Boolean expressions that represent them? 

5. State the complement laws for A, V, and~. 


11.2 Exercises 


Show that the combinatorial circuits of Exercises 1-5 are equiva- 2. 
lent. *1 
x2 
kL 


(b) 


(a) 


(b) 


x] 
me y1 
(a) 
xy 
we yi 
(b) 
Xx] 
yi 
x2 
x3 
(a) 
xy 
x2 
x yi 
3 
(b) 
x1 
x2 
- Dal 
x4 
(a) 
xy 
x2 
x3 
yi 
x4 
(b) 


Verify the equations in Exercises 6-10. 


10. 


6 X17 Vx) =X 

7. x1 AX, = Hy V x2) 
8. 
9 


» Xy A (X2 A x3) = (11 A X2) V (11 A X3) 


x, V (x1 Axo) =X] 


(x1 V x2) A (x3 V x4) = (43 A x1) V (3 A x2) V (X4 A x1) 


V (x4 A x2) 
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Prove or disprove the equations in Exercises 11-15. 
ll. ¥=x 12. X} AX2 =x) V x2 
13. X1 A ((x2 A x3) V (x1 A x2 AX3)) = x2 AX3 

14. (@ A x2) V (1 AX3)) = (1 V X2) A (1 V3) 
15. (x V x2) A (%3 V x4) A (43 AX2) = 0 

16. Prove the second statement of Theorem 11.2.1(c). 
17. Prove Theorem 11.2.1, parts (a), (b), (d), and (e). 


We say that two switching circuits are equivalent if the Boolean 
expressions that represent them are equal. 


18. Show that the switching circuits are equivalent. 


A B 


19. For each switching circuit in Exercises 25-28, Section 11.1, 
find an equivalent switching circuit using parallel and series 
circuits having as few switches as you can. 


20. For each Boolean expression in Exercises 29-33, Section 11.1, 
find a switching circuit using parallel and series circuits hav- 
ing as few switches as you can. 


A bridge circuit is a switching circuit, such as the one shown here, 
that uses nonparallel and nonseries circuits. 


{kh 


For each switching circuit, find an equivalent switching circuit us- 
ing bridge circuits having as few switches as you can. 


21. 
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3. 

B 
F 

A 
D 
E C 

a 
B Oca 


24. For each Boolean expression in Exercises 29-33, Section 11.1, 
find a switching circuit using bridge circuits having as few 
switches as you can. 


22. *2. 


Cc 
D—F 
B—F 
A 
D 


ies] 


11.3 Boolean Algebras 


In this section we consider general systems that have properties like those given in 
Theorem 11.2.1. We will see that apparently diverse systems obey these same laws (for 
example, see Example 11.3.2, Exercises 2 and 22, and Exercise 8 following the Problem- 
Solving Corner). We call such systems Boolean algebras. 


Go Online 


For more on Boolean 
algebras, see 


goo.g1/5hWhOy oe seed ; ae 
Definition 11.3.1 >» A Boolean algebra B consists of a set S containing distinct 


elements 0 and 1, binary operators + and - on S, and a unary operator ’ on S satisfying 
the following laws. 


(a) Associative laws: 


Qty)+z=x+(04+29 
(x+y) +z = x+(y+z) forall x,y,z eS. 


(b) Commutative laws: 
x+ty=y+x, Xey=yerx for all x,y € S. 
(c) Distributive laws: 


X*(yt+z) = @+y) + @-2) 
x+ (yz) = (&+y)-@4+2) for all x, y,z € S. 


(d) Identity laws: 

x+0=x, xel=x forallx eS. 
(e) Complement laws: 

x+x = 1, xx’ =0 forallx eS. 


If B is a Boolean algebra, we write B = (S,+, -, ’, 0, 1). < 


The associative laws can be omitted from Definition 11.3.1 since they follow from 
the other laws (see Exercise 24). 


Example 11.3.2 


Example 11.3.3 


Theorem 11.3.4 
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By Theorem 11.2.1, (Z2, V, A, ~, 0, 1) is a Boolean algebra. (We are letting Z. denote 
the set {0, 1}.) The operators +, -, ’ in Definition 11.3.1 are V, A,~, respectively. <4 


As is the standard custom, we will usually abbreviate a-b as ab. We also assume 
that - is evaluated before +. This allows us to eliminate some parentheses. For example, 
we can write (xy) + z more simply as xy + z. 

Several comments are in order concerning Definition 11.3.1. In the first place, 
0 and 1 are merely symbolic names and, in general, have nothing to do with the numbers 
0 and 1. This same comment applies to + and -, which merely denote binary operators 
and, in general, have nothing to do with ordinary addition and multiplication. 


Let U be a universal set and let S = P(U), the power set of U. If we define the following 
operations 


X+Y=XUY, X*Y=xXNY, x= 


on S, then (S$, U,,~, Z, U) is a Boolean algebra. The empty set @ plays the role of 
0 and the universal set U plays the role of 1. If we let X, Y, and Z be subsets of S, prop- 
erties (a)—(e) of Definition 11.3.1 become the following properties of sets (see Theorem 
1.1.22): 


(a’) (XKUY)UZ=XU(YUZ) 

(XNY)NZ=XN(YNZ) for all X, Y,Z € P(U). 
(b’) XUY=YUX, XNY=YNX © forallX,Ye P(U). 
(c!) XN(YUZ) = (XNY)U(XNZ) 


XU(YNZ) = (KUY)N(XUZ) for all X, Y,Z € P(U). 
(d’) XUSB=X, XNU=X for every X € P(U). 
(e') XUX =U, XNX=2 for every X € P(U). < 


At this point we will deduce several other properties of Boolean algebras. We begin 
by showing that the element x’ in Definition 11.3.1(e) is unique. 


In a Boolean algebra, the element x’ of Definition 11.3.1(e) is unique. Specifically, if 
x+y=1and xy =0, theny =’. 


Proof 

y=yl Definition 11.3.1(d) 
=y(x4+7) Definition 11.3.1(e) 
= yx + yy Definition 11.3.1(c) 
= xy +x Definition 11.3.1(b) 
=0+4 yx’ Given 
= xx' + yx’ Definition 11.3.1(e) 
=xx4xy Definition 11.3.1(b) 
=x (x+y) Definition 11.3.1(c) 
=x 1 Given 
=x Definition 11.3.1(d) < 


Definition 11.3.5 >» Ina Boolean algebra, we call the element x’ the comple- 
ment of x. < 


We can now derive several additional properties of Boolean algebras. 
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Theorem 11.3.6 


(a) x=x+0 


=x+ (xx’) 
= (x+x(x+7) 
=(x4+x)1 


=x 


(b) x4+1l=C@4)l1 
=(4+ D4) 


=x-+1x' 
=x471 
=x+x 
=1 

x0 = x0+0 
= x0 + xy’ 


= x(0+ x’) 
= x(x + 0) 


= x 
=0 

(c) x+xy = xl + xy 
=x(1+y) 


=x 
(f) If we show that 


Definition 11.3.1(d) 
Definition 11.3.1(e) 
Definition 11.3.1(c) 
Definition 11.3.1(e) 
Definition 11.3.1(d) 


Definition 11.3.1(d) 
Definition 11.3.1(e) 
Definition 11.3.1(c) 
Definition 11.3.1(b) 
Definition 11.3.1(d) 
Definition 11.3.1(e) 
Definition 11.3.1(d) 
Definition 11.3.1(e) 
Definition 11.3.1(c) 
Definition 11.3.1(b) 
Definition 11.3.1(d) 
Definition 11.3.1(e) 


Definition 11.3.1(d) 
Definition 11.3.1(c) 
Definition 11.3.1(b) 
Part (b) 

Definition 11.3.1(d) 


(x«+y)Q’y’) =0 


Proof We will prove (b) and the first statement of parts (a), (c), and (f) and leave the 
others as exercises (see Exercises 18-20). 


(11.3.1) 


Example 11.3.7 


Example 11.3.9 


Theorem 11.3.10 
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and 
(xty)+xy =1, (11.3.2) 


it will follow from Theorem 11.3.4 that x’y’ = (x + y)’. Now 


w+y)@’y) = Wy)a@ty) Definition 11.3.1(b) 
= (yx + Q’y)y Definition 11.3.1(c) 
= xX’ y’) + Q’y’)y Definition 11.3.1(b) 
= (xx’)y’ +. x’'(yy) Definition 11.3.1(a) 
= (xx’)y' +x’(yy) Definition 11.3.1(b) 
= Oy + x0 Definition 11.3.1(e) 
=y04+2x0 Definition 11.3.1(b) 
=0+0 Part (b) 
= Definition 11.3.1(d) 


Therefore, (11.3.1) holds. 
Next we verify (11.3.2). 


(x+y) +xy =(at+y)+x)(a+y)+y) Definition 11.3.1(c) 
=(y+x)+x)(xt+y)+y) Definition 11.3.1(b) 
=+@4+xr))@+O0+y')) Definition 11.3.1(a) 
=(0+)Dat+l Definition 11.3.1(e) 
= 1-1 Part (b) 
= ij Definition 11.3.1(d) 

By Theorem 11.3.4, xy’ = (x+ y)’. < 


As explained in Example 11.3.3, if U is a set, P(U) can be considered a Boolean algebra. 
Therefore, De Morgan’s laws, which for sets may be stated 


(XUY)=XNY, (XNY)=XUY forall X,Y ¢P(U), 


hold. These equations may be verified directly (see Theorem 1.1.22), but Theorem 11.3.6 
shows that they are a consequence of other laws. < 


The reader has surely noticed that equations involving elements of a Boolean al- 
gebra come in pairs. For example, the identity laws [Definition 11.3.1(d)] are 


x+0=x, pl eae oA 


Such pairs are said to be dual. 


Definition 11.3.8 >» The dual of a statement involving Boolean expressions is 
obtained by replacing 0 by 1, 1 by 0, + by -, and + by +. < 


The dual of (x + y)’ = xy’ is (xy)’ =x + y’. < 


Each condition in the definition of a Boolean algebra (Definition 11.3.1) includes 
its dual. Therefore, we have the following result. 


The dual of a theorem about Boolean algebras is also a theorem. 


Proof Suppose that T is a theorem about Boolean algebras. Then there is a proof P 
of T involving only the definitions of a Boolean algebra (Definition 11.3.1). Let P’ be 
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the sequence of statements obtained by replacing every statement in P by its dual. Then 
P’ is a proof of the dual of T. 


Example 11.3.11 The dual of 


< 
X+x=xX (11.3.3) 
XX =X. (11.3.4) 


We proved (11.3.3) earlier [see the proof of Theorem 11.3.6(a)]. If we write the 
dual of each statement in the proof of (11.3.3), we obtain the following proof of (11.3.4): 


Example 11.3.12 The proofs given in Theorem 11.3.6 of the two statements of part (b) are dual to each 


other. 


x=<xl 
= x(x +x) 
= xx + xx" 
=xx+0 


= an: < 


< 


11.3 Review Exercises 


an & WB NR = 


. Define Boolean algebra. 

. What are the idempotent laws for Boolean algebras? 
. What are the bound laws for Boolean algebras? 

. What are the absorption laws for Boolean algebras? 


. What is the involution law for Boolean algebras? 


6 
ef 
8 
9 


. What are the 0/1 laws for Boolean algebras? 
. What are De Morgan’s laws for Boolean algebras? 
. How is the dual of a Boolean expression obtained? 


. What can we say about the dual of a theorem about Boolean 


algebras? 


11.3 Exercises 


1. Verify properties (a’)-(e’) of Example 11.3.3. 
2. Let S = {1, 2, 3, 6}. Define 


‘d 


6 
x+y =Iem(x, y), x+y = ged(x, y), x= - 
x 


for x, y € S (Icm and ged denote, respectively, the least com- 
mon multiple and the greatest common divisor). Show that 
(S,+, +, ', 1, 6) is a Boolean algebra. 


3. S = {1, 2,4, 8}. Define + and - as in Exercise 2 and define 


x’ = 8/x. Show that (S,+, +, ’, 1, 8) is not a Boolean algebra. 


Let S, = {1,2,..., n}. Define 


x+y = max{x, y}, x+y = min{x, y}. 


4. Show that parts (a)—(c) of Definition 11.3.1 hold for S,,. 
5. Show that it is possible to define 0,1 and ’ so that 


(Sn, +, +, ', 0, 1) is a Boolean algebra if and only if n = 2. 


6. Rewrite the conditions of Theorem 11.3.6 for sets as in 


Example 11.3.3. 


7. Interpret Theorem 11.3.4 for sets as in Example 11.3.3. 


Write the dual of each statement in Exercises 8-14. 


8. 


@+y@t+)=x+xy+y 
2 + yl! = xy 
. Ifx+y=x+zandx +y=x +z, theny =z. 


. xy’ = Oif and only if xy = x. 

. x = Oif and only if y = xy’ +2’y for all y. 

. Ifx+y=0,thenx=O0=y. 

~xX4t+xyt+)=x 

. Prove the statements of Exercises 8-14. 

. Prove the duals of the statements of Exercises 8-14. 

. Write the dual of Theorem 11.3.4. How does the dual relate to 


Theorem 11.3.4 itself? 


. Prove the second statements of parts (a), (c), and (f) of 


Theorem 11.3.6. 


. Prove the second statements of parts (a), (c), and (f) of 


Theorem 11.3.6 by dualizing the proofs of the first statements 
given in the text. 
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20. Prove Theorem 11.3.6, parts (d) and (e). 


«21. Deduce part (a) of Definition 11.3.1 from parts (b)-(e) of 
Definition 11.3.1. 


*23. Let n be a positive integer. Let S$ be the set of all divisors 
of n, including | and n. Define + and - as in Exercise 2 
and define x’ = n/x. What conditions must n satisfy so that 


22. 


Let U be the set of positive integers. Let S$ be the collec- 
tion of subsets X of U with either X or X finite. Show that 
(S, U,N,~, B, U) is a Boolean algebra. 


Problem-Solving Corner 


(S,+, +, ’, 1,n) is a Boolean algebra? 


*24. Show that the associative laws follow from the other laws of 


Definition 11.3.1. 


Boolean Algebras 


Problem that (A, +, -, ’, 0, 1) is a Boolean algebra. According 
Let (S$, +, -,’, 0, 1) be a Boolean algebra and let A be WOM gre Sia ET aN SUG INS [ee ne 

a subset of S. Show that (A,+, -, ', 0, 1) is a Boolean A contains distinct elements 0 and 1. (3) 

algebra if and only if 1 € A and xy’ €A for all x, yeA. +-and -are binary operators on A. (4) 

Attacking the Problem ‘is a unary operator on A. (5) 

Since the given statement is an “if and only if” state- The associative laws hold. (6) 

ment, there are two statements to be proved: The commutative laws hold. (7) 

If (A, +, +, ’, 0, 1) is a Boolean algebra, The distributive laws hold. (8) 

then 1 € Aandxy €Aforallx,yeA. (1) The identity laws hold. (9) 

If 1 € A and xy’ € A for all x, y € A, then The complement laws hold. (10) 

(A, +, +, ',0, 1) isa Boolean algebra. (2) A contains | by assumption. To prove (3), we must 


To prove (1), we can use the laws as specified by 
the definition of “Boolean algebra” (Definition 11.3.1) 
and the laws derived in Theorem 11.3.6 that ele- 


show that 0 € A. We have only two assumptions 
about A: 1 € A and if x,y € A, then xy’ € A. All 
we can do at this point is combine these assumptions; 


ments of a Boolean algebra must obey. To prove that is, take x = y = | and examine the conclu- 
that (A, +, -, ’,0, 1) is a Boolean algebra, we will sion: 11’ € A. Now Theorem 11.3.6(e) [applied to the 
verify that the laws specified by Definition 11.3.1 Boolean algebra (S,+, +, ‘, 0, 1)] says that I’ = 0. 


are satisfied. Before reading on, you should review 
Definition 11.3.1 and Theorem 11.3.6. 


Finding a Solution 


First let’s try to prove (1). We assume that (A, +, -, ’, 
0, 1) is a Boolean algebra and prove that 


mleA 


and 
m xy’ €Aforallx,y EA. 


Definition 11.3.1 says that a Boolean algebra con- 
tains 1. Since (A, +, -, ’,0, 1) is a Boolean algebra, 
1eEA. 

Now suppose that x, y € A. Definition 11.3.1 says 
that ' is a unary operator on A. This means that y’ € A. 
Definition 11.3.1 also says that - is a binary operator 
on A. This means that xy’ € A. This completes the 
proof of (1). 

Now let’s try to prove (2). This time we assume 
that 1 € A and xy’ € A for all x, y € A and try to prove 


Substituting for 1’, we know now that 10 € A. But 
Theorem 11.3.6(b) says that for any x, x0 = 0. Thus 
10 = Ois inA. Success! A contains | and 0. 0 and 1 
are distinct because they are elements of the Boolean 
algebra (S, +, -, ’, 0, 1). Therefore, (3) is proved. 

To prove (4), we must show that + and - are 
binary operators on A; that is, if x,y € A, then x+y 
and xy are in A. Consider proving that - is a binary 
operator on A. What we know is that if x, y € A, then 
xy’ € A, which is close to what we want to prove. If we 
could somehow replace y by y in the expression xy’, 
we could conclude that xy €¢ A. What we would like to 
do is assume that x, y € A, then deduce 


x,y EA, (11) 
and then conclude that 
xy=xy eA. 


To deduce (11), we need to show if y € A, then y’ € A. 
But this is (5). Detour! Let’s work on (5). 
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We will assume that y € A and try to prove that 
y €A. If we could get rid of that pesky x (in the hy- 
pothesis x,y € A implies xy’ € A), we would have 
exactly what we want. We can effectively eliminate x 
by taking x = | since ly = y. Formally, we argue as 
follows. Let y be in A. Since 1 € A,y’ = ly € A. 
[y’ = ly’ by Definition 11.3.1(b) and 11.3.1(d).] We 
have proved (5). 

Now back to (4). Let x, y € A. By the just proved 
(5), y’ € A. By the given condition, xy = xy” € A. 
[y = y” by Theorem 11.3.6(d).] We have proved that - 
is a binary operator on A. 

De Morgan’s laws [Theorem 11.3.6(f)], in effect, 
allow us to interchange + and -, so we can use them 
to prove that if x, y € A, then x + y € A. Formally we 
argue as follows. Suppose that x, y € A. By (5), x and 
y are both in A. Since we have already proved that - 
is a binary operator on A, x’y’ € A. By (5), Q’y’)’ € A. 
By De Morgan’s laws [Theorem 11.3.6(f)] and Theo- 
rem 11.3.6(d), x+y = x’+y" = (vy) € A. Therefore, 
+ is a binary operator on A. We have proved (4). 

The next statement to prove is (6), which is to ver- 
ify the associative laws 

(x+y) +z=x+(4+2), 

(xy)z = xz) 
Now (S,+, -, ’, 0, 1) is a Boolean algebra and so the 
associative laws hold in S. Since A is a subset of S, 
the associative laws surely hold in A. Thus (6) holds. 
For the same reason, properties (7) through (10) also 
hold in A. Therefore, (A,+,-, ’,0, 1) is a Boolean 
algebra. 


for all x, y,z EA. 


Formal Solution 


Suppose that (A,+, -, ’,0, 1) is a Boolean algebra. 
Then | € A. Suppose that x, y € A. Then y’ € A. Thus 
xy EA. 

Now suppose that 1 € A and xy’ ¢€ A for all 
x,y € A. Taking x = y = 1, we obtain0 = 11’ € A. 
Taking x = 1, we obtain y = ly’ ¢€ A. Thus’ is 
a unary operator on A. Replacing y by y’, we obtain 
xy = xy” ¢€ A. Thus - is a binary operator on A. 
Nowx+y = x’ +y" = (vy)! € A. Thus + is a 
binary operator on A. Parts a—e of Definition 11.3.1 
automatically hold in A since they hold in S. Therefore 
(A, +, -, ', 0, 1) is a Boolean algebra. 
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Summary of Problem-Solving Techniques 


= When trying to construct a proof, write out care- 
fully what is assumed and what is to be proved. 

= When trying to construct a proof, look at closely 
related definitions and theorems. 

= To prove that something is a Boolean algebra, go 
directly to the definition (Definition 11.3.1). 

= Consider proving statements in an order different 
from that given. In this problem, it was easier to 
prove statement (5) before proving statement (4). 

= Try various substitutions for the variables in 
a universally quantified statement. (After all, 
“universally quantified” means that the state- 
ment holds true for all values.) By taking 
x = y = | in the statement 

xy eA for all x, y € A, 
we were able to prove that 0 € A. 


Exercises 


Use the following definitions for Exercises 1-8. For 
a < b, let 
[a,b) = {x Ee R|a<x < Dd}, 

where, as usual, R denotes the set of real numbers. 
Define a subset X of R to be open relative to the [a, b) 
definition if for every x € X, there exists a,b € R with 
a < bandx € [a,b) C X. Henceforth, we abbreviate 
“open relative to the [a, b) definition” to “open.” Let 
R be the universal set. Define a set X C Rto be clopen 
if X and X are both open. Let A be the set of clopen 
subsets of R. 


1. Prove that R and @ are in A. 
2. Prove that [1, 2) U [3, 5) € A. 


3. Prove that ¥ = {x € R| 3 < x < 10} is open but 
not clopen. 


. Prove that if X € A, then X € A. 

. Prove that if X and Y are open, then X U Y is open. 
. Prove that if X and Y are open, then X M Y is open. 
. Prove that if X, Y ¢ A, then XN Y € A. 

. Prove that (A, U,M,~, @, R) is a Boolean algebra. 


. Can we prove Exercises | and 4-8 if we replace 
[a,b),a <b, by {xe R|a<x <b}? 
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11.4 Boolean Functions and Synthesis of Circuits 


TABLE 11.4.1 m@ The 


exclusive-OR 

xX] x2 x1 B x2 
1 1 0 

1 0 1 

0 1 1 

0 0 0 


A circuit is constructed to carry out a specified task. If we want to construct a combi- 
natorial circuit, the problem can be given in terms of inputs and outputs. For example, 
suppose that we want to construct a combinatorial circuit to compute the exclusive-OR 
of x; and x2. We can state the problem by listing the inputs and outputs that define the 
exclusive-OR. This is equivalent to giving the desired logic table. 


Definition 11.4.1 > The exclusive-OR of x; and x written x; @ x is defined 
by Table 11.4.1. < 


A logic table, with one output, is a function, The domain is the set of inputs and 
the range is the set of outputs. For the exclusive-OR function given in Table 11.4.1, the 
domain is the set {(1, 1), (1, 0), (0, 1), (O, 0)} and the range is the set Z, = {0, 1}. 

If we could develop a formula for the exclusive-OR function of the form 


x1 @ x2 = X(X1, X2), 
where X is a Boolean expression, we could solve the problem of constructing the com- 
binatorial circuit. We could merely construct the circuit corresponding to X. 


Functions that can be represented by Boolean expressions are called Boolean 
functions. 


Definition 11.4.2 >» Let X(x,,...,x,) be a Boolean expression. A function f 
of the form 


FOiss009%a) = X11, ..+5 Xn) 


is called a Boolean function. < 


Example 11.4.3 The function f: Z} > Z, defined by 


f(%1, 2,3) =X, A 2 V x3) 


is a Boolean function. The inputs and outputs are given in the following table. 


x] x2 X3 SF (x1, X2, 3) 
i 1 1 1 
i 1 0 0 
1 0 1 1 
1 0 0 1 
0 i il 0 
0 1 0 0 
0 0 1 0 
0 0 0 0 


< 


In the next example we show how an arbitrary function f: Z; — Z) can be realized 
as a Boolean function. 
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Example 11.4.4 Show that the function f given by the following table is a Boolean function. 


x} x2 x3 Sf (x1, x2, X3) 
1 1 1 1 
1 1 0 0 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0) 1 0 1 
0 0 1 0 
0 0 0 0 


SOLUTION Consider the first row of the table and the combination 
xX, AX2 A X3. (11.4.1) 


Notice that if x} = x. = x3 = 1, as indicated in the first row of the table, then (11.4.1) is 
1. The values of x; given by any other row of the table give (11.4.1) the value 0. Similarly, 
for the fourth row of the table we may construct the combination 


X, AX2 AX3. (11.4.2) 


Expression (11.4.2) has the value | for the values of x; given by the fourth row of the 
table, whereas the values of x; given by any other row of the table give (11.4.2) the 
value 0. 

The procedure is clear. We consider a row R of the table where the output is 1. We 
then form the combination x; A x2 A x3 and place a bar over each x; whose value is 0 in 
row R. The combination formed is | if and only if the x; have the values given in row R. 
Thus, for row 6, we obtain the combination 


X, AX. AX3. (11.4.3) 
Next, we OR the terms (11.4.1)—(11.4.3) to obtain the Boolean expression 
(x1 A X2 AXx3) V (x1 AX2 A X3) V(X, A X2 AX). (11.4.4) 


We claim that f(x), x2, x3) and (11.4.4) are equal. To verify this, first suppose that x, x2, 
and x3 have values given by a row of the table for which f(x1, x2, x3) = 1. Then one of 
(11.4.1)-(11.4.3) is 1, so the value of (11.4.4) is 1. On the other hand, if x;, x2, x3 have 
values given by a row of the table for which f(x, x2, x3) = 0, all of (11.4.1)-(11.4.3) 
are 0, so the value of (11.4.4) is 0. Thus f and the Boolean expression (11.4.4) agree on 
Ze therefore, 


f(X1, X2,.3) = (1 A X2 AX3) V (ry AX2 AX3) V A X2 AX), 


as claimed. < 


After one more definition, we will show that the method of Example 11.4.4 can 
be used to represent any function f: Z3 — Zp. 


Definition 11.4.5 » A minterm inthe symbols x1, ..., x, is a Boolean expres- 
sion of the form 


Yi Ay2 A+++ AYn, 


where each y,; is either x; or X;. < 


Theorem 11.4.6 


Example 11.4.8 
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Iff: Z} — Z), then/f is a Boolean function. If f is not identically zero, let A), ..., Ax 
denote the elements A; of Z} for which f(A;) = 1. For each Aj = (a), ..., d,), set 


LG = Vil OPS INN 


where 
_ Jy fa=l 
oT = Xj if a; = (0), 
Then 
HiCilo coag kia) = why VAN 22 OY iP (11.4.5) 
Proof Iff(@i,...,Xn) = 0 for all x;, then f is a Boolean function, since 0 is a Boolean 
expression. 
Suppose that f is not identically zero. Let mj;(a1, ..., dn) denote the value obtained 


from m; by replacing each x; with a;. It follows from the definition of m, that 


1 ifA=A; 
a fe if A # Aj. 


Let A € Z}. If A = A; for some i € {1,..., k}, then f(A) = 1, mj(A) = 1, and 
m(A) V-+-Vm,(A) = 1. 


On the other hand, if A ~ A; for any i € {1,...,k}, then f(A) = 0,m;(A) = 0 for 
i=1,...,k,and 


m(A) V «++ V my(A) = 0. 


Therefore, (11.4.5) holds. < 


Definition 11.4.7 » The representation (11.4.5) of a Boolean function 


f:Z5 — Zz is called the disjunctive normal form of the function f. < 


Design a combinatorial circuit that computes the exclusive-OR of x; and x2. 


SOLUTION The logic table for the exclusive-OR function x; © x is given in Table 
11.4.1. The disjunctive normal form of this function is 


X, Bx. = (X1 AX) V (X, AX). (11.4.6) 


The combinatorial circuit corresponding to (11.4.6) is given in Figure 11.4.1. 


x1 


x1 @® X2 
x2 


Figure 11.4.1 A combinatorial circuit for the 
exclusive-OR. 4 
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Go Online 


For more on conjunctive 


normal form, see 
goo.gl/5HWhOy 


Suppose that a function is given by a Boolean expression such as 
f(%1, 2, X3) = (x1 V X2) A x3 


and we wish to find the disjunctive normal form of f. We could write the logic table 
for f and then use Theorem 11.4.6. Alternatively, we can deal directly with the Boolean 
expression by using the definitions and results of Sections 11.2 and 11.3. We begin by 
distributing the term x3 as follows: 


(x1 V x2) Ax3 = (41 AX3) V (%2 A x3). 


Although this represents the Boolean expression as a combination of terms of the form 
y A Z, it is not in disjunctive normal form, since each term does not contain all of the 
symbols x;, x2, and x3. However, this is easily remedied, as follows: 
(x1 A.x3) V (2 A x3) = (41 Ax3 A 1) V (02 A x3 A 1) 
= (X41 Ax3 A (%2 V X2)) V (42 Ax3 A (X1 V X1)) 
= (x1 AX. AXx3) V (41 AX2 A x3) 
V (x1 A X2 A X3) V (X1 A X2 A x3) 
= (x1 A X2 AX3) V (x1 AX2 A X3) 
V (X1 A X2 A X3). 


This expression is the disjunctive normal form of f. 
Theorem 11.4.6 has a dual. In this case the function f is expressed as 


SQ1,-++5%n) = M1 AM2A--+> AM. (11.4.7) 
Each M,; is of the form 
yi Vers Vay (11.4.8) 


where y; is either x; or x;. A term of the form (11.4.8) is called a maxterm and the 
representation of f (11.4.7) is called the conjunctive normal form. Exercises 24—28 
explore maxterms and the conjunctive normal form in more detail. 


11.4 Review Exercises 


& WB NN = 


. Define exclusive-OR. 
. What is a Boolean function? 
. What is a minterm? 


. What is the disjunctive normal form of a Boolean function? 


5. How can one obtain the disjunctive normal form of a Boolean 
function? 


6. What is a maxterm? 


7. What is the conjunctive normal form of a Boolean function? 


11.4 Exercises 


In Exercises 1-10, find the disjunctive normal form of each function 3. 
and draw the combinatorial circuit corresponding to the disjunc- 


tive normal form. 


i. 
% 


1 
1 
0 
0 


Oror |< 


f(x, y) 


1 
0 
1 
1 


x y z f(x,y, 2) 
1 1 1 1 
—S- 1 1 0 1 
x y f(x,y) 1 0 1 (0) 
14 0 1 0 0 1 
1 0 I 0 1 1 0 
0 1 0 0 1 0 0 
0 0 1 0 0 1 1 
0 0 0 1 


x y z f(x, y, 2) 
1 1 1 1 
1 1 0 1 
1 0 1 0 
1 0 0 1 
0 1 1 1 
0 1 0 1 
0 0 1 0 
0 0 0 0 
x y z f(x, y, 2) 
1 1 1 0 
1 1 0 1 
1 0 1 1 
1 0 0 1 
0 1 1 1 
0 1 0 1 
0 0 1 1 
0 0 0 0) 
x y z f(x, y, 2) 
1 1 1 1 
1 1 0 1 
1 0 1 1 
1 0 0 0 
0 1 1 0 
0 1 0 1 
0 0 1 1 
0 0 0 1 
x y Zz SY, 2) 
1 1 1 1 
1 1 0 0) 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0 1 0 0 
0 0 1 0 
0 0 0 1 
x y z f(x, y, 2) 
1 1 1 0 
1 1 0 0 
1 0 1 0 
1 0 0 1 
0 1 1 1 
0 1 0 1 
0 0 1 1 
0 0 0 0 
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9. 


10. 


w x y z f(w, x,y, 2) 
1 1 1 1 1 
1 1 1 0 0 
1 1 0 1 i 
1 1 0 0 0 
1 0 1 1 0 
1 0 1 0 0 
il 0 0 1 0 
1 0 0 0 i 
0 i! 1 1 1 
0 1 1 0 0 
0 1 (0) 1 0 
0 1 0) 0 (0) 
0 0 1 1 1 
0 0 1 0 0 
0 0 0 1 0 
0 0 0 0 0) 
w x y Z f(w, x, y, 2) 
1 1 1 1 0 
1 1 t 0 0 
il 1 0 1 1 
1 1 0 0) 1 
1 0 1 1 1 
1 0 1 0 1 
1 0 0 1 0 
1 0 0 0 1 
0 1 1 1 0 
0 1 1 0 1 
0 1 0 1 1 
0 1 0 0 1 
0 0 1 1 0 
0 0 1 0 1 
0 0 (0) 1 (0) 
0 0 0 0 1 


In Exercises 11-20, find the disjunctive normal form of each func- 
tion using algebraic techniques. (We abbreviate a A b as ab.) 


11. 
12. 
13. 
14. 
15. 
16. 
17. 
18. 
19. 
20. 


21. 


SQ Y) =xXV xy 

fy) = @V y)%VY) 

f(, y, 2) =x V yx V 2) 

f(x,y, 2) = QZ V AZ) QF V 2) 

fy, 2 =XV GV OF V 22)) 

ff (X,Y, 2) = Gy V XZ)KV yz) 

f(x, y, 2) = (&V Ry V RyzZ) xy V 3Z)(y V xy?) 

f(w, xy, 2) = wy V (WF V Zr V We) 

f(x,y, 2) = Gy V XZ) (xyz V yZ) (az V XV V xVz V XZ) 
f(w, x, y, 2) = (WxPZ VX (Wz V XYZ V yxz) (Wz V xy VZV 
XyzZ V Xyz) 


How many Boolean functions are there from Z} into Z2? 
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Let F denote the set of all functions from Z} into Zz. Define 
FYIM=f(MOVEQ) «EZ 


FAQM=fAAga) xEZ 
f(x) =f@) xeZS 
0) =0 xEZ 
l(x) = 1 x eZ}. 


22. How many elements does F have? 
23. Show that (F, V, A, ~, 0, 1) is a Boolean algebra. 


24. By dualizing the procedure of Example 11.4.4, explain how to 
find the conjunctive normal form of a Boolean function from 
Z5 into Z. 

25. Find the conjunctive normal form of each function in Exer- 
cises 1-10. 


26. 


27. 


28. 


29. 


By using algebraic methods, find the conjunctive normal form 
of each function in Exercises 11-20. 


Show that if m; V --- V mx is the disjunctive normal form of 
S(X1,-.+,%n), then m, A --- A mx is the conjunctive normal 
form of f(x], ...,%n)- 

Using the method of Exercise 27, find the conjunctive normal 
form of f for each function f of Exercises 1-10. 


Show that the disjunctive normal form (11.4.5) is unique; that 
is, show that if we have a Boolean function 


X,..-,Xp) =m Vee Ving=mvV--- Vv mi, 
1 J 


where each m;, m, is a minterm, then k = j and the subscripts 
on the m; may be permuted so that mj; = m;, fori = 1,...,k. 


11.5 Applications 


Go Online In the preceding section we showed how to design a combinatorial circuit using AND, 
For more on logic design, OR, and NOT gates that would compute an arbitrary function from Z; into Z,, where 
see g00.g1/5HWhOy Z = {0, 1}. In this section we consider using other kinds of gates to implement a circuit. 
We also consider the problem of efficient design. We will conclude by looking at several 

useful circuits having multiple outputs. Throughout this section, we write ab for a A b. 

Before considering alternatives to AND, OR, and NOT gates, we must give a pre- 


cise definition of “gate.” 


Definition 11.5.1 > A gate is a function from Z} into Z. < 


Example 11.5.2 The AND gate is the function A from Z3 into Z) defined as in Definition 11.1.1. The 
NOT gate is the function ~ from Z, into Z) defined as in Definition 11.1.3. < 


We are interested in gates that allow us to construct arbitrary combinatorial 


circuits. 


Definition 11.5.3 > A set of gates {g1, ..., gx} is said to be functionally com- 
plete if, given any positive integer n and a function f from Z} into Zy, it is possible to 
construct a combinatorial circuit that computes f using only the gates g1,..., gx. < 


Example 11.5.4 Theorem 11.4.6 shows that the set of gates {AND, OR, NOT} is functionally complete. 


< 


It is an interesting fact that we can eliminate either AND or OR from the set {AND, 
OR, NOT} and still obtain a functionally complete set of gates. 


Theorem 11.5.5 _The sets of gates 


are functionally complete. 


{AND, NOT} {OR, NOT} 


X1 
x x 
yo pets 


Figure 11.5.2 NAND gate. 


Theorem 11.5.7 


Example 11.5.8 
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Proof We will show that the set of gates {AND, NOT} is functionally complete and 
leave the problem of showing that the other set is functionally complete for the exercises 
(see Exercise 1). 

We have 


xVy=xXVY involution law 
=Xy De Morgan’s law. 


Therefore, an OR gate can be replaced by one AND gate and three NOT gates. (The 
combinatorial circuit is shown in Figure 11.5.1.) 


xvy 
BY 


Figure 11.5.1 A combinatorial circuit using 
only AND and NOT gates that computes x v y. 


Given any function f:Z3 —> Z), by Theorem 11.4.6 we can construct a combi- 
natorial circuit C using AND, OR, and NOT gates that computes f. But Figure 11.5.1 
shows that each OR gate can be replaced by AND and NOT gates. Therefore, the circuit 
C can be modified so that it consists only of AND and NOT gates. Thus the set of gates 
{AND, NOT} is functionally complete. < 


Although none of AND, OR, or NOT singly forms a functionally complete set 
(see Exercises 2—4), it is possible to define a new gate that by itself forms a functionally 
complete set. 


Definition 11.5.6 >» ANAND gate receives inputs x, and x2, where x, and x, 
are bits, and produces output denoted x; + x2, where 


0 ifx,; =landxy=1 
i to= 
1 otherwise. 


A NAND gate is drawn as shown in Figure 11.5.2. < 


Many basic circuits used in digital computers today are built from NAND gates. 


The set {NAND} is a functionally complete set of gates. 


Proof First we observe that x ¢ y = xy. Therefore, 


x= H=xtx (11.5.1) 
xVyY=FV=FAF=HOETMt Oty). (11.5.2) 


Equations (11.5.1) and (11.5.2) show that both OR and NOT can be written in terms of 
NAND. By Theorem 11.5.5, the set {OR, NOT} is functionally complete. It follows that 
the set {NAND} is also functionally complete. < 


Design combinatorial circuits using NAND gates to compute the functions fi (x) = x 
and fi (x, y) =x Vy. 
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SOLUTION The combinatorial circuits, derived from equations (11.5.1) and (11.5.2), 
are shown in Figure 11.5.3. 


Figure 11.5.3 Combinatorial circuits 
using only NAND gates that compute x 
and x V y. < 


Consider the problem of designing a combinatorial circuit using AND, OR, and 
NOT gates to compute the function f. 


x y z f(s 2) 
1 1 1 1 
1 1 0 1 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0 1 0 0 
0 0 1 0 
0 0 0 0 
The disjunctive normal form of f is 
SQ, y, Z) = xyz V XYZ V XYZ. (11.5.3) 


The combinatorial circuit corresponding to (11.5.3) is shown in Figure 11.5.4. 


Figure 11.5.4 A combinatorial circuit that computes 
f(y, Z) = xyz V XYZ V xyz. 


The circuit in Figure 11.5.4 has nine gates. As we will show, it is possible to design 
a circuit having fewer gates. The problem of finding the best circuit is called the mini- 
mization problem. There are many definitions of “best.” 

To find a simpler combinatorial circuit equivalent to that in Figure 11.5.4, we 
attempt to simplify the Boolean expression (11.5.3) that represents it. The equations 


Example 11.5.9 
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EavVEaA=E (11.5.4) 
E=EV Ea, (11.5.5) 


where F represents an arbitrary Boolean expression, are useful in simplifying Boolean 
expressions. 
Equation (11.5.4) may be derived as follows: 


EaV Ea=E(ava)y=E\1=E 


using the properties of Boolean algebras. Equation (11.5.5) is essentially the absorption 
law [Theorem 11.3.6(c)]. 
Using (11.5.4) and (11.5.5), we may simplify (11.5.3) as follows: 


XYZ V XYZ V XYZ = XV V XYZ by (11.5.4) 
= xy V XYZ V XVZ by (11.5.5) 
= xy V XZ. by (11.5.4). 


A further simplification, 
xy VxZ=x(y V2), (11.5.6) 


is possible using the distributive law [Definition 11.3.1(c)]. The combinatorial circuit 
corresponding to (11.5.6), which requires only three gates, is shown in Figure 11.5.5. 


Figure 11.5.5 A three-gate combinatorial 
circuit equivalent to that of Figure 11.5.4. 


The combinatorial circuit in Figure 11.4.1 uses five AND, OR, and NOT gates to com- 
pute the exclusive-OR x @ y of x and y. Design a circuit that computes x © y using fewer 
AND, OR, and NOT gates. 


SOLUTION Unfortunately, (11.5.4) and (11.5.5) do not help us simplify the disjunc- 
tive normal form xy Vv xy of x @ y. Thus we must experiment with various Boolean rules 
until we produce an expression that requires fewer than five gates. One solution is pro- 
vided by the expression (x V y)xy whose implementation requires only four gates. This 
combinatorial circuit is shown in Figure 11.5.6. 


x 


y 
x@y 


Figure 11.5.6 A four-gate combinatorial circuit that 
computes the exclusive-OR x @ y of x and y. < 


The set of gates available determines the minimization problem. Since the state of 
technology determines the available gates, the minimization problem changes through 
time. In the 1950s, the typical problem was to minimize circuits consisting of AND, 
OR, and NOT gates. Solutions such as the Quine—-McCluskey method and the method 
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Example 11.5.10 


Example 11.5.12 


of Karnaugh maps were provided. The reader is referred to [Mendelson] for the details 
of these methods. 

Advances in solid-state technology have made it possible to manufacture very 
small components, called integrated circuits, which are themselves entire circuits. Thus 
circuit design today consists of combining basic gates such as AND, OR, NOT, and 
NAND gates and integrated circuits to compute the desired functions. Boolean algebra 
remains an essential tool, as a glance at a book on logic design such as [McCalla] will 
show. 

We conclude this section by considering several useful combinatorial circuits hav- 
ing multiple outputs. A circuit with n outputs can be characterized by n Boolean expres- 
sions, as the next example shows. 


Write two Boolean expressions to describe the combinatorial circuit of Figure 11.5.7. 


SOLUTION The output y; is described by the expression y = ab, and y> is described 
by the expression y2 = bc Vv ab. 


a 


b yi 


Ba) 
Cc 


Figure 11.5.7 A combinatorial circuit with 
two outputs. < 


Our first circuit is called a half adder. 


Definition 11.5.11 > A half adder accepts as input two bits x and y and pro- 
duces as output the binary sum cs of x and y. The term cs is a two-bit binary number. We 
call s the sum bit and c the carry bit. < 


Half-Adder Circuit Design a half-adder combinatorial circuit. 


SOLUTION The table for the half-adder circuit is as follows: 


x oy cs 
1 1 1 0 
1 O 0 1 
0 1 dQ 1 
0 0 0 0 


This function has two outputs, c and s. We observe that c = xy and s = x @ y. Thus 
we obtain the half-adder circuit of Figure 11.5.8. We used the circuit of Figure 11.5.6 to 
realize the exclusive-OR. 


x 
y AY 


Figure 11.5.8 A half-adder circuit. < 


Example 11.5.14 
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A full adder sums three bits and is useful for adding two bits and a third carry bit 
from a previous addition. 


Definition 11.5.13 > A full adder accepts as input three bits x, y, and z and 
produces as output the binary sum cs of x, y, and z. The term cs is a two-bit binary 
number. < 


Full-Adder Circuit Design a full-adder combinatorial circuit. 


SOLUTION The table for the full-adder circuit is as follows: 


x y Zz c s 
1 1 1 1 1 
1 1 0 1 0 
1 0 1 1 0 
1 0 0 0 1 
0 1 1 1 0 
0 1 0 0 1 
0 0 1 0 1 
0 0 0 0 0 


Checking the eight possibilities, we see that s = x @ y @ z; hence we can use two 
exclusive-OR circuits to compute s. 
To compute c, we first find the disjunctive normal form 


C = XyZV XYZ V XYZ V XZ (11.5.7) 
of c. Next, we use (11.5.4) and (11.5.5) to simplify (11.5.7) as follows: 


XYZ V XYZ V XYZ V XyZ = XY V Xz V XyZ 
= xy V xyz V XYZ V XYZ 
= XV V xXZV XyZ 
= xy V XZ V XYZ V Xyz 
= xy VXZV yz. 


Additional gates can be eliminated by writing c = xy Vz(x Vy). We obtain the full-adder 
circuit given in Figure 11.5.9. 


x 
y 


Figure 11.5.9 A full-adder circuit. < 


Our last example shows how we may use half-adder and full-adder circuits to 
construct a circuit to add binary numbers. 
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Example 11.5.15 A Circuit to Add Binary Numbers Using half-adder and full-adder circuits, design a 


combinatorial circuit that computes the sum of two three-bit numbers. 


SOLUTION We will let M = x3x2x,; and N = y3y2y, denote the numbers to be added 
and let z4z3z2z; denote the sum. The circuit that computes the sum of M and N is drawn 
in Figure 11.5.10. It is an implementation of the standard algorithm for adding numbers 
since the “carry bit” is indeed carried into the next binary addition. 


Figure 11.5.10 A combinatorial circuit that 
computes the sum of two three-bit numbers. 


Figure 11.5.11 shows how the circuit adds 011 = x3x2x; and 101 = y3y2y,. The 
bits x, = 1 and y; = 1 are input to the half adder. Since 1 + 1 = 10, the sum bit 
s = 0 = z and the carry bit c = 1. Next the carry bit (1) and the bits x. = 1 and y2 = 0 
are input to the (middle) full adder. Since | + 1 + 0 = 10, the sum bit s = 0 = z) and 
the carry bit c = 1. At the last step, the carry bit (1) and the bits x3 = 0 and y3 = | are 
input to the (bottom) full adder. Since 1 + 0+ 1 = 10, the sum bit s = 0 = z;3 and the 
carry bit c = 1 = zy. The sum 011 + 101 = 1000 = 2473292). 


Figure 11.5.11 The combinatorial circuit of Figure 11.5.10 
adding 011 = x3x2x; and 101 = y3y2y;. The sum is 
1000 = 74732221. < 


If we were using three-bit registers for addition, so that the sum of two three-bit 
numbers would have to be no more than three bits, we could use the z, bit in Example 
11.5.15 as an overflow flag. If zs = 1, overflow occurred; if zy = 0, there was no 
overflow. 

In the next chapter (Example 12.1.3), we discuss a sequential circuit that makes 
use of a primitive internal memory to add binary numbers. 


11.5 Review Exercises 


— 


. What is a gate? 
. What is a functionally complete set of gates? 
. Give examples of functionally complete sets of gates. 


. What is a NAND gate? 


an & WwW NY 


. Is the set {NAND} functionally complete? 


\— i 
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. What is the minimization problem? 
. What is an integrated circuit? 
. Describe a half-adder circuit. 


. Describe a full-adder circuit. 


11.5 Exercises 


1. Show that the set of gates {OR, NOT} is functionally complete. 


Show that each set of gates in Exercises 2-5 is not functionally 
complete. 


. {AND} 
. {OR} 


3. {NOT} 
5. {AND, OR} 


2 
4 
6. Draw a circuit using only NAND gates that computes xy. 
7. Write xy using only ¢. 

8 


. Prove or disprove: x * (vy t z) = (x t y) ¢ g, for all 
X,y,Z€ Zp. 


Write Boolean expressions to describe the multiple output circuits 
in Exercises 9-11, 


9. 
x4 
*2 yi 
x3 y2 
10. 
x1 
x Bl 
y2 
*s y3 
x4 y4 
11. 
yy 
x4 
y2 
x 
7 3 
x3 


15. 
x16. 


. Design circuits using only NAND gates to compute the func- 


tions of Exercises 1-10, Section 11.4. 


. Can you reduce the number of NAND gates used in any of 


your circuits for Exercise 12? 


. Design circuits using as few AND, OR, and NOT gates 


as you can to compute the functions of Exercises 1-10, 
Section 11.4. 


Design a half-adder circuit using only NAND gates. 
Design a half-adder circuit using five NAND gates. 


A NOR gate receives inputs x; and x2, where x; and x2 are bits, 
and produces output denoted x, | x2, where 


ifx; =lorx=1 
otherwise. 


auba={9 


. Write xy, x V y, x, and x ¢ y in terms of J. 

. Write x | y in terms of +. 

. Write the logic table for the NOR function. 

. Show that the set of gates {NOR} is functionally complete. 


. Design circuits using only NOR gates to compute the func- 


tions of Exercises 1-10, Section 11.4. 


. Can you reduce the number of NOR gates used in any of your 


circuits for Exercise 21? 


. Design a half-adder circuit using only NOR gates. 
. Design a half-adder circuit using five NOR gates. 


. Design a circuit with three inputs that outputs | precisely when 


two or three inputs have value 1. 


. Design a circuit that multiplies the binary numbers x2x; and 


y2y1. The output will be of the form z4z3z221. 


. A 2’s module is a circuit that accepts as input two bits b and 


FLAGIN and outputs bits c and FLAGOUT. If FLAGIN = 1, 
then c = b and FLAGOUT = 1. If FLAGIN = 0 and b = 1, 
then FLAGOUT = 1. If FLAGIN = 0 and b = 0, then 
FLAGOUT = 0. If FLAGIN = 0, then c = b. Design a circuit 
to implement a 2’s module. 
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The 2’s complement of a binary number can be computed by using 


the following algorithm. 


Algorithm 11.5.16 
Finding the 2's Complement 


This algorithm computes the 2’s complement Cy Cy_ :-- 
C2C; of the binary number M = ByBy-\--- B2B,. The 
number M is scanned from right to left and the bits are 
copied until 1 is found. Thereafter, if B; = 0, we set C; = 1 
and if B; = 1, we set C; = 0. The flag F indicates whether 
a | has been found (F = true) or not (F = false). 


Input: ByBy-1--: By 
Output: CyCy—1---Ci 


twos_complement(B) { 

ie ttalse 

all 

while (=F Ai < N) { 
CB; 
ift(B;3——l))) 

P= true 

i=i+1 

} 

while (i < N) { 
C;=8;@1 
i=i+1 

} 


return C 
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Find the 2’s complement of the numbers in Exercises 28-30 using 
Algorithm 11.5.16. 


28. 
31. 


«32, 


«33, 


«34, 
x35, 


101100 29. 11011 30. 011010110 


Using 2’s modules, design a circuit that computes the 2’s com- 
plement y3y2y; of the three-bit binary number x3x2.1. 


Let * be a binary operator on a set S containing 0 and 1. Write 
a set of axioms for *, modeled after rules that NAND satisfies, 
so that if we define 


X= XX 
xVy = (x *x) * (y *y) 
xAy = (x*y) * (x *y), 


then (S, V, A, , 0, 1) is a Boolean algebra. 


Let * be a binary operator on a set S containing 0 and 1. Write 
a set of axioms for *, modeled after rules that NOR satisfies, 
and definitions for ~, Vv, and A so that (S$, V, A,~,0, 1) isa 
Boolean algebra. 


Show that {>} is functionally complete (see Definition 1.3.3). 


Let B(x, y) be a Boolean expression in the variables x and y 

that uses only the operator <> (see Definition 1.3.8). 

(a) Show that if B contains an even number of x’s, the values 
of B(x, y) and B(x, y) are the same for all x and y. 


(b) Show that if B contains an odd number of x’s, the values 
of B(x, y) and B(x, y) are the same for all x and y. 


(c) Use parts (a) and (b) to show that {<>} is not functionally 
complete. 


This exercise was contributed by Paul Pluznikov. 


Chapter 11 Notes 


General references on Boolean algebras are [Hohn; and Mendelson]. [Mendelson] contains 
over 150 references on Boolean algebras and combinatorial circuits. Books on logic design 
include [Kohavi; McCalla; and Ward]. 

[Hailperin] gives a technical discussion of Boole’s mathematics. Additional references 
are also provided. Boole’s book, The Laws of Thought, has been reprinted (see [Boole]). 

Because of our interest in applications of Boolean algebra, most of our discussion was 
limited to the Boolean algebra (Z2, V, A, ~, 0, 1). However, versions of most of our results 
remain valid for arbitrary, finite Boolean algebras. 


Boolean expressions in the symbols x;,.. 


.,X, Over an arbitrary Boolean algebra 


(S,+, +, ',0, 1) are defined recursively as 


@ For each s € S, s is a Boolean expression. 


™@ x ,,...,X, are Boolean expressions. 


If X; and X> are Boolean expressions, so are 


(Mi), Xi, 


X; +X, X,-X. 


A Boolean function over S is defined as a function from S” to S of the form 


where X is a Boolean expression in the symbols x;, . . 


F Qis 005 Mn) =X (Xie: 


Xn); 


.,X, over S. A disjunctive normal form 


can be defined for f. Another result is that if X and Y are Boolean expressions over S and 


XX, +++ Xn) = YX, ~ ++ Xn) 


PAADAMNEWNE 
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for all x; € S, then Y is derivable from X using the definition (Definition 11.3.1) of a Boolean 
algebra. Other results are that any finite Boolean algebra has 2” elements and that if two 
Boolean algebras both have 2” elements they are essentially the same. It follows that any 
finite Boolean algebra is essentially Example 11.3.3, the Boolean algebra of subsets of a 
finite, universal set U. The proofs of these results can be found in [Mendelson]. 
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Section 11.1 


. Combinatorial circuit 
. Sequential circuit 


AND gate 
OR gate 


. NOT gate (inverter) 

. Logic table of a combinatorial circuit 
- Boolean expression 

. Literal 


Section 11.2 


9. 


10. 
11. 
12. 


Properties of A, V, and ~: associative laws; commutative 
laws; distributive laws; identity laws; complement laws (see 
Theorem 11.2.1) 

Equal Boolean expressions 

Equivalent combinatorial expressions 

Combinatorial expressions are equivalent if and only if the 
Boolean expressions that represent them are equal. 


Section 11.3 


13. 
14. 
15. 


16. 


Boolean algebra 

x’: Complement of x 

Properties of Boolean algebras: Idempotent laws; bound 
laws; absorption laws; involution law; 0 and 1 laws; 
De Morgan’s laws 

Dual of statement involving Boolean expressions 
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17. 


The dual of a theorem about Boolean algebras is also a 
theorem. 


Section 11.4 


18. 
19. 
20. 
21. 
22. 


23. 
24. 


Exclusive-OR 

Boolean function 

Minterm: yj A y2 A-++ A yp, where each y; is x; or X; 
Disjunctive normal form 

How to write a Boolean function in disjunctive normal form 
(Theorem 11.4.6) 

Maxterm: y; V y2 V --+ V y,, where each y; is x; or X; 
Conjunctive normal form 


Section 11.5 


25. 
26. 
27. 


28. 
29. 
30. 
31. 
32. 
33. 


Gate 

Functionally complete set of gates 

The sets of gates {AND, NOT} and {OR, NOT} are func- 
tionally complete. 

NAND gate 

The set {NAND} is a functionally complete set of gates. 
Minimization problem 

Integrated circuit 

Half-adder circuit 

Full-adder circuit 


1. 


Write a Boolean expression that represents the combinato- 
rial circuit and write the logic table. 


Are the combinatorial circuits in Exercises 2 and 3 equivalent? 
Explain. 


: 2. 
aay eS 
Zz 
y y 
(a) (b) 
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3. In Exercises 13-16, find the disjunctive normal form of a 
Boolean expression having a logic table the same as the given 
table and draw the combinatorial circuit corresponding to the 

: disjunctive normal form. 

. 13. 

xX} x2 Xx3 y 

Zz 

1 1 1 0 
1 1 0 0 
1 0 1 0 
(a) 1 0 0 1 
0 1 1 0 
0 1 0 0 
0 0 1 0 
. 0 oO 0 0 
» 
14. 
+ MX y 
(b) ae ae 0 
4. If U is a universal set and S = P(U), the power set of U, ! i 0 ! 
th 1 0 1 0) 
en 
1 0 0 1 
_ 0 1 1 0 
(S,U, 9,7, @, U) 0 1 0 0 
0 0 1 0 
is a Boolean algebra. State the bound and absorption laws 0 0 0 0 
for this Boolean algebra. 
5. Find the value of the Boolean expression 
2 a5. 
(x1 A X2) V 2 A x3) 

nal x2 x3 
if x; =x. =Oandx3= 1. 1 1 1 1 
6. Find a combinatorial circuit corresponding to the Boolean : : 0 0 
expression of Exercise 5. ! 0 ! 0 
1 0 0 1 
Prove or disprove the equations in Exercises 7 and 8. 0 1 1 0 

7. XAY)VEADV KAVAD =VV (KAZ) : : : 

8. (XKAVADVAVD=(XKADVEAD 0 0 0 1 

9. Show that the following circuit is not a combinatorial cir- 

cuit. 

x 
16. 
y 

x] x2 x3 y 
1 1 1 0 
1 1 0 1 
10. Prove that in any Boolean algebra, (x- (x +y-0))! = for 1 0 1 0 
all x and y. 1 0 0) 1 
11. Write the dual of the statement of Exercise 10 and prove it. 0 1 1 1 
12. Let U be the set of positive integers. Let S be the collection 0 l 0 0 
of finite subsets of U. Why does (S,U, 9,7, 2, U) fail to 0 0 1 1 
be a Boolean algebra? 0 0 0 0 


17. Write the logic table for the circuit 
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18. Find a Boolean expression in disjunctive normal form for 


the circuit of part (a) of Exercise 3. Use algebraic methods 
to simplify the disjunctive normal form. Draw the circuit 
corresponding to the simplified expression. 


19. Design a circuit using only NAND gates to compute x © y. 


20. Design a full-adder circuit that uses two half adders and one 


OR gate. 
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1. 


Write a program that inputs a Boolean expression in x andy 5. 
and prints the logic table of the expression. 


. Write a program that inputs a Boolean expression inx,y,and 6. 


z and prints the logic table of the expression. 


. Write a program that outputs the disjunctive normal form of — 7. 


a Boolean expression p(x, y). 


. Write a program that outputs the conjunctive normal form of 


a Boolean expression p(x, y). 


Write a program that outputs the disjunctive normal form of 
a Boolean expression p(x, y, Z). 


Write a program that outputs the conjunctive normal form of 
a Boolean expression p(x, y, Z). 


Write a program that computes the two’s complement of an 
n-bit binary number. 


12.1 Sequential Circuits and 
Finite-State Machines 

12.2  Finite-State Automata 

12.3 Languages and 
Grammars 

12.4 Nondeterministic 
Finite-State Automata 

12.5 Relationships Between 
Languages and 
Automata 

12.1 
Go Online 


For more on sequential 
circuits and finite-state 
machines, see 
goo.gl/VVncUc 
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Chapter 12 


AUTOMATA, 
GRAMMARS, 
AND LANGUAGES 


In Chapter 11 we discussed combinatorial circuits in which the output depended only 
on the input. These circuits have no memory. In this chapter we begin by discussing 
circuits in which the output depends not only on the input but also on the state of the 
system at the time the input is introduced. The state of the system is determined by 
previous processing. In this sense, these circuits have memory. Such circuits are called 
sequential circuits and are obviously important in computer design. 

Finite-state machines are abstract models of machines with a primitive internal 
memory. A finite-state automaton is a special kind of finite-state machine that is closely 
linked to a particular type of language. In the latter part of this chapter, we will discuss 
finite-state machines, finite-state automata, and languages in some detail. 


Sequential Circuits and Finite-State 
Machines 


Operations within a digital computer are carried out at discrete intervals of time. Output 
depends on the state of the system as well as on the input. We will assume that the state 
of the system changes only at time ¢ = 0, 1,.... A simple way to introduce sequencing 
in circuits is to introduce a unit time delay. 


Definition 12.1.1 >» A unit time delay accepts as input a bit x, at time t and 
outputs x;_1, the bit received as input at time ¢ — 1. The unit time delay is drawn as 
shown in Figure 12.1.1. 


xy 


Delay Xt-1 


Figure 12.1.1 Unit time delay. < 
As an example of the use of the unit time delay, we discuss the serial adder. 


Definition 12.1.2 >» A serial adder accepts as input two binary numbers 


and 


xX = Oxyxy-1---X0 y = Oywyn-1--- Yo 


and outputs the sum zy+1Zy -- + Zo of x and y. The numbers x and y are input sequentially 
in pairs, X, Yo; -.-3 Xv, yn; O, O. The sum is output Zo, Z1,.-., Zv41- > | 


Example 12.1.3 


Figure 12.1.2 A 


serial-adder circuit. 
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Serial-Adder Circuit A circuit, using a unit time delay, that implements a serial adder 
is shown in Figure 12.1.2. Show how the serial adder computes the sum of x = 010 and 
y=Oll1. 


SOLUTION We begin by setting x9 = 0 and yo = 1. (We assume that at this instant 
i = 0. This can be arranged by first setting x = y = 0.) The state of the system is 
shown in Figure 12.1.3(a). Next, we set x} = y,; = 1. The unit time delay sends i = 0 
as the third bit to the full adder. The state of the system is shown in Figure 12.1.3(b). 
Finally, we set x2 = y2 = O. This time the unit time delay sends i = 1 as the third bit 
to the full adder. The state of the system is shown in Figure 12.1.3(c). We obtain the 
sum z = 101. 


Figure 12.1.3 Computing 010 + 011 with the serial-adder circuit. < 


A finite-state machine is an abstract model of a machine with a primitive internal 
memory. 


Definition 12.1.4 > A finite-state machine M consists of 


(a) A finite set Z of input symbols. 

(b) A finite set O of output symbols. 

(c) A finite set S of states. 

(d) A next-state function f from S x T into S. 
(e) An output function g from S x T into O. 
(f) An initial stateo € S. 


We write M = (Z,O,S,f, g,0). < 


Example 12.1.5 Let Z= {a, b}, O = {0, 1}, and S = {o0, 01}. Define the pair of functions f: Sx I > S 


and g: S x I— O by the rules given in Table 12.1.1. 


TABLE 12.1.1 & 


Then M = (Z, O, S,f, g, 00) is a finite-state machine. 
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Example 12.1.6 


Figure 12.1.4 A transition 


diagram. 


Example 12.1.9 


Table 12.1.1 is interpreted to mean 


(Go, 4) = 90 g(o0, a) = 0, 
f(oo, b) = % g(oo, b) = 1, 
f(,a =o g(o,,a) = 1, 
f(,b =o g(o1, b) = 0. < 


The next-state and output functions can also be defined by a transition diagram. 
Before formally defining a transition diagram, we will illustrate how a transition diagram 
is constructed. 


Draw the transition diagram for the finite-state machine of Example 12.1.5. 


SOLUTION The transition diagram is a digraph. The vertices are the states (see 
Figure 12.1.4). The initial state is indicated by an arrow as shown. If we are in state 
o and inputting i causes output o and moves us to state 0’, we draw a directed edge from 
vertex o to vertex o’ and label it i/o. For example, if we are in state oo, and we input a, 
Table 12.1.1 tells us that we output 0 and remain in state 09. Thus we draw a directed 
loop on vertex o and label it a/0 (see Figure 12.1.4). On the other hand, if we are in state 
oo and we input b, we output 1 and move to state o,. Thus we draw a directed edge from 
do to oj and label it b/1. By considering all such possibilities, we obtain the transition 
diagram of Figure 12.1.4. < 


Definition 12.1.7 >» LetM=(Z,0,S,f,g,c) bea finite-state machine. The 
transition diagram of M is a digraph G whose vertices are the members of S. An arrow 
designates the initial state o. A directed edge (0), 02) exists in G if there exists an input 
i with f (1, i) = o2. In this case, if g(o1, 1) = 0, the edge (0), 02) is labeled i/o. < 


We can regard the finite-state machine M = (Z, O, S,f, g,0) as a simple com- 
puter. We begin in state o, input a string over Z, and produce a string of output. 


Definition 12.1.8 >» Let M = (Z,O,S,f,g,c) be a finite-state machine. An 
input string for M is a string over Z. The string 


Yiec'Yn 


is the output string for M corresponding to the input string 


=X, +++ Xp 
if there exist states 09, ..., 0, € S with 

09 =O 

oj = f(Oi-1, Xi) fort = 154.250 

Yi = B(Gi-1, Xi) for i= 1.55.57 4 


Find the output string corresponding to the input string 
aababba (12.1.1) 
for the finite-state machine of Example 12.1.5. 


SOLUTION Initially, we are in state oo. The first symbol input is a. We locate the out- 
going edge in the transition diagram of M (Figure 12.1.4) from oo labeled a/x, which 


Example 12.1.10 


Example 12.1.11 
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tells us that if a is input, x is output. In our case, 0 is output. The edge points to the next 
state, og. Next, a is input again. As before, we output 0 and remain in state o9. Next, b 
is input. In this case, we output | and change to state 0. Continuing in this way, we find 
that the output string is 


0011001. (12.1.2) 
< 


A Serial-Adder Finite-State Machine Design a finite-state machine that performs serial 
addition. 


SOLUTION We will represent the finite-state machine by its transition diagram. Since 
the serial adder accepts pairs of bits, the input set is {00, 01, 10, 11} and the output set 
is {0, 1}. 

Given an input xy, we take one of two actions: Either we add x and y, or we add 
x, y, and 1, depending on whether the carry bit was 0 or 1. Thus there are two states, 
which we will call C (carry) and NC (no carry). The initial state is NC. At this point, 
we can draw the vertices and designate the initial state in our transition diagram (see 
Figure 12.1.5). 

Next, we consider the possible inputs at each vertex. For example, if 00 is input to 
NC, we should output 0 and remain in state NC. Thus NC has a loop labeled 00/0. As 
another example, if 11 is input to C, we compute | + 1+ 1 = 11. In this case we output 
1 and remain in state C. Thus C has a loop labeled 11/1. As a final example, if we are 
in state VC and 11 is input, we should output 0 and move to state C. By considering all 
possibilities, we arrive at the transition diagram of Figure 12.1.6. 


00/0 o1/1 01/0 
11/0 
—— 10/0 
—@ © a 
: 10/1 11/1 

Figure 12.1.5 Two states for 
the serial-adder finite-state Figure 12.1.6 A finite-state machine 
machine. that performs serial addition. 


The SR Flip-Flop A flip-flop is a basic component of digital circuits since it serves as a 
one-bit memory cell. The SR flip-flop (or set-reset flip-flop) can be defined by the table 


Ss R | Q 
1 1 Not allowed 

1 0 1 

0 1 0 


0 0 f if S was last equal to 1 


0 if R was last equal to 1 


The SR flip-flop “remembers” whether S or R was last equal to 1. (If Q = 1, S was last 
equal to 1; if Q = 0, R was last equal to 1.) We can model the SR flip-flop as a finite-state 
machine by defining two states: “S was last equal to 1” and “R was last equal to 1” (see 
Figure 12.1.7). We define the input to be the new values of S and R; the notation sy means 
that S = s and R = r. We define Q to be the output. We have arbitrarily designated the 
initial state as “S was last equal to 1.” A sequential circuit implementation of the SR 
flip-flop is shown in Figure 12.1.8. 
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S 
00/1 10/1 00/0 01/0 
S was last equal to 1 R was last equal to 1 
Q 
10/1 R 

Figure 12.1.7 The SR flip-flop as a finite-state machine. Figure 12.1.8 A sequential circuit implementation of 

the SR flip-flop. < 
12.1 Review Exercises 
1. What is a unit time delay? 4. What is a transition diagram? 
2. What is a serial adder? 5. What is the SR flip-flop? 


3. Define finite-state machine. 


12.1 Exercises 


In Exercises 1-5, draw the transition diagram of the finite-state 4. T= {a, b,c}, O = {0, 1}, S= {o0, o1, 02} 
machine (I, O, S, f, g, 00). 


1. T= {a,b}, O = {0, 1}, S = {o0, 1} f . 
we a b c a c 
f 8 S 
ft a b ¢ b 00 00 onl 02 1 0 
2 o| Oo oO 00 1 1 1 
on) onl onl 1 1 02 02 onl 00 1 0 0 
| 00 oO 0 1 


5. I= {a, b,c}, O = {0, 1, 2}, S = {o0, 01, 02, 03} 
2. T= {a, b}, O = {0, 1}, 5 = {o0, o1} 


f 8 
f 8 
T a b c a b c 
T a b a b S 

Ss 
er) oO} 00 02 1 1 2 
00 oO! 00 oO! 00 02 02 2 0 0 
oO! o% §©9% 1 1 o 03 03 + «0% 1 0 1 
03 oO! o1 0% 2 0 2 


3. Z= {a,b}, O = {0, 1}, S = {o0, 01, 02} 


In Exercises 6-10, find the sets T, O, and S, the initial state, and 
f & the table defining the next-state and output functions for each finite- 
state machine. 


Ss 6. 


b/1 all 
er) o| o| 0 1 
eal o2 oO! 1 
02 G90 0 9 —* ald (21) bil 


a 


10. 


b/2 
() al0 
Camnost 
b/L 


b/1 


al0 
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18 
19 
20. 


In 


. aabbabaab; Exercise 8 
. bbababbabaaa; Exercise 9 


. cacbccbhaabac; Exercise 10 


Exercises 21-26, design a finite-state machine having the given 


properties. The input is always a bit string. 


21 


22. 
23. 


24. 
25. 


26. 
27. 


«28. 


*29, 


In Exercises 11-20, find the output string for the given input string 
and finite-state machine. 


11. 
12. 
13. 
14, 
15. 
16. 
17. 


abba; Exercise 1 
abba; Exercise 2 
aabbaba; Exercise 3 
aabbcc; Exercise 4 
aabaab; Exercise 5 
aaa; Exercise 6 


baaba; Exercise 7 


. Outputs 1 if an even number of 1’s have been input; otherwise, 
outputs 0 


Outputs 1 if two or more 1’s are input; otherwise, outputs 0 


Outputs | if k 1’s have been input, where k is a multiple of 3; 
otherwise, outputs 0 


Outputs 1 whenever it sees 101; otherwise, outputs 0 


Outputs | when it sees the first 0 and until it sees another 0; 
thereafter, outputs 0; in all other cases, outputs 0 


Outputs | when it sees 101 and thereafter; otherwise, outputs 0 
Let a = x, --- Xx, bea bit string. Let 6B = y; --- yn, where 
_ ja ifx=0 
Vb if = 


fori=1,...,n.Lety =yn---y1. 

Show that if y is input to the finite-state machine of 
Figure 12.1.4, the output is the 2’s complement of a (see 
Algorithm 11.5.16 for a description of 2’s complement). 


Show that there is no finite-state machine that receives a bit 
string and outputs | whenever the number of 1’s input equals 
the number of 0’s input and outputs 0 otherwise. 


Show that there is no finite-state machine that performs 
serial multiplication. Specifically, show that there is no finite- 
state machine that inputs binary numbers X = x1 ---X,, Y = 
y1+++Yn, as the sequence of two-bit numbers 

XnYn> = Xn—-1Yn-1, seey XLVI, 00, tees 00, 
where there are n 00’s, and outputs z2,,...,Z1, where Z = 
Z1+*+ Zan = XY. 


Example: {f there is such a machine, to multiply 101 x 
1001 we would input 11,00,10,01,00,00,00,00. The first pair 
11 is the pair of rightmost bits (101, 1001); the second pair 
00 is the next pair of bits (101, 1001); and so on. We pad the 
input string with four pairs of 00’s—the length of the longest 
number 1001 to be multiplied. Since 101 x 1001 = 101101, 
it is alleged that we obtain the output shown in the following 
table. 


Input Output 


11 
00 
10 
01 
00 
00 
00 
00 


COOrR OFF OF 
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12.2 


Go Online 

A finite-state automaton 
simulator is at 
goo.gl/VVncUc 


Example 12.2.2 


Finite-State Automata 


A finite-state automaton is a special kind of finite-state machine. Finite-state automata 
are of special interest because of their relationship to languages, as we shall see in 
Section 12.5. 


Definition 12.2.1 > A finite-state automaton A = (Z, O, S,f, g, 0) is a finite- 
state machine in which the set of output symbols is {0, 1} and where the current state 
determines the last output. Those states for which the last output was | are called 
accepting states. < 


Draw the transition diagram of the finite-state machine A defined by the table. The initial 
state is og. Show that A is a finite-state automaton, and determine the set of accepting 
states. 


f 8 
T a b a b 
Ss 
90 Oo! 00 1 0 
oO! o2 fer) i 0 
o2 02 00 i 0) 


SOLUTION The transition diagram is shown in Figure 12.2.1. If we are in state oo, the 
last output was 0. If we are in either state o; or o2, the last output was 1; thus A is a 
finite-state automaton. The accepting states are o; and 02. 


() b/0 i a/l 
all 
ly 0 


b/0 


Figure 12.2.1 The transition diagram for 
Example 12.2.2. < 


Example 12.2.2 shows that the finite-state machine defined by a transition diagram 
will be a finite-state automaton if the set of output symbols is {0, 1} and if, for each state 
o, all incoming edges to o have the same output label. 

The transition diagram of a finite-state automaton is usually drawn with the 
accepting states in double circles and the output symbols omitted. When the transition 
diagram of Figure 12.2.1 is redrawn in this way, we obtain the transition diagram of 
Figure 12.2.2. 


Figure 12.2.2 The transition diagram of 
Figure 12.2.1 redrawn with accepting states in double 
circles and output symbols omitted. 
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Example 12.2.3 Draw the transition diagram of the finite-state automaton of Figure 12.2.3 as a transition 
= _ diagram of a finite-state machine. 


SOLUTION Since 02 is an accepting state, we label all its incoming edges with output 1 
(see Figure 12.2.4). The states og and oj are not accepting, so we label all their incoming 
edges with output 0. We obtain the transition diagram of Figure 12.2.4. 


b/0 
Figure 12.2.3 A finite-state Figure 12.2.4 The finite-state 
automaton. automaton of Figure 12.2.3 redrawn 
as a transition diagram of a 
finite-state machine. < 


As an alternative to Definition 12.2.1, we can regard a finite-state automaton A as 
consisting of 


1. A finite set Z of input symbols 

2. A finite set S of states 

3. A next-state function f from S x Tinto S 
4. A subset A of S of accepting states 

5. An initial stateo € S. 


If we use this characterization, we write A = (Z, S,f, A, o). 


Example 12.2.4 The transition diagram of the finite-state automaton A = (Z, S, f, A, o), where 
T= {a,b}, S={o0,01,02}, A= {or}, o =o, 
and f is given by the following table 


£ a b 
S 
00 00 onl 
oO! 0 02 
02 90 0o2 


is shown in Figure 12.2.5. 


Figure 12.2.5 The transition diagram for 
Example 12.2.4. < 
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Example 12.2.6 


Example 12.2.7 


Example 12.2.8 


Figure 12.2.6 A finite-state 
automaton that accepts precisely 
those strings over {a, b} that 
contain no a’s. 


Example 12.2.9 


If a string is input to a finite-state automaton, we will end at either an accepting 
or a nonaccepting state. The status of this final state determines whether the string is 
accepted by the finite-state automaton. 


Definition 12.2.5 >» LetA = (Z,S,f,A,c) be a finite-state automaton. Let 


a = x,--+-Xx, bea string over Z. If there exist states 00, ..., 0, satisfying 
(a) 09 =o 
(b) S(Gi-1, Xi) = 0; fori = 1, ong AL 
(c) o, € A, 


we say that a is accepted by A. The null string is accepted if and only if o € A. We let 
Ac(A) denote the set of strings accepted by A and we say that A accepts Ac(A). 

Let a = x, ---x, bea string over Z. Define states 09, ..., 0, by conditions (a) and 
(b) above. We call the (directed) path (0, ..., 0) the path representing a in A. <4 


It follows from Definition 12.2.5 that if the path P represents the string a in a 
finite-state automaton A, then A accepts a if and only if P ends at an accepting state. 


Is the string abaa accepted by the finite-state automaton of Figure 12.2.2? 


SOLUTION We begin at state 09. When a is input, we move to state 0}. When b is input, 
we move to state o9. When a is input, we move to state o;. Finally, when the last symbol 
ais input, we move to state 02. The path (09, 01, 00, 01, 02) represents the string abaa. 
Since the final state o> is an accepting state, the string abaa is accepted by the finite-state 
automaton of Figure 12.2.2. < 


Is the string @ = abbabba accepted by the finite-state automaton of Figure 12.2.3? 


SOLUTION The path representing @ terminates at 0). Since o is not an accepting state, 
the string a is not accepted by the finite-state automaton of Figure 12.2.3. < 


We next give two examples illustrating design problems. 
Design a finite-state automaton that accepts precisely those strings over {a, b} that con- 
tain no a’s. 
SOLUTION The idea is to use two states: 


A: Ana was found. 


NA: Noa’s were found. 


The state NA is the initial state and the only accepting state. It is now a simple matter 
to draw the edges (see Figure 12.2.6). Notice that the finite-state automaton correctly 
accepts the null string. < 


Design a finite-state automaton that accepts precisely those strings over {a, b} that con- 
tain an odd number of a’s. 


SOLUTION This time the two states are 


E: Aneven number of a’s was found. 


O: An odd number of a’s was found. 


12.2 @ Finite-State Automata 577 


The initial state is F and the accepting state is O. We obtain the transition diagram 
shown in Figure 12.2.7. < 


A finite-state automaton is essentially an algorithm to decide whether or not a given 
string is accepted. As an example, we convert the transition diagram of Figure 12.2.7 to 
an algorithm. 


Algorithm 12.2.10 This algorithm determines whether a string over {a, b} is accepted by the finite-state 
automaton whose transition diagram is given in Figure 12.2.7. 


Input: _n, the length of the string (n = O designates the null string); 5152 --- 5, 
the string 


Output: “Accept” if the string is accepted 
“Reject” if the string is not accepted 


Figure 12.2.7 A finite-state fsa(s, n) { 
automaton that accepts precisely Slate sale 
those strings over {a, b} that fori —s lator 
contain an odd number of a’s. if (state == ‘E’” \ 5; == ‘a’) 
Siac On 
ita (State—— mi Oy as — — sa) 
ee = 1a" 
} 
lie(Staiei—— ©) 
return “Accept” 
else 


return “Reject” 
If two finite-state automata accept precisely the same strings, we say that the au- 
tomata are equivalent. 


Definition 12.2.11 > The finite-state automata A and A’ are equivalent if 
Ac(A) = Ac(A’). < 


Example 12.2.12 Itcan be verified that the finite-state automata of Figures 12.2.6 and 12.2.8 are equivalent 
ep-————p\pehpv == (See Exercise 33). 


Figure 12.2.8 A finite-state automaton equivalent to that 


in Figure 12.2.6. 4 


If we define a relation R on a set of finite-state automata by the rule A R A’ if A 
and A’ are equivalent (in the sense of Definition 12.2.11), R is an equivalence relation. 
Each equivalence class consists of a set of mutually equivalent finite-state automata. 
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12.2 Review Exercises 


1. Define finite-state automaton. 3. What are equivalent finite-state automata? 


2. What does it mean for a string to be accepted by a finite-state 
automaton? 


12.2 Exercises 


In Exercises 1-3, show that each finite-state machine is a finite- 6. 
state automaton and redraw the transition diagram as the diagram 
of a finite-state automaton. 


ik 
all 


— al0 
(20) (1) In Exercises 7-9, draw the transition diagram of the finite-state 
b/L automaton (Z, S, f, A, 00). 


2. 7. I= {a, b}, S = {o0, 01, 02}, A = {00} 


a/1 
ete HD a em 
b/0 SS 
HG a b 
al0 S 
3. 00 ol 00 
o1 02 «90 
02 a 02 
8. Z = {a, b,c}, S = {00, 01, 02, 03}, A = {00, 02} 
b/0 
f 
In Exercises 4-6, redraw the transition diagram of the finite-state Tt a b e 
automaton as the transition diagram of a finite-state machine. Ss 
~ cor) oO! 00 02 
oO! en) 03 er) 
02 03 02 00 
03 forl 00 foal 
5. 


we a b 
Ss 
00 oO oO 
o1 0 02 
02 0 oO 


10. 


11. 


12. 


For each finite-state automaton in Exercises 1-6, find the sets 
T, S, and A, the initial state, and the table defining the next- 
state function. 

Which of the finite-state machines of Exercises 1-10, Sec- 
tion 12.1, are finite-state automata? 

What must the table of a finite-state machine M look like in 
order for M to be a finite-state automaton? 


In Exercises 13-17, determine whether the given string is accepted 
by the given finite-state automaton. 


13. 
15. 
17. 
18. 


19. 


x20. 


abbaa; Figure 12.2.2 
aabaabb; Figure 12.2.5 


aaabbbaab; Exercise 6 


14. abbaa; Figure 12.2.3 
16. aaababbab; Exercise 5 


Show that a string a over {a, b} is accepted by the finite-state 
automaton of Figure 12.2.2 if and only if a ends with a. 
Show that a string a over {a, b} is accepted by the finite-state 
automaton of Figure 12.2.5 if and only if a ends with bb. 
Characterize the strings accepted by the finite-state automata 
of Exercises 1-9. 


In Exercises 21-31, draw the transition diagram of a finite-state 
automaton that accepts the given set of strings over {a, b}. 


21. 
23. 
25. 
26. 
2. 
29. 
*31. 
32, 


33. 


Even number of a’s 22. At least one b 


Exactly one b 24. Exactly two a’s 
Contains m a’s, where m is a multiple of 3 

At least two a’s 

*28. Contains abba 
Every b is followed bya «30. Ends with aba 


Starts with ab and ends with baa 


Starts with baa 


Write algorithms, similar to Algorithm 12.2.10, that decide 
whether or not a given string is accepted by the finite-state 
automata of Exercises 1-9. 


Give a formal argument to show that the finite-state automata 
of Figures 12.2.6 and 12.2.8 are equivalent. 


34 


35; 


36. 


37. 
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. Let L be a finite set of strings over {a, b}. Show that there is a 
finite-state automaton that accepts L. 


Let L be the set of strings accepted by the finite-state automa- 
ton of Exercise 5. Let S denote the set of all strings over {a, b}. 
Design a finite-state automaton that accepts S — L. 


Let L; be the set of strings accepted by the finite-state 
automaton A; = (Z, S;, fi, Aj, oj), i= 1, 2. Let 


A=(,S, x S,f,A,o), 
where 


fUS1, S2), x) = fii, x), f2(S2, x) 
A = {(Aj, A2) | Ay € Aj and Az € Ap} 
o = (01, 02). 
Show that Ac(A) = LZ; NL». 


Let L; be the set of strings accepted by the finite-state 
automaton A; = (Z, S;, fj, Aj, a7), i= 1, 2. Let 


A=(,8| x &.f,A,o), 
where 


FCUS1, 82), x) = Fi (S1, x), f2(S2, x) 
A = {(A1, Az) | At € Aj or Az € Ap} 


o = (01, 02). 


Show that Ac(A) = LZ; UL». 


In Exercises 38-42, let L; = Ac(Aj), i = 1,2. Draw the transi- 
tion diagrams of the finite-state automata that accept L, 1 Lz and 
L, Ul». 

38. Aj given by Exercise 4; Az given by Exercise 6 


39. 
40. 
41. 
42. 


A, given by Exercise 4; Az given by Exercise 5 
A, given by Exercise 6; Az given by Exercise 5 
A, given by Exercise 5; A> given by Exercise 5 


A; given by Figure 12.5.7, Section 12.5; Az given by 
Exercise 5 


12.3 Languages and Grammars 


According to Merriam-Webster’s Collegiate® Dictionary, language is “the words, their 
pronunciation, and the methods of combining them used and understood by a commu- 
nity.”’ Such languages are often called natural languages to distinguish them from for- 
mal languages, which are used to model natural languages and to communicate with 
computers. The rules of a natural language are very complex and difficult to character- 
ize completely. On the other hand, it is possible to specify completely the rules by which 
certain formal languages are constructed. We begin with the definition of a formal lan- 


guage. 


iBy Permission. From Merriam-Webster’s Collegiate® Dictionary, 11th Edition ©2016 Merriam Webster, 
Inc. (www.Merriam-Webster.com). 
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Example 12.3.2 


Example 12.3.4 


Definition 12.3.1 >» Let A be a finite set. A (formal) language L over A is a 
subset of A*, the set of all strings over A. < 


Let A = {a, b}. The set L of all strings over A containing an odd number of a’s is a 
language over A. As we saw in Example 12.2.9, L is precisely the set of strings over A 
accepted by the finite-state automaton of Figure 12.2.7. < 


One way to define a language is to give a list of rules that the language is assumed 


to obey. 


Definition 12.3.3 > A phrase-structure grammar (or, simply, grammar) G con- 
sists of 


(a) A finite set N of nonterminal symbols 

(b) A finite set T of terminal symbols where NOT = @ 

(c) A finite subset P of [(N U T)* — T*] x (N UT)*, called the set of productions 
(d) A starting symbolo € N. 


We write G = (N,T,P, 0). < 


A production (A, B) € P is usually written 
A> B. 


Definition 12.3.3(c) states that in the production A > B, A € (N UT)* — T* and 
B e€ (NUT)*; thus A must include at least one nonterminal symbol, whereas B can 
consist of any combination of nonterminal and terminal symbols. 


Let 
N = {o, S} 
T = {a, b} 
P= {o — bo,o — aS, S —> bS, S — b}. 
Then G = (N, T, P, ©) is a grammar. < 


Given a grammar G, we can construct a language L(G) from G by using the produc- 
tions to derive the strings that make up L(G). The idea is to start with the starting symbol 
and then repeatedly use productions until a string of terminal symbols is obtained. The 
language L(G) is the set of all such strings obtained. Definition 12.3.5 gives the formal 
details. 


Definition 12.3.5 >» Let G=(N,T, P,o) bea grammar. 
Ifa — Bis aproduction and xwy € (NUT)*, we say that xBy is directly derivable 
from xay and write 


xay => xBy. 


Ifa; € (NUT)* fori =1,...,”, and a+; is directly derivable from a; fori = 1,..., 
n — 1, we say that a, is derivable from a, and write 


a) => Qn. 
We call 
a> a> San 


the derivation of a, (from a). By convention, any element of (NV UT)* is derivable from 
itself. 


Example 12.3.6 


Go Online 

For more on Backus 
normal form, see 
goo.gl/VVncUc 


Example 12.3.7 
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The language generated by G, written L(G), consists of all strings over T derivable 
from o. < 


Let G be the grammar of Example 12.3.4. 
The string abSbb is directly derivable from aSbb, written 


aSbb = abSbb, 


by using the production S > DS. 
The string bbab is derivable from o, written 


o => bbab. 


The derivation is 


o => bo = bbo bbaS = bbab. 


The only derivations from o are 


o => bo 
=> b'o n>0 
=> b"aS 
=> b"ab""'§ 
=> b'ab” n>0, m>1. 


Thus L(G) consists of the strings over {a,b} containing precisely one a that end 
with b. < 


An alternative way to state the productions of a grammar is by using Backus 
normal form (or Backus-Naur form or BNF). In BNF the nonterminal symbols typ- 
ically begin with “<” and end with “>.” The production S — T is written S ::= T. 
Productions of the form 


SS Tine StS. -sseg SST, 
may be combined as 
S2:= 7, | To |---| Th. 


The bar “|” is read “or.” 


A Grammar for Integers An integer is defined as a string consisting of an optional sign 
(+ or —) followed by a string of digits (0 through 9). The following grammar generates 
all integers. 


< digit > ::=0]1/2/3|/4|5|6|7|8|9 


< integer > ::= < signed integer >|< unsigned integer > 


< signed integer > ::= + < unsigned integer >| — < unsigned integer > 


< unsigned integer > ::= < digit >|< digit >< unsigned integer > 


The starting symbol is < integer >. 
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For example, the derivation of the integer —901 is 


< integer > > < signed integer > 
=> — < unsigned integer > 
=> — < digit >< unsigned integer > 
=> — < digit >< digit >< unsigned integer > 
=> — < digit >< digit >< digit > 
= —9 < digit >< digit > 
= —90 < digit > 
= —901. 


In the notation of Definition 12.3.3, this language consists of 


1. The set VN = {< digit >, < integer >, < signed integer >, < unsigned integer >} 
of nonterminal symbols 


2. The set T = {0, 1, 2,3, 4,5, 6, 7, 8,9, +, —} of terminal symbols 


3. The productions 


< digit > > 0,..., < digit> > 9 
< integer > — < signed integer > 
< integer > — < unsigned integer > 
< signed integer > — + < unsigned integer > 
< signed integer > — — < unsigned integer > 
< unsigned integer > — < digit > 
< unsigned integer > — < digit >< unsigned integer > 


4. The starting symbol < integer >. < 


Computer languages, such as Java and C++, are typically specified in BNF. 
Example 12.3.7 shows how an integer constant in a computer language might be speci- 
fied in BNF. 

Grammars are classified according to the types of productions that define the 
grammars. 


Definition 12.3.8 >» Let Gbea grammar and let A denote the null string. 


(a) If every production is of the form 
aAB > adp, wherea,B € (NUT)*, AEN, 
5 € (NUT)* — {A}, (12.3.1) 


we call G a context-sensitive (or type 1) grammar. 


(b) If every production is of the form 
A> 6, whereA Ee N, S5€(NUT)*, (12.3.2) 


we call G a context-free (or type 2) grammar. 


(c) If every production is of the form 
A->aorA—>aBorA—>di, whereA,BeEN, aeT, 


we call G a regular (or type 3) grammar. < 


Example 12.3.9 


Example 12.3.11 


Example 12.3.12 
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According to (12.3.1), in a context-sensitive grammar, we may replace A by 4 if 
A is in the context of a and £. In a context-free grammar, (12.3.2) states that we may 
replace A by 6 anytime. A regular grammar has especially simple substitution rules: We 
replace a nonterminal symbol by a terminal symbol, by a terminal symbol followed by 
a nonterminal symbol, or by the null string. 

Notice that a regular grammar is a context-free grammar and that a context-free 
grammar with no productions of the form A — A is a context-sensitive grammar. 

Some definitions allow a to be replaced by a string of terminals in Defini- 
tion 12.3.8(c); however, it can be shown (see Exercise 32) that the two definitions pro- 
duce the same languages. 


The grammar G defined by 
T = {a,b,c}, N = {o,A, B, C, D, E}, 
with productions 


o — aAB, o — aB, A— aAC, A> aC, B- De, 
D— b, CD —> CE, CE — DE, DE > DC, Cc > Dec 


and starting symbol o is context-sensitive. For example, the production CE — DE says 
that we can replace C by D if C is followed by E, and the production Cc — Dec says 
that we can replace C by Dc if C is followed by c. 

We can derive DC from CD since 


CD => CE => DE => DC. 


The string ab°c is in L(G), since we have 


o = aAB => aaACB aaaCCDc => aaaDCCc => aaaDCDcc 
= aaaDDCcc => aaaDDDccc => aaabbbccc. 


It can be shown (see Exercise 33) that 


L(G) = {a"b'c |n =1,2,...}. < 


It is natural to allow the language L(G) to inherit a property of a grammar G. The 
next definition makes this concept precise. 


Definition 12.3.10 > A language L is context-sensitive (respectively, context- 
free, regular) if there is a context-sensitive (respectively, context-free, regular) grammar 
G with L = L(G). < 


According to Example 12.3.9, the language 
L={a ble | n=1, 2; 3.2.3 
is context-sensitive. It can be shown (see [Hopcroft]) that there is no context-free gram- 
mar G with L = L(G); hence L is not a context-free language. < 
The grammar G defined by 
T = {a, b} and N = {o} 
with productions 


o — aobando — ab 
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Example 12.3.13 


Example 12.3.14 


Example 12.3.16 


Go Online 

For more on Lindenmayer 
grammars, see 
goo.gl/VVncUc 


and starting symbol o is context-free. The only derivations of o are 


o =>aob 
=> adap"! 
=> a’ abb"—! = q"p" 


Thus L(G) consists of the strings over {a, b} of the form a"b",n = 1,2.... This lan- 
guage is context-free. In Section 12.5 (see Example 12.5.6), we will show that L(G) is 
not regular. < 


It follows from Examples 12.3.11 and 12.3.12 that the set of context-free 
languages that do not contain the null string is a proper subset of the set of context- 
sensitive languages and that the set of regular languages is a proper subset of the set 
of context-free languages. It can also be shown that there are languages that are not 
context-sensitive. 


The grammar G defined in Example 12.3.4 is regular. Thus the language 
L(G) = {b"ab” |n=0,1,...;m=1,2,...} 


it generates is regular. < 


The grammar of Example 12.3.7 is context-free but not regular. However, if we change 
the productions to 


< integer > ::= + < unsigned integer >| — < unsigned integer >| 
0 < digits >| 1 < digits >| --- | 9 < digits > 
< unsigned integer > ::= 0 < digits >| 1 < digits >| --- | 9 < digits > 
< digits > ::= 0 < digits >| 1 < digits >| --- | 9 < digits >| A, 


the resulting grammar is regular. Since the language generated is unchanged, it follows 
that the set of strings representing integers is a regular language. < 


Example 12.3.14 motivates the following definition. 
Definition 12.3.15 >» Grammars G and G’ are equivalent if L(G) = L(G’). 4 


The grammars of Examples 12.3.7 and 12.3.14 are equivalent. < 


If we define a relation R on a set of grammars by the rule GRG’ if G and G’ 
are equivalent (in the sense of Definition 12.3.15), R is an equivalence relation. Each 
equivalence class consists of a set of mutually equivalent grammars. 

We close this section by briefly introducing another kind of grammar that can be 
used to generate fractal curves. 


Definition 12.3.17 » A context-free interactive Lindenmayer grammar con- 
sists of 


(a) A finite set N of nonterminal symbols 
(b) A finite set T of terminal symbols where NOT = @ 


Example 12.3.19 


Go Online 
A biography of von Koch 


is at goo. gl/VVncUc 
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(c) A finite set P of productions A > B, where A € N UT and B € (NUT)* 
(d) A starting symbolo € N. < 


The difference between a context-free interactive Lindenmayer grammar and a 
context-free grammar is that a context-free interactive Lindenmayer grammar allows 
productions of the form A — B, where A is a terminal or a nonterminal. (In a context- 
free grammar, A must be a nonterminal.) 

The rules for deriving strings in a context-free interactive Lindenmayer grammar 
are different from the rules for deriving strings in a phrase-structure grammar (see Def- 
inition 12.3.5). In a context-free interactive Lindenmayer grammar, to derive the string 
B from the string a, all symbols in w must be replaced simultaneously. The formal def- 
inition follows. 


Definition 12.3.18 >» Let G = (N,T, P,c) be a context-free interactive Lin- 
denmayer grammar. If 


=X +++ Xy 
and there are productions 

Xi > Bi 
in P, fori=1,...,n, we write 

a => Bi--- Bn 

and say that £; --- 6, is directly derivable from a. If a4; is directly derivable from a; 
fori=1,...,n—1, we say that a, is derivable from a, and write 

a) > Qn. 
We call 


>>: San 


the derivation of a, (from a). The language generated by G, written L(G), consists of 
all strings over T derivable from o. < 


The von Koch Snowflake 


Let 
N = {D} 
T = {d,+,—} 
P={D—> D-D++4+D-D,D—>d,4+—>4, > —}. 


We regard G(N, T, P, D) as a context-free Lindenmayer grammar. As an example of a 
derivation from D, we have 


D>D-D++D-D>d-d++d-d. 


Thus d—-d++d—deL(G). 

We now impose a meaning on the strings in L(G). We interpret the symbol d as a 
command to draw a straight line of a fixed length in the current direction; we interpret 
+ as a command to turn right by 60°; and we interpret — as a command to turn left by 
60°. If we begin at the left and the first move is horizontal to the right, when the string 
d—d-+-+d — dis interpreted, we obtain the curve shown in Figure 12.3.1(a). 

The next-longest string in L(G) is 


d—d++d—d—d—d++d—d++d—d++d—d—d—d++d—d, 
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(a) (b) 
(c) (d) 
(e) 
Figure 12.3.1 von Koch snowflakes. 
whose derivation is 
D>D-—D++D-D 
= D—-D++D—-D—-D—D++D—D++4+D 
D++D—-D—D—-D D—D 
=>d-—d++d-—d—d—d++d-—d++4+d 


i 
d++d—d—d—d++d-d. 


No shorter string is possible because all symbols must be replaced simultaneously using 
productions (Definition 12.3.18). If we replace some D’s by d and other D’s by 
D—D++D — D, we cannot derive any string from the resulting string, let alone a 
terminal string, since d does not occur on the left side of any production. 

When the string 


d—d++d—d—d—d++d—d++d—d++d—d—d—d++d—d 


is interpreted, we obtain the curve shown in Figure 12.3.1(b). 
The curves obtained by interpreting the next-longest strings in L(G) are shown in 
Figure 12.3.1(c)-(e). These curves are known as von Koch snowflakes. < 


Curves such as the von Koch snowflake are called fractal curves (see [Peitgen]). 
A characteristic of fractal curves is that a part of the whole resembles the whole. For 
example, as shown in Figure 12.3.2, when the part of the von Koch snowflake indicated 
is extracted and enlarged, it resembles the original. 

Context-free and context-sensitive interactive Lindenmayer grammars were 
invented in 1968 by A. Lindenmayer (see [Lindenmayer]) to model the growth of plants. 
As Example 12.3.19 suggests, these grammars can be used in computer graphics to 
generate images (see [Prusinkiewicz 1986, 1988; Smith]). It can be shown (see [Wood, 
p. 503]) that the class of languages generated by context-sensitive Lindenmayer gram- 
mars is exactly the same as the class of languages generated by phrase-structure 
grammars. 
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Figure 12.3.2 The fractal nature of the 
von Koch snowflake. When the top part of 
the von Koch snowflake is extracted and 
enlarged, it resembles the original. 
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12.3 Review Exercises 


10. 


1. Contrast natural and formal languages. 
2. Define phrase-structure grammar. 

3. What is a directly derivable string? 

4. What is a derivable string? 

3. 
6 
7 
8. 
9 


What is a derivation? 


. What is the language generated by a grammar? 
. What is Backus normal form? 
. Define context-sensitive grammar. 


. Define context-free grammar. 


Define regular grammar. 


11. Which grammar is equivalent to a type | grammar? 
12. Which grammar is equivalent to a type 2 grammar? 
13. Which grammar is equivalent to a type 3 grammar? 
14. Define context-sensitive language. 

15. Define context-free language. 


16. Define regular language. 


17. What is a context-free interactive Lindenmayer grammar? 


18. How is the von Koch snowflake generated? 


19. What is a fractal curve? 


12.3 Exercises 


In Exercises 1-6, determine whether the given grammar is context- 
sensitive, context-free, regular, or none of these. Give all charac- 


terizations that apply. 


1. 


2. 


T = {a, b}, N = {o, A}, with productions 


o => bo, o> aA, A— ao, 
A — DA, A> a, o—>b, 
and starting symbol o. 
T = {a, b}, N = {o, A, B}, with productions 
o >A, o — AAB, Aa — ABa, 
A — aa, Bb — ABb, AB — ABB, 
B= b, 


and starting symbol o. 


3. T = {a,b,c}, N = {o, A, B}, with productions 


o — AB, 
B= Bb, 


AB — BA, 
Aa, 


A— aA, 
Bb, 


and starting symbol o. 


4. T = {a,b,c}, N = {o, A, B}, with productions 


o — BAB, o — ABA, A— AB, 
B= BA, A> aA, A— ab, 
B= b, 


and starting symbol o. 
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5. <S>::=b<S>|a<A>la 
<A>::=a<S>|b<B> 
<B>::=b<A>|a<S>|b 


with starting symbol <S>. 
6. T = {a,b}, N = {o, A, B}, with productions 


o — AAo, AA > B, B= bB, A->a 


and starting symbol o. 

In Exercises 7-11, show that the given string a is in L(G) for the 
given grammar G by giving a derivation of a. 

7. bbabbab, Exercise 1 

8. aabbaab, Exercise 2 

9. abab, Exercise 3 
10. abbbaabab, Exercise 4 
11. abaabbabba, Exercise 5 


12. Write the grammars of Examples 12.3.4 and 12.3.9 and Exer- 
cises 1-4 and 6 in BNF. 


Let G be the grammar of Exercise 1. Show that a € L(G) if 
and only if a is nonnull and contains an even number of a’s. 


*14. Let G be the grammar of Exercise 5. Characterize L(G). 


«13. 


In Exercises 15-24, write a grammar that generates the strings 
having the given property. 
15. Strings over {a, b} starting with a 
16. Strings over {a, b} containing ba 
17. Strings over {a, b} ending with ba 
«18. 
19. Integers with no leading 0’s 
20. Floating-point numbers (numbers such as .294, 89., 67.284) 


21. Exponential numbers (numbers including floating-point num- 
bers and numbers such as 6.9E3, 8E12, 9.6E—4, 9E-10) 


22. All strings over {a, b} 


Strings over {a, b} not ending with ab 


23. Boolean expressions in Xj, ..., Xn 

24. Strings x; -- +X, over {a, b} with xy ++ +X) = Xp_-- +X 

Each grammar in Exercises 25-31 is proposed as generating the 
set L of strings over {a, b} that contain equal numbers of a’s and 
b’s. If the grammar generates L, prove that it does so. If the gram- 
mar does not generate L, give a counterexample and prove that 
your counterexample is correct. In each grammar, S is the starting 
symbol. 

25. S — aSb | bSa|z 

26. S— aB|bA|24,B > b| bA,A > a|aB 

27. S — aSb | bSa| SS | A 

28. S — abS | baS | aSb | bSa |x 

29. S — aB | bA,A > a| SA,B— b| SB 

30. S — aSb | bSa | abS | baS | Sab | Sha |x 
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31. S — aSbS | bSaS | 2 


«32. Let G be a grammar and let A denote the null string. Show that 
if every production is of the form 


A->aorA>aBorA—> A, 
whereA,BEN, a €T*— {A}, 


there is a regular grammar G’ with L(G) = L(G’). 


x33. Let G be the grammar of Example 12.3.9. Show that 


L(G) = {a"b"c" |n=1,2,...}. 
34. Show that the language 
{a"b"ck [nk € {1,2,...}} 
is a context-free language. 
35. Let 


N = {S, D} 

T = {d,+, -} 

P={S>D+D+D+4D, 
D> D+D—-D—-DD+D+4+D-D\|d, 
+>+4+,-—-> -}. 


Regard G = (N, T, P, S) as acontext-free Lindenmayer gram- 
mar. Interpret the symbol d as a command to draw a straight 
line of a fixed length in the current direction; interpret + as a 
command to turn right by 90°; and interpret — as a command 
to turn left by 90°. Generate the two smallest strings in L(G) 
and draw the corresponding curves. These curves are known 
as quadratic Koch islands. 


x36. The following figure 


shows the first three stages of the Hilbert curve. Define a 
context-free Lindenmayer grammar that generates strings that 
when appropriately interpreted generate the Hilbert curve. 


37. This exercise assumes familiarity with musical notation. 

Pictures such as those in Exercise 36 that consist of only 
horizontal and vertical lines can be interpreted as music. An 
arbitrary starting note is chosen. Thereafter, when we follow 
the curve, the length of a horizontal segment determines the 
duration of the note, and the length of a vertical segment tells 
how to change the pitch. Following an n-unit vertical line up 
is interpreted as changing the pitch by moving up v half-steps. 
Following an n-unit vertical line down is interpreted as chang- 
ing the pitch by moving down n half-steps. 

Write out the music corresponding to the second figure 
in Exercise 36. Assume that we start at the lower-left of the 
figure and that C is the first note. Assume also that the first 
horizontal segment is two units long, which is interpreted as a 
quarter note. 

For more on mathematics and music, see [Harkleroad]. 


12.4 


Example 12.4.1 


Theorem 12.4.2 
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Nondeterministic Finite-State Automata 


In this section and the next, we show that regular grammars and finite-state automata 
are essentially the same in that either is a specification of a regular language. We begin 
with an example that illustrates how we can convert a finite-state automaton to a regular 
grammar. 


Write the regular grammar given by the finite-state automaton of Figure 12.2.7. 


SOLUTION The terminal symbols are the input symbols {a,b}. The states E and O 
become the nonterminal symbols. The initial state E becomes the starting symbol. The 
productions correspond to the directed edges. If there is an edge labeled x from S to S’, 
we write the production S —> x5’. In our case, we obtain the productions 


E— bE, E— ao, O- aE, O— bO. (12.4.1) 


In addition, if S is an accepting state, we include the production S — 2. In our case, we 
obtain the additional production 


Ok. (12.4.2) 


Then the grammar G = (N, T, P, E), with N = {O, E}, T = {a, b}, and P consisting of 
the productions (12.4.1) and (12.4.2), generates the language L(G), which is the same 
as the set of strings accepted by the finite-state automaton of Figure 12.2.7. < 


Let A be a finite-state automaton given as a transition diagram. Let o be the initial 
state. Let T be the set of input symbols and let N be the set of states. Let P be the set 
of productions 


S > x5’ 
if there is an edge labeled x from S to S’ and 
Sy 
if S is an accepting state. Let G be the regular grammar 
G= (OM, UIP) 
Then the set of strings accepted by A is equal to L(G). 


Proof First we show that Ac(A) C L(G). Let a € Ac(A). If is the null string, 
then o is an accepting state. In this case, G contains the production o — A. The 
derivation 


o=>k (12.4.3) 


shows that a € L(G). 

Now suppose a € Ac(A) and q@ is not the null string. Then a = x,---x, for 
some x; € T. Since @ is accepted by A, there is a path (o, S1,...,S,), where S, is an 
accepting state, with edges successively labeled x, ...,x,. It follows that G contains 
the productions 


OS x18] 


Si) > x;S; fori=2,...,n. 
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Example 12.4.3 


b b 


Figure 12.4.1 The graph 
corresponding to the productions 
ao > bo,o > aC,C > bc. 


b b 
by a by b 
Figure 12.4.2 The 
nondeterministic finite-state 
automaton corresponding to the 


grammar o — bo,o — aC, 
C—> bC,C > b. 


Since S,, is an accepting state, G also contains the production S,, + 2. The derivation 


o => x5) 
=> X1X72S9 
=> X1°° *XnSn 
hoch (12.4.4) 


shows that a € L(G). 

We complete the proof by showing that L(G) C Ac(A). Suppose that a € L(G). If 
a is the null string, w must result from the derivation (12.4.3) since a derivation that starts 
with any other production would yield a nonnull string. Thus the production o — 2 is 
in the grammar. Therefore, o is an accepting state in A. It follows that a € Ac(A). 

Now suppose a € L(G) and a is not the null string. Then aw = x, ---x, for some 
x; € T. It follows that there is a derivation of the form (12.4.4). If, in the transition 
diagram, we begin at o and trace the path (a, S},..., S;,), we can generate the string a. 
The last production used in (12.4.4) is S, — A; thus the last state reached is an accepting 
state. Therefore, a is accepted by A, so L(G) C Ac(A). The proof is complete. < 


Next, we consider the reverse situation. Given a regular grammar G, we want to 
construct a finite-state automaton A so that L(G) is precisely the set of strings accepted 
by A. It might seem, at first glance, that we can simply reverse the procedure of Theo- 
rem 12.4.2. However, the next example shows that the situation is a bit more complex. 


Consider the regular grammar defined by T = {a, b} and N = {o, C}, with productions 


ao > bo, o > aC, C> dC, C> bd, 


and starting symbol o. 
The nonterminals become states with o as the initial state. For each production of 
the form S — x5’, we draw an edge from state S to state S’ and label it x. The productions 


o — bo, o > aC, andC > bC 


give the graph shown in Figure 12.4.1. The production C — b is equivalent to the two 
productions 


C > bF and F => i, 
where F is an additional nonterminal symbol. The productions 
o — bo, ao > aC, C—> bC, C—bF 


give the graph shown in Figure 12.4.2. The production F — A tells us that F should be 
an accepting state (see Figure 12.4.2). < 


Unfortunately, the graph of Figure 12.4.2 is not a finite-state automaton. There 
are several problems. Vertex C has no outgoing edge labeled a, and vertex F has no 
outgoing edges at all. Also, vertex C has two outgoing edges labeled b. A diagram such 
as that of Figure 12.4.2 defines another kind of automaton called a nondeterministic 
finite-state automaton. The reason for the word “nondeterministic” is that when we are 
in a state where there are multiple outgoing edges all having the same label x, if x is 
input the situation is nondeterministic—we have a choice of next states. For example, if 
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in Figure 12.4.2 we are in state C and 5 is input, we have a choice of next states—we 
can either remain in state C or go to state F. 


Definition 12.4.4 >» A nondeterministic finite-state automaton A consists of 


(a) A finite set Z of input symbols 


(b) A finite set S of states 


(c) A next-state function f from S x T into P(S) 
(d) A subset A of S of accepting states 


(e) An initial stateo € S. 


We write A = (Z, S, f, A, o). 


< 


The only difference between a nondeterministic finite-state automaton and a finite- 
state automaton is that in a finite-state automaton the next-state function takes us to a 
uniquely defined state, whereas in a nondeterministic finite-state automaton the next- 


state function takes us to a set of states. 


Example 12.4.5 For the nondeterministic finite-state automaton of Figure 12.4.2, we have 


T = {a, b}, 


S= {o,C, F}, 


A= {F}. 


The initial state is o and the next-state function f is given by 


ff 
£ a b 
S 
o {C} {o} 
Cc @ {C, F} 
F ia) i) 


< 


We draw the transition diagram of a nondeterministic finite-state automaton sim- 
ilarly to that of a finite-state automaton. We draw an edge from state S to each state in 
the set f(S, x) and label each x. 


Example 12.4.6 The transition diagram of the nondeterministic finite-state automaton 


T= {a, dj, 


with initial state o and next-state function 


is shown in Figure 12.4.3. 


S = {o, C, D}, A= {C,D} 
f 
Z a b 
Ss 
o {o, C} {D} 
€ @ {C} 
D {C, D} ia 
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Example 12.4.8 


Example 12.4.9 


Example 12.4.10 


a b 


a a 4 } 
©) ©) ; 


b 


Figure 12.4.3 The transition diagram of the 
nondeterministic finite-state automaton of 
Example 12.4.6. < 


A string @ is accepted by a nondeterministic finite-state automaton A if there is 
some path representing @ in the transition diagram of A beginning at the initial state and 
ending in an accepting state. The formal definition follows. 


Definition 12.4.7 >» LetA = (Z,S,f,A,c) bea nondeterministic finite-state 
automaton. The null string is accepted by A if and only ifo € A. Ifa = x,---x, isa 


nonnull string over Z and there exist states 00, . .. , , Satisfying the following conditions: 
(a) 99 =o 
(b) 0; € f(Oi-1, Xj) fori=1,...,n 
(Cc) On € A, 


we say that a is accepted by A. We let Ac(A) denote the set of strings accepted by A and 
we say that A accepts Ac(A). 

If A and A’ are nondeterministic finite-state automata and Ac(A) = Ac(A’), we 
say that A and A’ are equivalent. 

If ~@ = x, --+-X, iS a string over Z and there exist states 09, ..., 0, Satisfying con- 
ditions (a) and (b), we call the path (09, ..., 0) a path representing a in A. < 


The string a = bbabb is accepted by the nondeterministic finite-state automaton of 
Figure 12.4.2, since the path (o, 0, 0, C, C, F), which ends at an accepting state, repre- 
sents a. Notice that the path P = (o, 0, 0, C, C, C) also represents @ but that P does not 
end at an accepting state. Nevertheless, the string a is accepted because there is at least 
one path representing a that ends at an accepting state. A string f will fail to be accepted 
if no path represents 6 or every path representing 6 ends at a nonaccepting state. < 


The string @ = aabaabbb is accepted by the nondeterministic finite-state automaton of 
Figure 12.4.3. The reader should locate the path representing a, which ends at 
state C. < 


The string ~ = abba is not accepted by the nondeterministic finite-state automaton of 
Figure 12.4.3. Starting at o, when we input a, there are two choices: Go to C or remain 
at o. If we go to C, when we input two b’s, our moves are determined and we remain 
at C. But now when we input the final a, there is no edge along which to move. On the 
other hand, suppose that when we input the first a, we remain at 0. Then, when we input 
b, we move to D. But now when we input the next b, there is no edge along which to 
move. Since there is no path representing a in Figure 12.4.3, the string a is not accepted 
by the nondeterministic finite-state automaton of Figure 12.4.3. < 


We formulate the construction of Example 12.4.3 as a theorem. 


Theorem 12.4.11 


Example 12.4.12 


()" 
©--© 
Figure 12.4.4 The 
transition diagram 


corresponding to the regular 
grammar of Example 12.4.12. 
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Let G = (N,T, P, 0) be a regular grammar. Let 
c= 
S=NU{F}, where F ¢é NUT 
f(S,x) = {S' |S > x8’ € P}U{F|S>xeP} 
A= {FJ} U{S|S—>A€ P}. 


Then the nondeterministic finite-state automaton A = (Z, S,f,.A,o) accepts pre- 
cisely the strings L(G). 


Proof The proof is essentially the same as the proof of Theorem 12.4.2 and is there- 
fore omitted. < 


Consider the regular grammar G defined by T = {a, b} and N = {5S} with productions 
S—+iA, S— b, andS > aS, 


and starting symbol S. Construct the nondeterministic finite-state automaton given by 
Theorem 12.4.11 that accepts the strings L(G). 


SOLUTION The set Z of input symbols is the same as the set {a, b} of terminal symbols. 
The set S of states is the set {S, F} of nonterminal symbols together with a new state F, 
which is also an accepting state. The initial state is S, the starting symbol. 

The production S — A means that S is an accepting state. Thus the set A of 
accepting states is {S, F}. The production § — b yields a transition from state S to state 
F on input symbol b. The production § — aS yields a transition from state S to state S 
on input symbol a. We obtain the transition diagram shown in Figure 12.4.4. < 


It may seem that a nondeterministic finite-state automaton is a more general con- 
cept than a finite-state automaton; however, in the next section we will show that given a 
nondeterministic finite-state automaton A, we can construct a finite-state automaton that 
is equivalent to A. 


12.4 Review Exercises 


= 


. Given a finite-state automaton A, how can we construct a regu- 4. What are equivalent nondeterministic finite-state automata? 


lar grammar G so that the set of strings accepted by A is equal 


to the language generated by G? 


wi N 


istic finite-state automaton? 


5. Given a regular grammar G, how can we construct a nondeter- 
ministic finite-state automaton A so that the language generated 


. What is a nondeterministic finite-state automaton? by G is equal to the set of strings accepted by A? 


. What does it mean for a string to be accepted by a nondetermin- 


12.4 Exercises 


In Exercises 1-5, draw the transition diagram of the nondetermin- 
istic finite-state automaton (Z, S,f, A, 0). 


1. T= {a, b}, S = {o0, 01, 02}, A = {00} 


L a b 
S 
00 @ {o1, 02} 
O1 {o2} {o0, o1} 
02 {oo} © 
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2. T= {a, b}, S = {o0, 01, 02, 03}, A = {oy} 


we a b 
S 


00 @ {o3} 
oO! {o1, o2} {o3} 
02 © {00, 01, 03} 
03 @ D 


3. T= {a,b}, S = {o0, 01, 02}, A = {o0, o1} 


8 a b 
S 
00 {oi} {o0, 02} 
coal @ {o2} 
02 {oi} @ 


4. T= {a,b,c}, S = {o0, 01, 02}, A = {a0} 


we a b c 
S 
90 {or} 2) @ 
onl {oo} {o2}  {o0, o2} 
02 {o0,01,02} {oo} {oo} 


8. 


5. T= {a, b, c}, S= {00, 01, 62, 03}, A = {00, 03} 


a a b ¢ 
S 
00 {oi} {00, 01, 03} © 
oO} {o2, 03} © @ 
o @ {00,03} — {o1, o2} 
03 12] 12) {oo} 


For each nondeterministic finite-state automaton in Exercises 
6-10, find the sets T, S, and A, the initial state, and the table defin- 
ing the next-state function. 


6. 


O90 -@ 


6 oo 


11. Write the regular grammars given by the finite-state automata 


of Exercises 4-9, Section 12.2. 


12. Is the string bbabbb accepted by the nondeterministic finite- 
state automaton of Figure 12.4.2? Prove your answer. 


13. 


oe +L 


Represent the grammars of Exercises | and 5, Section 
12.3, and Example 12.3.14 by nondeterministic finite-state 
automata. 


14. Is the string bbabab accepted by the nondeterministic finite- 


state automaton of Figure 12.4.2? Prove your answer. 


15. 


16. 


17. 


18. 


19. 


x20. 


12.5 ® Relationships Between Languages and Automata 


Is the string aaabba accepted by the nondeterminis- 
tic finite-state automaton of Figure 12.4.3? Prove your 
answer. 

Show that a string a over {a, b} is accepted by the nondeter- 
ministic finite-state automaton of Figure 12.4.2 if and only if 
a contains exactly one a and ends with b. 

Is the string aaaab accepted by the nondeterministic finite- 
state automaton of Figure 12.4.3? Prove your answer. 
Characterize the strings accepted by the nondeterministic 
finite-state automaton of Figure 12.4.3. 

Show that the strings accepted by the nondeterministic finite- 
state automaton of Exercise 10 are precisely those strings over 
{a, b} that end bab. 

Characterize the strings accepted by the nondeterministic 
finite-state automata of Exercises 1-9. 


12.5 
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Design nondeterministic finite-state automata that accept the 
strings over {a,b} having the properties specified in Exercises 
21-29. 


21. 
22. 
23. 
*24, 
25. 
26. 
*27. 
28. 
*29, 
30. 


and Automata 


Starting either abb or ba 

Containing either abb or ba 

Ending either abb or ba 

Containing bab and bb 

Starting with abb and ending with ab 

Having each b preceded and followed by an a 
Starting with ab but not ending with ab 

Not containing ba or bbb 

Not containing abba or bbb 


Write regular grammars that generate the strings of Exercises 
21-29. 


Relationships Between Languages 


In the preceding section we showed (Theorem 12.4.2) that if A is a finite-state 
automaton, there exists a regular grammar G, with L(G) = Ac(A). As a partial con- 
verse, we showed (Theorem 12.4.11) that if G is a regular grammar, there exists a non- 
deterministic finite-state automaton A with L(G) = Ac(A). In this section we show 
(Theorem 12.5.4) that if G is a regular grammar, there exists a finite-state automaton A 
with L(G) = Ac(A). This result will be deduced from Theorem 12.4.11 by showing that 
any nondeterministic finite-state automaton can be converted to an equivalent finite-state 
automaton (Theorem 12.5.3). We will first illustrate the method by an example. 


Example 12.5.1 
TT Of Figure 12.4.2. 


Find a finite-state automaton equivalent to the nondeterministic finite-state automaton 


SOLUTION The set of input symbols is unchanged. The states consist of all subsets 


@, {o}, 


{C}. 


{F}, 


{o,C}, {o,F}, {C.F}, {o,C, F} 


of the original set S = {o, C, F} of states. The initial state is {0}. The accepting states 


are all subsets 


{F}, 


{o, F}, {C.F}, {o,C, F} 


of S that contain an accepting state of the original nondeterministic finite-state 


automaton. 


An edge is drawn from X to Y and labeled x if X = @ = Y or if 


LJre.a =. 


SEX 


We obtain the finite-state automaton of Figure 12.5.1. The states 


{o, F}, 


{o,C}, {o,C,F}, (FI, 


which can never be reached, can be deleted. Thus we obtain the simplified, equivalent 
finite-state automaton of Figure 12.5.2. 
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Figure 12.5.1 A finite-state automaton equivalent to the nondeterministic 
finite-state automaton of Figure 12.4.2. 


Figure 12.5.2 A simplified version of Figure 12.5.1 (with unreachable states 
deleted). 4 


Example 12.5.2 The finite-state automaton equivalent to the nondeterministic finite-state automaton of 
——==——= Example 12.4.6 is shown in Figure 12.5.3. 


Figure 12.5.3 A finite-state automaton equivalent to the nondeterministic 
finite-state automaton of Example 12.4.6. < 


We now formally justify the method of Examples 12.5.1 and 12.5.2. 
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Theorem 12.5.3 Let A = (Z, S, f, A, 7) be a nondeterministic finite-state automaton. Let 


(a) S’ = P(S) 

(b) Z =I 

@© o =e} 

(dd) A ={XCS|XNAZQ} 


(©) #'(%% he ifX=2 
€ 5a) = 
User fi) xX Aor 


Then the finite-state automaton A’ = (Z’, S’, f’, A’, 0’) is equivalent to A. 


Proof Suppose that the string @ = x, -- - x, is accepted by A. Then there exist states 
00, -++,0n € S with 


09 =O 
oj € f(Oj-1, Xi) fori=1,...,n 
on Ee A. 
Set Yo = {oo} and 
Y¥=f(%i-,x)  fori=l,...,n. 


Since 
Y, =f (Yo, x1) =f ({00}, 1) =f (0, *1), 
it follows that 0, € Y;. Now 
02 €f(o1,%2) © LU FS.) =f’ Mi.) = Yo. 
Sey 
Again, 
03 € f (2, x3) © LJ FS, x3) =f’ (Ya, 3) = Ya. 
SEY2 
The argument may be continued (formally, we would use induction) to show that o,, € 
Y,. Since o, is an accepting state in A, Y, is an accepting state in A’. Thus, in A’, 
we have 
fom) =f ox) =" 
f' (11, x2) = Yo 


f' Yn; Xn) = Y,,. 


Therefore, a is accepted by A’. 
Now suppose that the string a = x;---X, is accepted by A’. Then there exist 
subsets Yo,..., Y, of S such that 
Yo =0' = {o} 
f' iis xi) = Yi fori=1,...,n; 


also, there exists a state 0, € Y, A. Since 


On € xp =f (Yn-1, Xn) = U TO, Xn), 


SeY,-1 
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Theorem 12.5.4 


Example 12.5.5 


Example 12.5.6 


there exists 0,_; € Y,—1; with o, € f(O,_1, X,). Similarly, since 


On-1 © Yn-1 = f’ (Yn—2, Xn—-1) = U LS, Xn-1)s 


SEY;-2 


there exists On-2 € Yn—2 with on; € f(On—2, Xn-1). Continuing, we obtain 


0; € Y; fori=0,...,n, 
with 
0; € f (O;-1, Xj) fort = 1y..005 7: 
In particular, 
00 € Yo = {0}. 
Thus o9 = o, the initial state in A. Since o,, is an accepting state in A, the string a is 
accepted by A. < 


The next theorem summarizes these results and those of the preceding section. 


A language L is regular if and only if there exists a finite-state automaton that accepts 
precisely the strings in L. 


Proof This theorem restates Theorems 12.4.2, 12.4.11, and 12.5.3. < 


Find a finite-state automaton A that accepts precisely the strings generated by the regular 
grammar G having productions 


o — bo, o > aC, C= bc, C— b. 


The starting symbol is o, the set of terminal symbols is {a, b}, and the set of nonterminal 
symbols is {o, C}. 


SOLUTION The nondeterministic finite-state automaton A’ that accepts L(G) is shown 
in Figure 12.4.2. A finite-state automaton equivalent to A’ is shown in Figure 12.5.1, 
and an equivalent simplified finite-state automaton A is shown in Figure 12.5.2. The 
finite-state automaton A accepts precisely the strings generated by G. < 


We close this section by giving some applications of the methods and theory we 
have developed. 


A Language That Is Not Regular Show that the language 
L={a"b" |n=1, 2;..+} 
is not regular. 


SOLUTION If Lis regular, there exists a finite-state automaton A such that Ac(A) = L. 
Suppose that A has k states. The string a = a‘b* is accepted by A. Consider the path P, 
which represents a. Since there are k states, some state o is revisited on the part of 
the path representing a‘. Thus there is a cycle C, all of whose edges are labeled a, that 
contains o. We change the path P to obtain a path P’ as follows. When we arrive at o in 
P, we follow C. After returning to o on C, we continue on P to the end. If the length of 
C is j, the path P’ represents the string a’ = a**b*. Since P and P’ end at the same state 
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o’ and o’ is an accepting state, aw’ is accepted by A. This is a contradiction, since a’ is 
not of the form a”b". Therefore, L is not regular. < 


Example 12.5.7 Let L be the set of strings accepted by the finite-state automaton A of Figure 12.5.4. 
TC Construct a finite-state automaton that accepts the strings 


r= {Xp Xy | Xp +++ Xn € Lh. 


Figure 12.5.4 The finite-state automaton for 
Example 12.5.7 that accepts L. 


SOLUTION We want to convert A to a finite-state automaton that accepts L¥. The string 
Q@ = X1---X, is accepted by A if there is a path P in A representing @ that starts at oj 
and ends at 03. If we start at o3 and trace P in reverse, we end at o, and process the 
edges in the order x, ...,.x;. Thus we need only reverse all arrows in Figure 12.5.4 and 
make o3 the starting state and o, the accepting state (see Figure 12.5.5). The result is a 
nondeterministic finite-state automaton that accepts L*. 

After finding an equivalent finite-state automaton and eliminating the unreachable 
states, we obtain the equivalent finite-state automaton of Figure 12.5.6. 


a : b a 
: 
b 
a b 
Figure 12.5.5 A nondeterministic finite-state automaton Figure 12.5.6 A finite-state automaton that 
that accepts L¥. accepts L*. 


< 


Example 12.5.8 Let L be the set of strings accepted by the finite-state automaton A of Figure 12.5.7. 
 _— Construct a nondeterministic finite-state automaton that accepts the strings 


R 
L = {Xp -- x1 | X1p-++Xy EL}. 


SOLUTION If A had only one accepting state, we could use the procedure of Exam- 
ple 12.5.7 to construct the desired nondeterministic finite-state automaton. Thus we first 
construct a nondeterministic finite-state automaton equivalent to A with one accepting 
state. To do this we introduce an additional state o;. Then we arrange for paths termi- 
nating at 03 or o4 to optionally terminate at os (see Figure 12.5.8). The desired non- 
deterministic finite-state automaton is obtained from Figure 12.5.8 by the method of 
Example 12.5.7 (see Figure 12.5.9). Of course, if desired, we could construct an equiv- 
alent finite-state automaton. 
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Figure 12.5.7 The finite-state automaton for 
Example 12.5.8 that accepts L. 


Figure 12.5.8 A nondeterministic Figure 12.5.9 A nondeterministic 
finite-state automaton with one accepting finite-state automaton that accepts L*. 
state equivalent to the finite-state 

automaton of Figure 12.5.7. 


< 


12.5 Review Exercises 


1. Given a nondeterministic finite-state automaton, how can we 2. Give a condition in terms of finite-state automata for a language 
construct an equivalent deterministic finite-state automaton? to be regular. 


12.5 Exercises 


x, Find the finite-state automata equivalent to the nondetermin 5: 28% via weS's|aeds|b=Cs/ 6 
istic finite-state automata of Exercises 1-10, Section 12.4. 
<A>::=b<A>|a<C> 
In Exercises 2-6, find finite-state automata that accept the strings 2he te aehela 
generated by the regular grammars. 
<C> ::=a<B>|a<C> 


2. Grammar of Exercise 1, Section 12.3 


3. Grammar of Exercise 5, Section 12.3 with starting symbol <S> 


4, 
6. <S>::=a<A>|a<B> 

<S>::=a<A>|a<B> 

<A> ::=b<S>|b 
<A> ::=a<B>|b<S>|b 

<B> ::=a<B>|a<C> 
<B> ::=b<S>|b 

<C> ::=a<S>|b<A>|a<C>|a 


with starting symbol <S> with starting symbol <S> 


vis 


«10. 


Find finite-state automata that accept the strings of Exer- 
cises 21-29, Section 12.4. 


. By eliminating unreachable states from the finite-state 


automaton of Figure 12.5.3, find a simpler, equivalent finite- 
state automaton. 


. Show that the nondeterministic finite-state automaton of 


Figure 12.5.5 accepts a string a over {a, b} if and only if a 
begins bb. 

Characterize the strings accepted by the nondeterministic 
finite-state automata of Figures 12.5.7 and 12.5.9. 


In Exercises 11-21, find a nondeterministic finite-state automaton 
that accepts the given set of strings. If S, and S2 are sets of strings, 


we let 
sf = {uju2-+-uy, | uj € Sj,n € {1,2,...}}; 
S,So = {uv | u € Sj, v € So}. 
11. Ac (A), where A is the automaton of Exercise 4, Section 12.2 


12. 
13. 
14. 
15. 
16. 
17. 
18. 


Ac (A)® , where A is the automaton of Exercise 5, Section 12.2 
Ac (A)® , Where A is the automaton of Exercise 6, Section 12.2 
Ac(A)*, where A is the automaton of Exercise 4, Section 12.2 
Ac(A)*, where A is the automaton of Exercise 5, Section 12.2 
Ac(A)*, where A is the automaton of Exercise 6, Section 12.2 
Ac(A)*, where A is the automaton of Figure 12.5.7 


Ac(A;)Ac(A2), where A; is the automaton of Exercise 
4, Section 12.2, and Az is the automaton of Exercise 6, 
Section 12.2 


19. 


20. 


21. 


22. 


23. 


24. 


*25, 


26. 


*27, 


«28. 
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Ac(A;)Ac(A2), where A; is the automaton of Exercise 5, 
Section 12.2, and A» is the automaton of Exercise 6, 
Section 12.2 
Ac(A;)Ac(A;), where A; is the automaton of Exercise 5, 
Section 12.2 
Ac(A,)Ac(A2), where A, is the automaton of Figure 12.5.7 
and A> is the automaton of Exercise 6, Section 12.2 
Find a regular grammar that generates the language L*, where 
L is the language generated by the grammar of Exercise 5, 
Section 12.3. 
Find a regular grammar that generates the language L*, where 
L is the language generated by the grammar of Exercise 5, 
Section 12.3. 
Let L; (respectively, L2) be the language generated by 
the grammar of Exercise 5, Section 12.3 (respectively, 
Example 12.5.5). Find a regular grammar that generates the 
language L,L. 
Show that the set 

L= {x1 --+Xn [Xp +++ Xn =Xn-+-X1} 
of strings over {a, b} is not a regular language. 
Show that if Z; and L» are regular languages over Z and S is the 
set of all strings over Z, then each of S — L;, L; UL2, L} NL2, 
EF, and L)Ly is a regular language. 
Show, by example, that there are context-free languages L; 
and Ly such that L; M L2 is not context-free. 
Prove or disprove: If L is a regular language, so is 


{u" |weL,ne {1,2,...}}. 


Chapter 12 Notes 


General references on automata, grammars, and languages are [Carroll; Cohen; Davis; 
Hopcroft; Kelley; McNaughton; Sudkamp; and Wood]. 

The systematic development of fractal geometry was begun by Benoit B. Mandelbrot 
(see [Mandelbrot, 1977, 1982]). 

A finite-state machine has a primitive internal memory in the sense that it remem- 
bers which state it is in. By permitting an external memory on which the machine can read 
and write data, we can define more powerful machines. Other enhancements are achieved 
by allowing the machine to scan the input string in either direction and by allowing the 
machine to alter the input string. It is then possible to characterize the classes of machines 
that accept context-free languages, context-sensitive languages, and languages generated by 


phrase-structure grammars. 


Turing machines form a particularly important class of machines. Like a finite-state 
machine, a Turing machine is always in a particular state. The input string to a Turing 
machine is assumed to reside on a tape that is infinite in both directions. A Turing machine 
scans one character at a time and after scanning a character, the machine either halts or does 
some, none, or all of the following: alter the character; move one position left or right; change 
states. In particular, the input string can be changed. A Turing machine T accepts a string a 
if, when q@ is input to 7, T halts in an accepting state. It can be shown that a language L is gen- 
erated by a phrase-structure grammar if and only if there is a Turing machine that accepts L. 

The real importance of Turing machines results from the widely held belief that any 
function that can be computed by some, perhaps hypothetical, digital computer can be com- 
puted by some Turing machine. This last assertion is known as Turing’s hypothesis or 
Church’s thesis. Church’s thesis implies that a Turing machine is the correct abstract model 
of a digital computer. These ideas also yield the following formal definition of algorithm. An 
algorithm is a Turing machine that, given an input string, eventually stops. 
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Chapter 12 Review 


Section 12.1 


. Unit time delay 

. Serial adder 

. Finite-state machine 
. Input symbol 

. Output symbol 
State 

. Next-state function 
. Output function 

. Initial state 

. Transition diagram 
. Input and output strings for a finite-state machine 
. SR flip-flop 


Section 12.2 


13. Finite-state automaton 

14. Accepting state 

15. String accepted by a finite-state automaton 
16. Equivalent finite-state automata 


CHIDAM EWN 


ek et 
a 


Section 12.3 


17. Natural language 

18. Formal language 

19. Phrase-structure grammar 
20. Nonterminal symbol 

21. Terminal symbol 

22. Production 

23. Starting symbol 

24. Directly derivable string 
25. Derivable string 

26. Derivation 


27. 
28. 
29. 
30. 
31. 
32. 
33. 
34. 
35. 
36. 
37. 


Language generated by a grammar 

Backus normal form (= Backus—Naur form = BNF) 
Context-sensitive grammar (= type | grammar) 
Context-free grammar (= type 2 grammar) 
Regular grammar (= type 3 grammar) 
Context-sensitive language 

Context-free language 

Regular language 

Context-free interactive Lindenmayer grammar 
von Koch snowflake 

Fractal curves 


Section 12.4 


38. 


39. 


40. 


41. 
42. 


Given a finite-state automaton A, how to construct a regu- 
lar grammar G, such that the set of strings accepted by A is 
equal to the language generated by G (see Theorem 12.4.2) 
Nondeterministic finite-state automaton 

String accepted by a nondeterministic finite-state automa- 
ton 

Equivalent nondeterministic finite-state automata 

Given a regular grammar G, how to construct a nonde- 
terministic finite-state automaton A such that the language 
generated by G is equal to the set of strings accepted by A 
(see Theorem 12.4.11) 


Section 12.5 


43. 


44. 


Given a nondeterministic finite-state automaton, how to 
construct an equivalent deterministic finite-state automaton 
(see Theorem 12.5.3) 

A language L is regular if and only if there exists a finite- 
state automaton that accepts the strings in L. 


Chapter 12 Self-Test 


1. Draw the transition diagram of the finite-state machine 
Z, 0,5, f,8, 00), where ZT = {a,b}, O = {0,1}, and 
S= {00,01}. 


f g 
iL a b a b 
S 
00 oO! oO 0 1 
oO! 0 oO! 1 


2. For the finite-state machine of Exercise 1, find the output 
string for the input string bbaa. 

3. Draw the transition diagram of the finite-state automaton 
(Z, S,f, A, S), where Z = {0,1}, S = {S,A, B}, and 
A= {A}. 


4. 


5: 


Ul 
Ti) 0% J 
S 
Ss AS 
A SB 
B AS 


Is the string 11010 accepted by the finite-state automaton 
of Exercise 3? 


Is the grammar 
S—aSb, S—Ab, A—-aA, A>b, A>A 


context-sensitive, context-free, regular, or none of these? 
Give all characterizations that apply. 


10. 


11. 


12. 


13. 


. Show that the string w = aaaabbbb is in the language gen- 


erated by the grammar of Exercise 5 by giving a derivation 
of a. 


. Characterize the language generated by the grammar of 


Exercise 5. 


Draw the transition diagram of the nondeterministic finite- 
state automaton (Z,S,f,A,oo), where J = {a,b}, S = 
{o0, 01, 02}, and A = {05}. 


Tt a b 
S 
00 {oo} {o2} 
con {o0, 01} @ 
02 {o2} {o0, 01} 


. Find a finite-state automaton equivalent to the nondetermin- 


istic finite-state automaton of Exercise 8. 


Find the sets Z, O, and S, the initial state, and the table 
defining the next-state and output functions for the follow- 
ing finite-state machine. 


all 
bit b/0 
1 
all a 
Design a finite-state machine whose input is a bit string that 


outputs 0 when it sees 001 and thereafter; otherwise, it out- 
puts 1. 


Draw the transition diagram of a finite-state automaton that 
accepts the set of strings over {0, 1} that contain an even 
number of 0’s and an odd number of 1’s. 


Characterize the set of strings accepted by the following 
finite-state automaton. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 
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0 0 
@L ie} —@) 
1 i! 
Write a grammar that generates all nonnull strings over 
{0, 1} having an equal number of 0’s and 1’s. 


Find the sets Z, S, and A, the initial state, and the table defin- 
ing the next-state function for the nondeterministic finite- 
state automaton shown below. 


Is the string aabaaba accepted by the nondeterministic 
finite-state automaton of Exercise 15? Explain. 


Design a nondeterministic finite-state automaton that 
accepts all strings over {0, 1} that begin 01 and contain 110. 


Find a finite-state automaton equivalent to the nondetermin- 
istic finite-state automaton of Exercise 15. 


Explain how to construct a nondeterministic finite-state 
automaton that accepts the language 


LiLy = {aB |a ely, B € Ly}, 


given finite-state automata that accept regular languages L; 
and Ly. 


Prove that any regular language that does not contain the 
null string is accepted by a nondeterministic finite-state 
automaton with exactly one accepting state. Give an exam- 
ple to show that this statement is false for arbitrary regular 
languages (i.e., if we allow the null string as a member of 
the regular language). 


Chapter 12 Computer Exercises 


. Write a program that simulates an arbitrary finite-state 


machine. The program should initially receive as input the 
next-state function, the output function, and the initial state. 
The program should then accept strings, simulate the action 
of the finite-state machine, and output the string produced by 
the finite-state machine. 


. Write a program that simulates an arbitrary finite-state 


automaton. The program should initially receive as input the 
next-state function, the set of accepting states, and the ini- 
tial state. The program should then accept strings, simulate 


the action of the finite-state automaton, and print messages 
indicating whether the strings are accepted. 


. Write a program to draw a fractal given the context- 


free interactive Lindenmayer grammar that generates it 
(see Section 12.3). 


. Report on the Knuth—Morris—Pratt algorithm (see [Johnson- 


baugh]) that determines whether a string contains a partic- 
ular substring. This algorithm makes use of a finite-state 
automaton. 
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Appendix A 


Example A.2 


Example A.4 


MATRICES 


It is a common practice to organize data into rows and columns. In mathematics, such 
an array of data is called a matrix. In this appendix we summarize some definitions and 
elementary properties of matrices. We begin with the definition of “matrix.” 


Definition A.1 >» A matrix 


a1 G2 «++ Gn 
a2) 422, «++ An 

A= : . . (A.1) 
Amt Gm2— +++ Amn 


is a rectangular array of data. 
If A has m rows and n columns, we say that the size of A is m by n (written m x n). 
< 


We will often abbreviate equation (A.1) to A = (aj). In this equation, a; denotes 
the element of A appearing in the ith row and jth column. 


21 0 
ae 6 2 


has two rows and three columns, so its size is 2 x 3. If we write A = (aj), we would 
have, for example, 


The matrix 


ay, = 2, ay, =—l, a3 = 0. < 


Definition A.3 > Two matrices A and B are equal, written A = B, if they are the 
same size and their corresponding entries are equal. < 


Determine w, x, y, and z so that 
x+y y _{5 2 
w+zw-z} \4 6)° 


SOLUTION According to Definition A.3, since the matrices are the same size, they 
will be equal provided that 


x+ty=5 y=2 
w+z=4 w—z=6. 605 
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Example A.6 


Example A.8 


Solving these equations, we obtain 


y= 2, z==1, < 


We describe next some operations that can be performed on matrices. The sum 
of two matrices is obtained by adding the corresponding entries. The scalar product is 
obtained by multiplying each entry in the matrix by a fixed number. 


Definition A.5 > Let A = (a;) and B = (b,) be two m x n matrices. The sum 
of A and B is defined as 


A + B= (aij + bj). 


The scalar product of a number c and a matrix A = (aj) is defined as 


cA = (cajj). 
If A and B are matrices, we define —A = (—1)A andA —B=A + (—B). < 
If 
4 2 1 -3 
A=|{-l Os B= 4 4], 
—2 —-l -3 
then 
5 -l 8 4 -1 3 
A+B= | 3 4}, 2A = | —2 0}, —-B=}|-4 -4 
5 —-5 12 -4 1 3 


Multiplication of matrices is another important matrix operation. 


Definition A.7 >» Let A = (a,) be an m x n matrix and let B = (bj) be ann x | 
matrix. The matrix product of A and B is defined as the m x | matrix 


AB = (cix), 


where 


n 
Cik = . Dix. 
j=l 


< 


To multiply the matrix A by the matrix B, Definition A.7 requires that the number 
of columns of A be equal to the number of rows of B. 


Let 
1 6 
12 -1 
A={4 2], a= ( ). 
3 1 4 7 0 


The matrix product AB is defined since the number of columns of A is the same as the 
number of rows of B; both are equal to 2. Entry c,, in the product AB is obtained by using 
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the ith row of A and the Ath column of B. For example, the entry c3; will be computed 
using the third row 


3 1) 


of A and the first column 


of B. We then multiply, consecutively, each element in the third row of A by each element 
in the first column of B and then sum to obtain 


3-14+1-4=7. 


Since the number of columns of A is the same as the number of rows of B, the elements 
pair up correctly. Proceeding in this way, we obtain the product 


25 44 -1 
AB=|{12 22 —-4 
7 13. =3 < 


Example A.9 The matrix product 


a b x is ax + by 

cd y cx + dy }~ < 
Definition A.10 > Let A be ann x n matrix. If m is a positive integer, the mth 
power of A is defined as the matrix product 


Example A.11__ If 


a oe ae 7 -15\_ ( 199 —435 
ws =#0 = (44 ) ee 22) ~\-290 634)" < 


Exercises 


1. Compute the sum 2. A+B 3. B+A 
2 4 1 abe 4. -A 5. 3A 
6 9 3)4+!1def). 6. —2B 7. 2B+A 
1 -1 6 ghi 8. B—6A 
In Exercises 2-8, let In Exercises 9-13, compute the products. 
_(1 6 9 _f 47 2 1 x3 a8 
a=() 4 3) a=(4 6 ') pat 3) Pat a 
01 4 6 0 


and compute each expression. 
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1 6 
10. {-8 2 G “) 
41 


Appendix A ® Matrices 


Show that if A is ann x n matrix (such a matrix is called 
a square matrix), then 


Al, =A =I,A. 
11. A’, where A = te ‘ 
6 2 Ann x n matrix A is said to be invertible if there exists ann x n 
1 matrix B satisfying 
3 
2. (2 -4 6 1 3)| 2 eyes 
: (The matrix I, is defined in Exercise 17.) 
2 41 ab 18. Show that the matrix 
13 9. 3 c d 2 1 
1 -1 6 e f ‘| 


(a) Give the size of each matrix. 


19. 
1 4 6 14 7 
A= 017)? B={8 2 144 
0 1 6 
4 2 
cC=|{0 0 
2 9 20. 
(b) Using the matrices of part (a), decide which of the 
products 
A’, AB, BA, AC, CA, AB’, 
BC, CB, C 


are defined and then compute these products. 


15. Determine x, y, and z so that the equation 


x+y 3xt+y _ fol 1 
x+z xty—2z)] — 9 -17 


holds. 
16. Determine w, x, y, and z so that the equation 
x 2x 
2 £ <1 9 y -yt+z |_ (45 46 21. 
6 8 O 3 xtw w—2yt+x] 3 87 
Zz Zz 


holds. 
17. Define the n x n matrix J, = (aj) by 
ifi=j 


1 
a= 16 ifi Xj. 


The matrix /,, is called the n x n identity matrix. 


is invertible. 


Show that the matrix 


(° a) 


is invertible if and only if ad — bc £ 0. 


Suppose that we want to solve the system 


AX =C, 
where 
om & -) 
az, a22 
x=(7 
y 
C2 
for x and y. 


Show that if A is invertible, the system has a solution. 


The transpose of a matrix A = (aj) is the matrix Al = (ai). 
where a = aj. Example: 


(1) =G 8): 


If A and B are m x k and k x n matrices, respectively, show 
that 


(AB)! = BTA’. 


Appendix B 


ALGEBRA REVIEW 


In this appendix, we review basic algebra: rules for combining and simplifying expres- 
sions; fractions; exponents; factoring; quadratic equations; inequalities; and logarithms. 
For a more extensive treatment of basic algebra, see [Bleau; Lial; Sullivan]. 


Grouping 
Terms with a common symbol can be combined: 
ac+be=(a+b)c, ac — be = (a— b)c. 


Technically, these equations are known as distributive laws. 


Example B.1 = 2x + 3x = (24+ 3)x = 5x < 


The distributive laws, rewritten as 
a(b+c) =ab+ac, a(b—c) =ab— ac, 


can be used to simplify expressions. 


Example B.2) 2(¢4+ 1) =2x+2-1=2x+42 < 
Example B.3) 2(x4+ 1) +2(¢—1) = 2x +2+4+2x-—2=4% < 
Fractions 
Formulas useful for adding, subtracting, and multiplying fractions are given as 
Theorem B.4. 


Theorem B.4 Combining Fractions 


a b a+b 
@ -+-= 

Cie G c 

a b a-—b 
(bs) <--= 

Chae G c 
Open ad + be 
(e = — 

Cc a cd 


609 


610 Appendix B ® Algebra Review 


‘Example B.5. 
‘Example B.6 
Example B.7 
‘Example B.8 


Example B.9 


Example B.10 


a b_  ad—be 
@) --== 

© @l cd 

a b_ ab 
Ore a wicd 


Using Theorem B.4(a), we obtain 


x-1l x41 (x I)+@4+1)_ 2x 


{| = =x. 
Z 3 2 7 ” 
Using Theorem B.4(b), we obtain 
1 + 1 1 + 1 2 
x x _@=0=G4)) = = P 


Using Theorem B.4(c), we obtain 


x-1 x+1 _ 3(x—- 1) + 24+ 1) _sx-l 


2 ' 3 2-3 6 . 
Using Theorem B.4(d), we obtain 
x-—1 x+1 3@-)-2@4+)  x-5 < 
2 307 23 ~ 6° 
Using Theorem B.4(e), we obtain 
oe 4 
x yxy 


Exponents 


If n is a positive integer and a is a real number, we define a” as 


If a is a nonzero real number, we define a? = 1. If n is a negative integer and a is a 
nonzero real number, we define a” as 


1 


aq 


dq’ = 


If a is a real number, at = a-a-a-a. Asa specific example, 2+ = 2-2-2-2 = 16. Ifa 


is a nonzero real number, 


As a specific example, 
24a =, < 


If a is a positive real number and n is a positive integer, we define a'/” to be the 
positive number b satisfying b” = a. We call b the nth root of a. 


Example B.11 


Example B.12 


Theorem B.13 


Example B.14 


Example B.15 


Example B.16 


AppendixB @® AlgebraReview 611 


3'/4 to nine significant digits is 1.316074013 because (1.316074013)* is approxi- 
mately 3. < 


If a is a positive real number, m is an integer, and n is a positive integer, we define 
qin = (ally, 


The preceding equation defines a‘ for all positive real numbers a and rational numbers 
q. (Recall that a rational number is a number that is the quotient of integers.) 


Since 3'/* to nine significant digits is 1.316074013, 
3°/4 = (1.316074013)? = 11.84466612. 


The decimal values are approximations. < 


If a is a positive real number, the definition of a‘ can be extended to include all 
real numbers x (rational or irrational). The following theorem lists five important laws 
of exponents. 


Laws of Exponents 


Let a and b be positive real numbers, and let x and y be real numbers. Then 


(a) d = aa’ 


(b) (@’)’ =a” 


a 


(c) em 


(d) a‘b* = (ab)* 


0 fa (6): 


Let a = 3, x = 2, and y = 4. Then a* = 9, a’ = 81, and a**” = 37+4 = 729, Now 
a’? = 729 =9-81 =a'a’, 


which illustrates Theorem B.13(a). < 


Let a = 3, x = 2, and y = 4. Thena’ = 9 anda® = 38 = 6561. Now 
(a) = 94 = 6561 =a”, 


which illustrates Theorem B.13(b). < 


Let a = 3, x = 2, and y = 4. Thena’ = 9, a” = 81, anda** = 37 = 1/9. Now 


which illustrates Theorem B.13(c). < 
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Example B.17 


Example B.18 


Example B.19 


Example B.20 


Example B.21 


Example B.22 


Let a = 3, b = 4, and x = 2. Then a* = 9, b* = 16, and (ab)* = 12? = 144. Now 
ab’ =9-16= 144 = (ab)’, 


which illustrates Theorem B.13(d). < 


Let a = 3,b = 4, and x = 2. Then a‘ = 9, b* = 16, and 


(3)'=(3) =% 


Now 

a‘ _ 9 _ ()" 

bk 16. (\bs? 
which illustrates Theorem B.13(e). < 
OX — err = 9x _ (27)* =4 < 
Factoring 


We may use the equation 


(x+b)(x+d) =x + (b+d)x + bd 


to factor an expression of the form e+exteo. 


Factor x7 + 3x + 2. 


SOLUTION We look for integer constants in the factorization. According to the previ- 
ous equation, x° + 3x + 2 factors as (x + b)(x + d), where b+ d = 3 and bd = 2. If 
bd = 2 and b and d are integers, the only choices for b and d are 1, 2 and —1, —2. We 
find that b = 1 and d = 2 satisfy both b + d = 3 and bd = 2. Thus 


VP 4+3x4+2= (4+ D042). < 


Special cases of 


(x+b)(x+d) =x + (b+d)x + bd 


(x+b)? =x +2bx+ 0° 
(x — b)? =x? —2bx + h* 
(x + b)(x — b) =x? —D’. 
Using the equation (x + b)? = x? + 2bx + b”, we have 


(x +9)? =x? + 18x4 81. cl 


Factor x? — 36. 
SOLUTION Since 36 = 62, we have 


x — 36 = (x + 6)(x— 6). < 


Example B.23 


Example B.24 
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We may use the equation 
(ax + b)(cx + d) = (ac)x + (ad + bc)x + bd 


to factor an expression of the form cox? + yx +c. 


Factor 6x2 — x — 2. 


SOLUTION We look for integer constants in the factorization. Using the preceding 
notation, we must have 


ac=6, ad+bc=-—l, bd = —2. 
Since ac = 6, the possibilities for a and c are 
1,6. 2:3. (=]1,=6: =2)<3: 
Since bd = —2, the only possibilities for b and d are 1, —2 and —1, 2. Since we must 
also have ad + bc = —1, we find that a = 2,b = 1,c = 3, andd = —2 provide a 


solution. Therefore, the factorization is 


6x7 —x —2 = (2x+ I)GBx— 2). < 
Show that 


+ (n4 


ise (nt 1(n+2)]? 
= 5 


SOLUTION We show how the left side of the equation can be rewritten as the right side 
of the equation. By Theorem B.13(d) and (e), we have 


nw(n+1)* 


nint+1)]? 
( ) +(n+1p= +(n+1). 
2 4 
Since (n + 1)? is a common factor of the right side of this equation, we may write 
n(n + 1)? n 
mer sity! =(n+1) Fara) ; 
Since 
nr m+4n+4  (n+2)? 
— 1 = = , 
A +(n+1) 4 4 


it follows that 


(n+ 2) 
4 


_ [er nereyy 


2 
(n+1)2 [Ftary) =(n +1)? 
4 2 


Solving a Quadratic Equation 
A quadratic equation is an equation of the form 


ar +bx+c=0,a40. 


A solution is a value for x that satisfies the equation. 
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Example B.25 


Example B.26 


Theorem B.27 


Example B.28 


The value x = —3 is a solution of the quadratic equation 
2x? + 2x-12=0 
because 
2(-3)? + 2(—3) — 12 = 2-9-6 — 12 = 18-18 =0. < 
If a quadratic equation can be easily factored, its solutions may be readily obtained. 
Solve the quadratic equation 


3x7 — 10x + 8 =0. 


SOLUTION We may factor 3x7 — 10x + 8 as 


3x° — 10x + 8 = (x — 2)(3x — 4). 


For this expression to be equal to zero, either x—2 or 3x—4 must equal zero. If x—2 = 0, 
we must have x = 2. If 3x — 4 = 0, we must have x = 4/3. Thus the solutions of the 
given quadratic equation are 


4 
x=2 and aaa < 


The solutions of a quadratic equation can always be obtained from the quadratic 


formula. 


Quadratic Formula 
The solutions of 


ax? +bx+c=0,a4¢0, 


are 
—b+ Jb? — 4ac 
2S ———— 

2a 


The quadratic formula gives the solutions of x* — x — 1 = 0 as 


(-1) + J(-1)* —4-1-( 1) 14V14+4_ 145 
Zeal ~ ~ , 


Thus the solutions are 


Inequalities 


If a is less than b, we write a < b. If ais less than or equal to b, we write a < b. Ifa 
is greater than b, we write a > b. If a is greater than or equal to b, we write a > b. 


Example B.29 


Theorem B.30 


Example B.31 


Example B.32 


Example B.33 
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Suppose that a = 2, b = 8, c = 2. We have 


a<b,b>a,a 


<b,b>a,a<c,a>c. 


Important laws of inequalities are given as Theorem B.30. 


Laws of Inequalities 


(a) If a < band c is any number whatsoever, thena+c<b+c. 
(b) Ifa < band c is any number whatsoever, thena+c <b+c. 
(c) Ifa > band c is any number whatsoever, thena+c>b+c. 
(d) If a > b and c is any number whatsoever, then a +c >b+c. 
(e) Ifa < bandc > 0, then ac < be. 
(f) Ifa < bandc > 0, then ac < be. 
(g) Ifa < bandc < 0, then ac > be. 
(h) Ifa < bandc < 0, then ac > be. 
(i) Ifa > bandc > 0, then ac > be. 
G) Ifa > bandc > 0, then ac > be. 
(k) Ifa > bandc < 0, then ac < be. 
(l) Ifa > band c < 0, then ac < be. 


(m) If a < band b <c, thena < c. 
(n) Ifa < band b <c, thena <c. 
(o) Ifa < band b <c, thena <c. 
(p) Ifa < bandb <c, thena <c. 
(q) Ifa>bandb>c,thena>c. 
(r) Ifa>bandb>c,thena > c. 
(s) Ifa>bandb>c,thena>c. 
(t) Ifa >bandb>c,thena>c. 


Solve the inequality x — 5 < 6. 
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SOLUTION By Theorem B.30(a), we may add 5 to both sides of the inequality to obtain 


the solution 


Solve the inequality 


Xe 1h, 


3x+4<x+4+ 10. 


< 


SOLUTION By Theorem B.30(a), we may add —.x to both sides of the inequality to 
obtain 2x + 4 < 10. Again, by Theorem B.30(a), we may add —4 to both sides of the 
inequality to obtain 2x < 6. Finally, we may use Theorem B.30(e) to multiply both sides 


of the inequality by 1/2 and obtain the 


Show that ifn > 2m and m > 2p, then 


solution x < 3. 


n> Ap. 


< 
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Example B.34 


Example B.35 


Example B.36 


Theorem B.37 


SOLUTION We may use Theorem B.30(i) to multiply both sides of m > 2p by 2 to 
obtain 2m > 4p. Since n > 2m, we may use Theorem B.30(q) to obtain n > 4p. < 


Show that 


n+2 4n+1)* 
ow ee, 
n+l (2n+1)? 


for every positive integer n. 


SOLUTION Since (n + 1)(2n + 1)? is positive, by Theorem B.30(e), 


4(n + 1)? 


GA ee ie <a OR A — 
n+ (n+) 


which can be rewritten as 
(2Qn+1)?(n+2) < (n+ D4(n+ 1’. 
Expanding each side of the inequality, we obtain 
4n? + 12n? + 9n+2 < 4n? + 12n? + 12n+-4. 


By Theorem B.30(a), we may add —4n? — 12n? —9n — 2 to both sides of the inequality to 
obtain 0 < 3n+2. This last inequality is true for all positive integers n because the right 
side is always at least 5. Since the steps are reversible (i.e., beginning with 0 < 3n + 2 
we can obtain the original inequality using Theorem B.30), we have proved the given 
inequality. < 


Logarithms 


Throughout this subsection, b is a positive real number not equal to 1. If x is a positive 
real number, the logarithm to the base b of x is the exponent to which b must be raised 
to obtain x. We denote the logarithm to the base b of x as log,,x. Thus if we let y = log, x, 
the definition states that b” = x. 


We have log,8 = 3 because 2? = 8. < 


Given 27" = n, where n is a positive integer, solve for x. 
SOLUTION Let lg denote the logarithm to the base 2. Then from the definition of 
logarithm, 2* = Ign. Again, from the definition of logarithm, x = lg(1g 7). < 


The following theorem lists important laws of logarithms. 


Laws of Logarithms 
Suppose that b > 0 and b ~ 1. Then 


(a) plosn* = 


(b) log, (xy) = log,x + log,y 
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eG 
(c) log, (=) = log,x — log,y 


(d) log, (x) = ylog,x 


1 
(e) Ifa > Oanda ¥ 1, we have log, x = nets 


log, a 
(f) Ifb > landx > y > 0, then log, x > log, y. 
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Theorem B.37(e) is known as the change-of-base formula for logarithms. If we 
know how to compute logarithms to the base b, we can perform the computation on the 
right side of the equation to obtain the logarithm to the base a. Theorem B.37(f) says 


that if b > 1, log, (x) is an increasing function. 


Example B.38 Let b = 2 and x = 8. Then log,x = 3. Now 


Pera Sk =x, 


which illustrates Theorem B.37(a). 


< 


Example B.39 Letb=2,x = 8, andy = 16. Then log,x = 3, log, y = 4, and log, (xy) = log, 128 = 7. 


Now 


log, (xy) = 7 =34+4 = log,x + log,y, 


which illustrates Theorem B.37(b). 


Example B.40 Let b=2,x = 8, and y = 16. Then log,x = 3, log, y = 4, and 


x 1 
log, {- ] = logy 5 =I, 
y 


Now 


log, (=) = —1 = log,x — log,y, 
y 


which illustrates Theorem B.37(c). 


Example B.41_ Let b = 2, x = 4, and y = 3. Then log,x = 2 and 


log, (x”) = log,64 = 6. 


Now 


log, (@’) = 6 = 3-2 = ylog,x, 


which illustrates Theorem B.37(d). 


< 


Example B.42 Suppose that we have a calculator with a logarithm key that computes logarithms to the 
——_—_—_—_ base 10 but does not have a key that computes logarithms to the base 2. Use Theorem 


B.37(e) to compute log,40. 
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Example B.43 


SOLUTION Using our calculator, we compute 
log))40 = 1.602060, logj 2 = 0.301030. 


Theorem B.37(e) now gives 


log;)40 1.602060 


= = 5.321928. < 
log;)2 0.301030 


log,40 = 


Show that if k and n are positive integers satisfying 

Wl ene ae, 
then 

k-—1<lIgn<k, 
where lg denotes the logarithm to the base 2. 
SOLUTION By Theorem B.37(f), the logarithm function is increasing. Therefore, 
1g(2'!) < Ign < Ig(2"). 
By Theorem B.37(d), 
1g(2*!) = (k — 1) 1g2. 
Since lg 2 = log,2 = 1, we have 
1g!) = (k-1)lg2=k-1. 


Similarly, 1g(2") = k. The given inequality now follows. < 


Exercises 


In Exercises 1-3, simplify the given expression by combining like 11. (—3)-4 12. 1!° 13. 1000° 
terms. 14. Which expressions are equal? 
1. 8x— 12x (a) 34310 (b) (3*)!° (c) 314 
2. 8x +3a—4y—9a (d) 43103 (e) 23203 (f) 3% 
3. 6(a+b) — 8(a— b) (g) 21877 
In Exercises 4-6, combine the given fractions. 15. Show that 5" + 4-5" = 5+! for every positive integer n. 
i 8x — 4b n Tx +b 5. 8x—4b  Ix+b In Exercises 16-24, expand the given expression. 
8x 2 4b 7x i : 16. («+ 3)(x+ 5) 17. (x —3)(x +4) 
6. <3 18. (2x + 3)(3x — 4) 19. «+4? 
7. Show that 20. (x — 4)? 21. (x44)? 
: ! : 22. (x — 2)(x + 2) 23. (x + a)(x — a) 


no n+l nn+l) 


Use this fact to show that 
va 
i@i+ 1) 


i=1 


Find the value of each expression in Exercises 8-13 without using 


a calculator. 


8. 34 9. 3-4 


24. (2x — 3)(2x + 3) 


ik In Exercises 25-36, factor the given expression. 


~ Atl 25. 2 +6x+5 26. x2 —3x—10 
27. x +6x49 28. x2 — 8x+16 
29. x° — 81 30. x? — 4b? 
31. 2x2 + 11x+5 32. 6x7 +x—15 


10. (—3)4 


34. 4x2 -9 
36. 12x2 — 50x +50 


33. 4x? — 12x +9 
35. 9a? — 4b? 
37. Show that 


(n+ 1!+4+)Dm+ ll! =(n+2)! 
for every positive integer n. 
38. Show that 
n(n + 1)(2n + 1) 
6 
for every positive integer n. 
39. Show that 


(n+ 1)(n + 2)(2n + 3) 
6 


t(n+1)? = 


n ; 1 n+1 
Int1 9 (Qn+1)Qn+3) 2n+3 
for every positive integer n. 


40. Show that 
713 nl —4 5st!) = 103 22-2 —4 aoe) = 3.2" 4.5" 


for every positive integer n. 
41. Simplify 2r(n — Ir"! — °?(n — 2)r*-?. 
In Exercises 42-44, solve the quadratic equation. 
42. °° —6x+8=0 
43. 6x7 -7x+2=0 
44, 2x? —4x+1=0 


In Exercises 45-47, solve the given inequality. 


45. 2x+3<9 


46. 2x-—8>3x4+1 

47. x—3 - 4x+3 

6 2 

48. Show that S77 i <n’. 
49. Show that 


d+and+x>14+(@+lx 


for any x anda > 0. 
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50. Show that 


for every integer n > 2. 
51. Show that 
2n+1 2 
PRT Or a ETO) 
(n+ 2)n (n+ 1) 
for every positive integer n. 
52. Show that 6n? < 6n* + 4n + 1 for every positive integer n. 
53. Show that 6n? + 4n + 1 < 11n? for every positive integer n. 


Find the value of each expression in Exercises 54-58 without using 
a calculator (1g means log>). 


54. 1g 64 55. Ig ig 
56. 1g2 57, 2Ig10 
58. 1g 21000 


Given that lg3 = 1.584962501 and lg 5 = 2.321928095, find the 
value of each expression in Exercises 59-63 (lg means log>). 

59. 1g6 60. 1g 30 

61. 1g 59049 62. 1g0.6 

63. 1g 0.0375 

Use a calculator with a logarithm key to find the value of each ex- 
pression in Exercises 64-67. 
64. logs47 

66. logy8.888! 


65. log70.30881 
67. logy 9 (log;g 1054) 


In Exercises 68-70, use a calculator with a logarithm key to solve 
for x. 


68. 5*=11 
69. 526" = 811 
70. 5'% = 19100 


71. Show that x!" = yloss*, 


Appendix C 


Example C.1 


Example C.2 
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PSEUDOCODE 


In this appendix, we describe the pseudocode used in this book. 

We let = denote the assignment operator. In pseudocode, x = y means “copy 
the value of y into x” or, equivalently, “replace the current value of x by the value of y.” 
When x = y is executed, the value of y is unchanged. 


Suppose that the value of x is 5 and the value of y is 10. After 
x=y 


the value of x is 10 and the value of y, which is unchanged, is also 10. < 


In the if statement 


if (condition) 
action 


if condition is true, action is executed and control passes to the statement following 
action. If condition is false, action is not executed and control passes immediately to the 
statement following action. As shown, we use indentation to identify the statements that 
make up action. 


Suppose that the value of x is 5, the value of y is 10, and the value of z is 15. Consider 
the segment 


if (y > x) 
a 
ye 
Since y > x is true, 
L=x% 
executes, and the value of z is set to 5. Next 
yz 


executes and the value of y is set to 5. Now each of the variables x, y, and z has 
value 5. < 


We show reserved words (e.g., if) in the regular typeface and user-chosen words 
(e.g., variables such as x) in italics. 


Example C.3 


Example C.4 
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We use the usual arithmetic operators +, —, * (for multiplication), and / as well 
as the relational operators == (equals), — = (not equal), <, >, <, and > and the logical 
operators A (and), V (or), and — (not). We will use == to denote the equality operator 
and = to denote the assignment operator. We will sometimes use less formal statements 
when to do otherwise would obscure the meaning. (Example: Choose an element x in S.) 


Suppose that the value of x is 5, the value of y is 10, and the value of z is 15. Consider 
the segment 


if (y == x) 


Since y == x is false, 
2=x 

does not execute. Next 
yHzZz 


executes, and the value of y is set to 15. Now the value of x is 5, and each of the variables 
y and z has value 15. < 


In an if statement, if action consists of multiple statements, we enclose them in 
braces. An example of a multiple-statement action in an if statement is 


if (x = 0) { 
x=x+1 
a=b+c 


} 


An alternative form of the if statement is the if else statement. In the if else state- 
ment 


if (condition) 
action] 
else 
action2 


if condition is true, action! (but not action2) is executed and control passes to the state- 
ment following action2. If condition is false, action2 (but not action!) is executed and 
control passes to the statement following action2. If action! or action2 consists of mul- 
tiple statements, they are enclosed in braces. 


Suppose that the value of x is 5, the value of y is 10, and the value of z is 15. Consider 
the segment 


if ya =x) 
y=x 
else 
Z=x 
a=z 


Since y > = x is true, 


y=x 
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Example C.5 


Example C.6 


executes and y is set to 5. The statement 
Z=xX 

does not execute. Next 
a=Z 


executes and a is set to 15. Now the value of each of x and y is equal to 5, and the value 
of each of a and z is equal to 15. < 


Suppose that the value of x is 5, the value of y is 10, and the value of z is 15. Consider 
the segment 


if (vy < x) 
yx 
else 
<=x 
a=zZ 
Since y < x is false, 
yx 
does not execute. Rather, 
£=x 
executes and z is set to 5. Next 


a=Z 


executes and a is set to 5. Now the value of each of x, z, and a is equal to 5, and the value 
of y is equal to 10. < 


Two slash marks // signal the beginning of a comment, which then extends to the 
end of the line. Comments help the reader understand the code but are not executed. 


In the segment 
if (x > 0) { // if x is nonnegative, update x and a 
x=x+1 
a=b+c 
} 


the part 
// if x is nonnegative, update x and a 


is acomment. The segment executes as if it were written 


if (x = 0) { 
x=x+1 
a=b+c 
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The while loop is written 


while (condition) 
action 


If condition is true, action is executed and this sequence is repeated; that is, if condition 
is true, action is executed again. This sequence is repeated until condition becomes false. 
Then control passes immediately to the statement following action. If action consists of 
multiple statements, we enclose them in braces. 


Example C.7 Let 51,..., 5, be a sequence. After the segment 


large = sy 
i=2 
while (i < n) { 
if (s; > large) 
large = sj 
i=it+l 
} 


executes, /arge is equal to the largest element in the sequence. The idea is to step through 
the sequence and save the largest value seen so far in large. < 


In Example C.7 we stepped through a sequence by using the variable i that took 
on the integer values 2 through n. This kind of loop is so common that a special loop, 
called the for loop, is often used instead of the while loop. The form of the for 
loop is 


for var = init to limit 
action 


As in the previous if statement and while loop, if action consists of multiple statements, 
we enclose them in braces. When the for loop is executed, action is executed for values 
of var from init to limit. More precisely, init and limit are expressions that have integer 
values. The variable var is first set to the value init. If var < limit, we execute action and 
then add | to var. The process is then repeated. Repetition continues until var > limit. 
Notice that if init > limit, action will not be executed at all. 


Example C.8 — The segment of Example C.7 may be rewritten using a for loop as 


large = 81 
fori=2ton 
if (s; > large) 
large = 8; < 


A function is a unit of code that can receive input, perform computations, and 
produce output. The parameters describe the data, variables, and so on that are input to 
and output from the function. The syntax is 


function name(parameters separated by commas) { 
code for performing computations 
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Example C.9 The following function, named max/, finds the largest of the numbers a, b, and c. The 

—_—_—_= parameters a, b, and c are input parameters (i.e., they are assigned values before the 
function executes), and the parameter x is an output parameter (1.e., the function will 
assign x a value—namely, the largest of the numbers a, b, and c). 


maxI (a, b, c, x) { 
x=a 
if (b > x) // if b is larger than x, update x 
x=b 
if (c > x) //if c is larger than x, update x 
L=c 


The return statement 
return x 


terminates a function and returns the value of x to the invoker of the function. The return 
statement 


return 


(without the x) simply terminates a function. If there is no return statement, the function 
terminates just before the closing brace. 


Example C.10 The function of Example C.9 can be rewritten using the return statement as 


max2(a, b,c) { 
x=a 
if (b > x) // if b is larger than x, update x 
x=b 
if (c > x) //if c is larger than x, update x 
xc 
return x 


} 


Rather than using a parameter for the largest of a, b, and c, max2 returns the largest 
value. < 


We use the functions print and printin for output. The function println adds a 
newline after printing its argument(s) (which causes the next output to occur flush left 
on the next line); otherwise, the functions are the same. The operator + concatenates 
strings. Strings are delimited by double quotation marks (i.e., “and’’). If exactly one of 
+’s operands is a string, the other argument is converted to a string, after which the 
concatenation occurs. The concatenation operator is useful in the print functions. 


Example C.11 The segment 


fori=lton 
println(s;) 


prints the values in the sequence s),..., S,, one per line. < 


1. 


Example C.12 The segment 


fori=lton 
print(s; + “”) 
printin() 


prints the values in the sequence 51, .. 
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. , Sn, Separated by space, on one line. The sequence 


values are followed by a newline. < 


Exercises 


Show how the segment in Example C.7 finds the largest ele- 
ment in the sequence 


s.=2, 2 =3, 93=8, se =6. 


. Show how the segment in Example C.7 finds the largest ele- 


ment in the sequence 


5, =8, so =8, 93 =4, seal. 


. Show how the segment in Example C.7 finds the largest ele- 


ment in the sequence 


sHl, ss 1, ss=1, smd. 


. Show how the function max] of Example C.9 finds the largest 


of the numbers a = 4, b = —3, andc = 5. 


. Show how the function max] of Example C.9 finds the largest 


of the numbers a = b = 4 and c = 2. 


. Show how the function max] of Example C.9 finds the largest 


of the numbersa = b=c=8. 


7. Write a function that returns the minimum of a and b. 


13. 


14. 


. Write a function that returns the maximum of a and b. 


. Write a function that swaps the values a and b. (Here a and b 


will be both input and output parameters.) 


. Write a function that prints all odd numbers between | and n, 


inclusive. 


. Write a function that prints all negative numbers, one per line, 


that occur in the sequence of numbers 51, ..., Sy. 


. Write a function that prints all indexes of values in the se- 


quence s;,..., 8, that are equal to the value val. 


Write a function that returns the product of the sequence of 
numbers 51, 52,..., Sp. 

Write a function that prints every other value in the sequence 
51, 82,.+-, 5p (L€., 51, 53, 55, .-.) one value per line. 
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HINTS/SOLUTIONS TO Selected Exercises 


Section 1.1 Review 


1. A set is a collection of objects. 
2. A set may be defined by listing the elements in it. For exam- 
ple, {1, 2, 3, 4} is the set consisting of the integers 1, 2, 3, 4. 
A set may also be defined by listing a property necessary for 
membership. For example, 
{x | x is a positive, real number} 
defines the set consisting of the positive, real numbers. 
3. Set Description Examples of Members 
Z Integers —3,2 
Q Rational numbers —3/4, 2.13074 
R Real numbers —2.13074, /2 
Zt Positive integers 2,10 
Qr Positive rational numbers 3/4, 2.13074 
RT Positive real numbers 2.13074, J2 
Zz Negative integers —12, -10 
Q- Negative rational numbers —3/8, —2.13074 
R- Negative real numbers —2.13074, =./2 
zhonnes Nonnegative integers 0,3 
Q’”’? nneg Nonnegative rational 0, 3.13074 
numbers 
R!0NN€k — Nonnegative real numbers 0, /3 
4. The cardinality of X (i.e., the number of elements in X) 
5. x EX 6.x ¢X 7. D 


8. Sets X and Y are equal if they have the same elements. Set 


12. 


13. 


equality is denoted X = Y. 


. Prove that for every x, if x is in X, then x is in Y, and if x is in 


Y, then x is in X. 


. Prove one of the following: (a) There exists x such that x € X 


and x ¢ Y. (b) There exists x such that x ¢ X andx € Y. 


. X is a subset of Y if every element of X is an element of Y. X 


is a subset of Y is denoted X C Y. 


To prove that X is a subset of Y, let x be an arbitrary element 
of X and prove that x is in Y. 


Find x such that x is in X, but x is not in Y. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21. 
22. 


23. 


24. 


25. 


26. 


27. 


X is a proper subset of Y if X C Y and X # Y. X is a proper 
subset of Y is denoted X C Y. 


To prove that X is a proper subset of Y, prove that X is a subset 
of Y and find x in Y such that x is not in X. 


The power set of X is the collection of all subsets of X. It is 
denoted P(X). 


X union Y is the set of elements that belong to either X or Y or 
both. It is denoted X U Y. 


The union of S is the set of elements that belong to at least one 
set in S. It is denoted US. 


X intersect Y is the set of elements that belong to both X and 
Y. It is denoted XN Y. 


The intersection of S is the set of elements that belong to every 
set in S. Itis denoted NS. 

XNY=f6 

A collection of sets S is pairwise disjoint if, whenever X and 
Y are distinct sets in S, X and Y are disjoint. 


The difference of X and Y is the set of elements that are in X 
but not in Y. It is denoted X — Y. 


A universal set is a set that contains all of the sets under dis- 
cussion. 


The complement of X is U — X, where U is a given universal 
set. The complement of X is denoted X. 


A Venn diagram provides a pictorial view of sets. In a Venn 
diagram, a rectangle depicts a universal set, and subsets of the 
universal set are drawn as circles. The inside of a circle repre- 
sents the members of that set. 


a 
SEN 
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28. 
29. 
30. 


31. 
32. 
33. 
34. 
35. 
36. 
38. 
39. 


40. 


41. 


Hints and Solutions to Selected Exercises 


Region 1| represents elements in none of X, Y, or Z. Region 2 
represents elements in X, but in neither Y nor Z. Region 3 rep- 
resents elements in X and Y, but not in Z. Region 4 represents 
elements in Y, but in neither X nor Z. Region 5 represents ele- 
ments in X, Y, and Z. Region 6 represents elements in X and Z, 
but not in Y. Region 7 represents elements in Y and Z, but not 
in X. Region 8 represents elements in Z, but in neither X nor Y. 


(AUB)UC=AU (BUC), (ANB)NC=AN(BNC) 
AUB=BUA, ANB=BNA 


AN(BUC)=(ANB)UANC)AU(BNOC= 
(AUB)N AUC) 


AUS=A,ANU=A 
AUA=U,ANA=2 

AUA=A, ANA=A 

AUU=U, ANZ=B 

AU(ANB)=A, AN(AUB)=A 

A-A 37. @=U, U=B 
(AUB) =ANB, (ANB) =AUB 


A collection S of nonempty subsets of X is a partition of X if 
every element in X belongs to exactly one member of S. 


The Cartesian product of X and Y is the set of all ordered pairs 
(x, y) where x € X andy € Y. It is denoted X x Y. 


The Cartesian product of X, X2,...,X, is the set of all 
n-tuples (x1, %2,...,Xn) where x; € X; fori = 1,...,n. It 
is denoted X; x Xp x +--+ x Xp. 


Section 1.1 


36. 


. {1, 2,3, 4,5, 7, 10} 4. {2,3,5} 
~ om 10. U 

- {6, 8} 

. {1, 2,3, 4,5, 7, 10} 


. Y is the set of odd, positive integers. 

. {2,4} 

. {ne Zt |n=2orn=4orn> 6} 

. {2n+1|ne Zt andn > 3} 

. O 31. 5 

. If x € A, then x is one of 3,2, 1. Thus x € B. If x € B, then x 


is one of 1, 2,3. Thus x € A. Therefore, A = B. 


. If x € A, then x satisfies x2 — 4x +4 = 1. Factoring co 


4x + 4, we find that (x — 2)? = 1. Thus (x — 2) = +1. If 
(x — 2) = 1, then x = 3. If (xv — 2) = —1, then x = 1. Since 
x=3o0rx=1,x € B. Therefore A C B. 

If x € B, then x = 1 orx = 3. If x = 1, then 


v—4444=07-4.144=1 
and thus x € A. If x = 3, then 
vv —4444537-4.344=1 


and again x € A. Therefore B C A. We conclude that A = B. 
Since 1 € A, but! ¢B,A FB. 


39. 
40. 
44. 


47. 


48. 
51. 
52. 


55. 
58. 


60. 


61. 
68. 
71. 


72. 
75. 
76. 
78. 


81. 


83. 
86. 


87. 


Note that A= BNC = {2, 4}. Since 1 € B, but 1 ¢A,A FB. 
43. Not equal 


Let x € A. Then x = 1 or x = 2. In either case, x € B. 
Therefore A C B. 


First note that B = Z*. Now let x € A. Then x = 2n for some 
né Zt. Since 2 € Z*,2n € Z* = B. Therefore A C B. 


Since 3 € A, but 3 ¢ B, A is not a subset of B. 
Since 3 € A, but 3 ¢ B, A is not a subset of B. 


Equal 


A B e 


Same as Exercise 52 


Cc 


The shaded area represents the beverage, which has great taste 
and is less filling. 


10 64. 64 66. 4 
{C1 a), (, b), C1, ¢), (2, 4), (2, b), (2, ¢)} 

{(a, a), (a, b), (a,c), (b, a), (b, b), (b, ©), (c, a), 
(c, b), (c, )} 

{(1, a, a), (1, a, B), (2,4, @), (2, a, B)} 

{(a, 1, a, @), (a, 2, a, a), (a, 1, a, B), (a, 2, a, B)} 
The entire xy-plane 


Parallel horizontal lines spaced one unit apart. There is a 
lowest line [passing through (0, 0)] but the lines continue 
indefinitely above the lowest line. 


Parallel planes stacked one above another one unit apart. The 
planes continue indefinitely in both directions [above and 
below the origin (0, 0, 0)]. 

{{1}} 

{{a, b,c, dh}, {{a, b,c}, {ah}, 

{{a, b, d}, {c}}, {{a, c, dh, {b}}, {{B, c, dh, {ah}, 

{{a, b}, {c}, {a}, {{a, c}, {b}, (ahh, 

{{a, d}, {b}, {ch}, 

{{b, c}, {a}, {d}}, {{d, a}, {a}, {ch}, (fe, a}, fa}, {d}}, 
{{a, b}, {c, d}}, {{a, c}, {b, ah}, (fa, d}, {b, ch}, 

{{a}, {D}, fc}, {ah} 


True 90. True 


93. 
96. 
100. 
104. 


©, {a}, {b}, {a, b}. All but {a, b} are proper subsets. 


2" —-1 97. ACB 
BCA 101. {1, 4, 5} 
The center of the circle 


Section 1.2 Review 


1. 


5. 


7. 


A proposition is a sentence that is either true or false, but not 
both. 


. The truth table of a proposition P made up of the individual 


propositions p1, ..., Py lists all possible combinations of truth 
values for p1,..., Pn, T denoting true and F denoting false, 
and for each such combination lists the truth value of P. 


. The conjunction of propositions p and q is the proposition p 


and q. It is denoted p A q. 


P 4 DA 
| ee! T 
T F F 
F T FE 
FF FE 


The disjunction of propositions p and q is the proposition p or 
q. It is denoted p v q. 


PV 


TMmaH|v 
AHoamH)|s 
mMHAdl|< 


The negation of proposition p is the proposition not p. It is 
denoted =p. 


| 
am 


Section 1.2 


1. 
5. 
8. 
11. 


13. 


16. 


17. 


Is a proposition. Negation: 2+ 5 4 19 
Not a proposition; it is a question. 
Not a proposition; it is a command. 


Not a proposition; it is a description of a mathematical expres- 
sion (i.e., p — g, where p and q are primes). 


Ten heads were not obtained. (Alternative: At least one tail 
was obtained.) 


No heads were obtained. (Alternative: Ten tails were ob- 
tained.) 


True 
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20. True 
23. 
P 4 PA~4q 
T- -T F 
T F T 
F T F 
F F F 
26. 
Po4 (pPAQ A74 
T T F 
T F F 
F T F 
F F F 
29. 
P 4 | PYDACPYDARY=9A CPV 79) 
T T F 
T F F 
F T F 
F F F 
31. p Aq; false 
34. Lee does not take computer science. 


37. 
40. 
43. 


45. 
48. 


50. 
56. 
61. 
68. 


71. 


74. 


75. 


Lee takes computer science or Lee does not take mathematics. 
You play football and you miss the midterm exam. 


It is not the case that you play football or you miss the midterm 
exam, or you pass the course. 


Today is Monday or it is raining. 


(Today is Monday and it is raining) and it is not the case that 
(it is hot or today is Monday). 

=p 53. =p A -=q 
pA~4q 
pAr 


59. =p A =r A 7q 

64. (pV gq) An7r 

Inclusive-or: To enter Utopia, you must show a driver’s license 
or a passport or both. Exclusive-or: To enter Utopia, you must 


show a driver’s license or a passport but not both. Exclusive-or 
is the intended meaning. 


Inclusive-or: The car comes with a cupholder that heats or 
cools your drink or both. Exclusive-or: The car comes with 
a cupholder that heats or cools your drink but not both. 
Exclusive-or is the intended meaning. 


Inclusive-or: The meeting will be canceled if fewer than 10 
persons sign up or at least 3 inches of snow falls or both. 
Exclusive-or: The meeting will be canceled if fewer than 10 
persons sign up or at least 3 inches of snow falls but not both. 
Inclusive-or is the intended meaning. 


No, assuming the interpretation: It shall be unlawful for any 
person to keep more than three [3] dogs and more than three 
[3] cats upon his property within the city. A judge ruled that 
the ordinance was “vague.” Presumably, the intended meaning 
was: “It shall be unlawful for any person to keep more than 


636 


76. 
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three [3] dogs or more than three [3] cats upon his property 
within the city.” 


“national park” “north dakota” OR “south dakota” 


Section 1.3 Review 


1. 


If p and q are propositions, the conditional proposition is the 
proposition if p then q. It is denoted p > q. 


P 4 pq 
Tr Tf T 
T F F 
F T T 
F F T 


3. In the conditional proposition p — q, p is the hypothesis. 


4. In the conditional proposition p — gq, q is the conclusion. 


5. In the conditional proposition p—q, q is a necessary 


10. 


11. 
12. 


condition. 


. In the conditional proposition p—q, p is a sufficient 


condition. 


. The converse of p > q is q > p. 


. If p and q are propositions, the biconditional proposition is 


the proposition p if and only if g. It is denoted p <= q. 


P 4 pq 
TY FT T 
T &F F 
F T F 
FF T 


If the propositions P and Q are made up of the propositions 
Pi.-+-,Pn, P and Q are logically equivalent provided that 
given any truth values of p1,..., Pn, either P and Q are both 
true or P and Q are both false. 


PY Q) =p AG, “(PA 4) = pv 4 
The contrapositive of p > q is =q > =p. 


Section 1.3 


1. 


14. 
23. 


If Joey studies hard, then he will pass the discrete mathematics 
exam. 


. If Katrina passes discrete mathematics, then she will take the 


algorithms course. 


. If you inspect the aircraft, then you have the proper security 


clearance. 


. If the program is readable, then it is well structured. 


. (For Exercise 1) If Joey passes the discrete mathematics exam, 


then he studied hard. 
True 17. False 20. False 
True 26. True 29. True 


32. 
39. 
46. 
47. 
50. 
53. 
56. 


59. 


62. 


63. 
69. 
73. 
76. 


77. 


80. 


True 33. True 36. False 
True 42. True 43. p> q 
qo (DAT) 

p>4 

Go PAr) 

If today is Monday, then it is raining. 


It is not the case that today is Monday or it is raining if and 
only if it is hot. 

Let p: 4 < 6andq:9 > 12. 

Given statement: p — q; false. 

Converse: q > p; if 9 > 12, then 4 < 6; true. 
Contrapositive: ~qg + —p; if 9 < 12, then 4 > 6; false. 
Let p: |4| < 3 andqg: —3 <4 <3. 

Given statement: g — p; true. 


Converse: p — q; if |4| < 3, then —3 < 4 < 3; true. 


Contrapositive: 

=p > 74, if |4| > 3, then —3 > 4 or 4 > 3; true. 
P#4Q 66. PHO 

P#Q 72. PHO 


Pat will not use the treadmill and will not lift weights. 


To make chili, you do not need red pepper or you do not need 
onions. 


P 4 pimplq qimplp 
ToT T T 
T F F F 
FT F F 
F F T T 


Since pimplq is true precisely when qgimp/p is true, 
pimpl q = qimp!p. 


P q P>q “PVG 
tT “f T 7 
Tt. FE F 
F T Tr T 
F F Hd T 


Since p— q is true precisely when —p V q is true, p > q = 
PY 4. 


Section 1.4 Review 


1. 


. In the argument p1,p2,... 


. In the argument pj, p2,... 
. The argument pj, p2,... 


Deductive reasoning refers to the process of drawing a con- 
clusion from a sequence of propositions. 


:Pn/.’. q, the hypotheses are 
P1;P2;+++5Pn- 


. “Premise” in another name for hypothesis. 


: Pn/.’. q, the conclusion is q. 


»Pn/.’. q is valid provided that if p; 
and p2 and... and p,, are all true, then g must also be true. 


6. An invalid argument is an argument that is not valid. 
7. pq 8 pod 
P 74 
.q op 
9. p 10. pAq 
“PV “?P 
ll. p 12. p> q 
qd q—~>r 
“PA spr 
13. pvq 
—?P 
.q 
Section 1.4 
1. Valid Pq 4. Invalid (PVT) > 4 
P q 
Pan’ j p> 5 
6. Valid (> S)AQ> 1) 
PY 
SVS 
9. Valid p > (rvs) 
q->r 
=s 
P 
JrVG 


10. 


13. 


15. 


18. 


20. 
26. 


27. 
30. 


Valid. If 4 megabytes is better than no memory at all, then 
we will buy a new computer. If 4 megabytes is better than no 
memory at all, then we will buy more memory. Therefore, if 
4 megabytes is better than no memory at all, then we will buy 
a new computer and we will buy more memory. 

Invalid. If we will not buy anew computer, then 4 megabytes is 
not better than no memory at all. We will buy a new computer. 
Therefore, 4 megabytes is better than no memory at all. 
Valid. If the for loop is faulty or the while loop is faulty, then 
the hardware is unreliable or the output is correct. The for loop 
is faulty. The hardware is not unreliable. Therefore the output 
is correct. 

Valid. If the for loop is faulty, then the while loop is faulty. If 
the while loop is faulty, then the hardware is unreliable. The 
hardware is not unreliable. If the output is correct, then the 
hardware is unreliable. Therefore the output is not correct. 
Invalid 23. Invalid 

An analysis of the argument must take into account the fact 
that “nothing” is being used in two very different ways. 
Addition 

Let p denote the proposition “there is gas in the car,” let ¢ 
denote the proposition “I go to the store,” and let r denote the 
proposition “I get a soda.” Then the hypotheses are as follows: 


Pq 
q->r 
P 


From p — q and q — r, we may use the hypothetical syl- 
logism to conclude p > r. From p > r and p, we may use 
modus ponens to conclude r. Since r represents the propo- 


33. 


36. 
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sition “I get a soda,” we conclude that the conclusion does 
follow from the hypotheses. 


We construct a truth table for all the propositions involved: 


P q p>@ q ‘P 
T ah T F F 
YT F F T FE 
F i T F i 
FE F T T T 


We observe that whenever the hypotheses p — q and 7g are 
true, the conclusion —p is also true; therefore, the argument is 
valid. 


We construct a truth table for all the propositions involved: 


P q PA 
T T T 
T F F 
F r iE 
FF F F 


We observe that whenever the hypotheses p and gq are true, 
the conclusion p A q is also true; therefore, the argument is 
valid. 


Section 1.5 Review 


1 


10. 


If P(x) is a statement involving the variable x, we call P a 
propositional function if for each x in the domain of discourse, 
P(x) is a proposition. 


. A domain of discourse for a propositional function P is a set 


D such that P(x) is defined for every x in D. 


. A universally quantified statement is a statement of the form 


for all x in the domain of discourse, P(x). 


. A counterexample to the statement Vx P(x) is a value of x for 


which P(x) is false. 


. Anexistentially quantified statement is a statement of the form 


for some x in the domain of discourse, P(x). 


. (Vx P(x)) and Ax—P(x) have the same truth values. 


—=(Ax P(x)) and Vx —P(x) have the same truth values. 


. To prove that the universally quantified statement Vx P(x) is 


true, show that for every x in the domain of discourse, the 
proposition P(x) is true. 


. To prove that the existentially quantified statement 4x P(x) is 


true, find one value of x in the domain of discourse for which 
the proposition P(x) is true. 


. To prove that the universally quantified statement Vx P(x) is 


false, find one value of x in the domain of discourse for which 
the proposition P(x) is false. 


To prove that the existentially quantified statement 4x P(x) is 
false, show that for every x in the domain of discourse, the 
proposition P(x) is false. 
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11. 


13. 


Hints and Solutions to Selected Exercises 


VxP(x) 12. P(d) for everyd € D 
~,P(d)ifdeD Vx P(x) 
dxP(x) 14. P(d) for some d € D 


.. P(d) for some d € D J. dx P(x) 


Section 1.5 


1. 


64. 


67. 


70. 


. True 
« P(1) A P(2) A P(3) A P(A) 
. P11) Vv P(2) v P(3) v P(A) 
« P(2) A P(3) A P(A) 


. Every student is taking a math course. 


Is a propositional function. The domain of discourse could be 
taken to be all integers. 


. Is a propositional function. The domain of discourse is the set 


of all movies. 


. 11 divides 77. True. 
. For every positive integer n, n divides 77. False. 


. There exists n such that n does not divide 77. True. 


19. False 22. False 


. Some student is not taking a math course. 


. (For Exercise 32) 4x —P(x). Some student is not taking a math 


course. 


. Every professional athlete plays soccer. False. 


. Either someone does not play soccer or some soccer player is 


a professional athlete. True. 


. Everyone is a professional athlete and plays soccer. False. 


. (For Exercise 39) 4x(P(x) A =Q(x)). Someone is a profes- 


sional athlete and does not play soccer. 


» Vx(P(x) > Q(x)) 
+ dx(P(x) A Q(x)) 
. (For Exercise 48) 4x(P(x) A —Q(x)). Some accountant does 


not own a Porsche. 


. False. A counterexample is x = 0. 
. True. The value x = 2 makes (x > 1) > (x? > x) true. 
. (For Exercise 53) 4x(x? < x). There exists x such that x2 < x. 


. The literal meaning is: No man cheats on his wife. The in- 


tended meaning is: Some man does not cheat on his wife. Let 
P(x) denote the statement “x is a man,” and Q(x) denote the 
statement “x cheats on his wife.” Symbolically, the clarified 
statement is dx(P(x) A =Q(a)). 


The literal meaning is: No environmental problem is a tragedy. 
The intended meaning is: Some environmental problem is not 
a tragedy. Let P(x) denote the statement “x is an environmen- 
tal problem,” and Q(x) denote the statement ‘‘x is a tragedy.” 
Symbolically, the clarified statement is 3x(P(x) A =Q(x)). 


The literal meaning is: Everything is not sweetness and light. 
The intended meaning is: Not everything is sweetness and 
light. Let P(x) denote the statement “‘x is sweetness and light.” 
Symbolically, the clarified statement is 3x —P(x). 


The literal meaning is: No circumstance is right for a formal 
investigation. The intended meaning is: Some circumstance is 


72. 


75. 
76. 


79. 


not right for a formal investigation. Let P(x) denote the state- 
ment “‘x is a circumstance,” and Q(x) denote the statement “x 
is right for a formal investigation.” Symbolically, the clarified 
statement is dx(P(x) A =Q(a)). 


(a) 
P @ p> qd GP 
T T T T 
T F F T 
F T r EF 
FF T Al 


One of p > q or q > pis true since in each row, one of 
the last two entries is true. 


(b) The statement, “All integers are positive or all positive 
numbers are integers,” which is false, in symbols is 


(Wx((x) > P(x))) V (Wx(P(x) > I(x))). 
This is not the same as the given statement 
Vx((U(x) > P(x) Vv (P(x) > I@))), 


which is true. The ambiguity results from attempting to 
distribute V across the or. 


Universal instantiation 


Let P(x) denote the propositional function “x has a graphing 
calculator,” and let Q(x) denote the propositional function “x 
understands the trigonometric functions.” The hypotheses are 
Vx P(x) and Vx(P(x) > Q(x)). By universal instantiation, we 
have P(Ralphie) and P(Ralphie) — Q(Ralphie). The modus 
ponens rule of inference now gives Q(Ralphie), which repre- 
sents the proposition “Ralphie understands the trigonometric 
functions.” We conclude that the conclusion does follow from 
the hypotheses. 


By definition, the proposition Vx P(x) is true when P(x) is 
true for all x in the domain of discourse. We are given that 
P(d) is true for any d in the domain of discourse D. Therefore, 
Vx P(x) is true. 


Section 1.6 Review 


1. 


For every x and for every y, P(x, y). Let the domain of dis- 
course be X x Y. The statement is true if, for every x € X and 
for every y € Y, P(x, y) is true. The statement is false if there 
is at least one x € X and at least one y € Y such that P(x, y) is 
false. 


. For every x, there exists y such that P(x, y). Let the domain of 


discourse be X x Y. The statement is true if, for every x € X, 
there is at least one y € Y for which P(x, y) is true. The state- 
ment is false if there is at least one x € X such that P(x, y) is 
false for every y € Y. 


. There exists x such that for every y, P(x, y). Let the domain of 


discourse be X x Y. The statement is true if there is at least one 
x € X such that P(x, y) is true for every y € Y. The statement 
is false if, for every x € X, there is at least one y € Y such that 
P(x, y) is false. 


Cenrnntn 


. There exists x and there exists y such that P(x, y). Let the do- 


main of discourse be X x Y. The statement is true if there is 
at least one x € X and at least one y € Y such that P(x, y) is 
true. The statement is false if, for every x € X and for every 
y €Y, P(x, y) is false. 


. Let P(x, y) be the propositional function “x < y” with domain 


of discourse Z x Z. Then VxdyP(x, y) is true since, for every 
integer x, there exists an integer y (e.g., y = x) such thatx < y 
is true. On the other hand, 4xVyP(x, y) is false. For every inte- 
ger x, there exists an integer y (e.g., y = x— 1) such thatx < y 
is false. 


. dxvdy-P(x, y) 
» dxVy7P(x, y) 
» WxdyP(x, y) 
» WxVyaP(x, y) 


. Given a quantified propositional function, you and your op- 


ponent, whom we call Farley, play a logic game. Your goal 
is to try to make the propositional function true, and Far- 
ley’s goal is to try to make it false. The game begins with 
the first (left) quantifier. If the quantifier is V, Farley chooses 
a value for that variable; if the quantifier is 3, you choose a 
value for that variable. The game continues with the second 
quantifier. After values are chosen for all the variables, if the 
propositional function is true, you win; if it is false, Farley 
wins. If you can always win regardless of how Farley chooses 
values for the variables, the quantified propositional function 
is true, but if Farley can choose values for the variables so 
that you cannot win, the quantified propositional function is 
false. 


Section 1.6 


1. Everyone is taller than everyone. 


4. Someone is taller than someone. 


5. (For Exercise 1) In symbols: 4x4y—7} (x, y). In words: Some- 


12. 


15. 


18. 


one is not taller than someone. 


. (Exercise 1) False. Garth is not taller than Garth. (Exercise 2) 


False. Erin is taller than no one. (Exercise 3) False. No one is 
taller than everyone (himself, in particular). (Exercise 4) True. 
Garth is taller than Erin. 


. (Exercise 1) False. Garth is not taller than Pat. (Exercise 2) 


False. Garth is not taller than Pat, Sandy, or Gale. (Exercise 3) 
True. Marty is taller than Pat, Sandy, and Gale. (Exercise 4) 
True. Marty is taller than Pat. 


(Exercise 1) False. Dale is not taller than Dale. (Exercise 2) 
False. Erin is not taller than Dale. (Exercise 3) False. No one is 
taller than Dale. (Exercise 4) False. No one is taller than Dale. 


(Exercise 1) False. Dale is not taller than Dale. (Exercise 2) 
True. Dale is taller than Garth. (Exercise 3) False. Dale is not 
taller than Dale. (Exercise 4) True. Dale is taller than Garth. 


(Exercise 1) False. Pat is not taller than Marty. (Exercise 2) 
True. Pat, Sandy, and Gale are all taller than Erin. (Exercise 3) 
False. Marty is taller than Pat, Sandy, and Gale. (Exercise 4) 
True. Pat is taller than Erin. 


21. 


22. 
25. 
26. 


27. 


28. 


31. 


32. 


33. 


34. 
37. 


38. 


39. 
42. 
43. 
47. 
48. 
51. 
54. 
57. 
60. 
63. 


66. 
69. 


72. 
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(Exercise 1) False. Pat is not taller than Pat. (Exercise 2) False. 
Pat is not taller than everyone. (Exercise 3) Pat is not taller 
than Pat; Sandy is not taller than Pat; and Gale is not taller 
than Pat. (Exercise 4) False. None of Pat, Sandy, or Gale is 
taller than everyone. 

Everyone is taller than or the same height as everyone. 
Someone is taller than or the same height as someone. 


(For Exercise 22) In symbols: Sx4y>7? (x, y). In words: Some- 
one is shorter than someone. 

(For Exercises 22, 6, and 1) False. Erin is not taller than or the 
same height as Garth. 

For any two people, if they are distinct, the first is taller than 
the second. 


There are two distinct people and the first is taller than the 
second. 


(For Exercise 28) In symbols: 4x4y—73(x, y). In words: There 
are two distinct people and the first is shorter than or the same 
height as the second. 


(For Exercises 28, 6, and 1) False. Erin and Garth are distinct 
persons, but Erin is not taller than Garth. 


AxVyL(x, y). True (think of a saint). 


VxdyL(x, y). True (according to Dean Martin’s song, “Every- 
body Loves Somebody Sometime’’). 


(For Exercise 34) Everyone does not love someone. 
Vx3y-L(x, y) 


dy A( Brit, y) 


Vyax A(x, y) 
False 46. True 
(For Exercise 43) Sx4dy—P(x, y) or Axdy(x < y) 


False. A counterexample is x = 2, y = 0. 

True. Take x = y = 0. 

False. A counterexample is x = y = 2. 

True. Take x = 1, y= a/ 8: 

True. Take x = 0. Then for all y, x7 + y* > 0. 

True. For any x, if we set y = x — 1, the conditional propo- 
sition, if x < y, then x? < y’, is true because the hypothesis is 
false. 

(For Exercise 48) Axdy (x? >y+t+1) 

(For Exercise 48) Since both quantifiers are V, Farley chooses 
values for both x and y. Since Farley can choose values that 
make x* < y+ 1 false (e.g., x = 2, y = 0), Farley can win the 
game. Therefore, the proposition is false. 

Since the first two quantifiers are V, Farley chooses values 
for both x and y. The last quantifier is 4, so you choose a 
value for z. Farley can choose values (e.g.,x = l,y = 2) 
so that no matter which value you choose for z, the 
expression 


(x<y) > (Z>XAGR<y)) 


is false. Since Farley can choose values for the variables so 
that you cannot win, the quantified statement is false. 
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74. 


77. 


80. 


83. 


86. 


89. 


92. 


95. 


98. 


99. 
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Vx dy P(x, y) must be true. Since Vx Vy P(x, y) is true, regard- 
less of which value of x is selected, P(x, y) is true for all y. 
Thus for any x, P(x, y) is true for any particular y. 


Vx Vy P(x, y) might be false. Let P(x, y) denote the expression 
x < y. If the domain of discourse is Z* x Z*, Ax Vy P(x, y) is 
true; however, Vx Vy P(x, y) is false. 
Vx Vy P(x, y) might be false. Let P(x, y) denote the expression 
x < y. If the domain of discourse is Z* x Z*, dx dy P(x, y) is 
true; however, Vx Vy P(x, y) is false. 


Vx dy P(x, y) might be true. Let P(x, y) denote the expression 
x < y. If the domain of discourse is Z* x Z*, Wx dy P(x, y) is 
true; however, Vx Vy P(x, y) is false. 


Vx Vy P(x, y) must be false. Since Vx dy P(x, y) is false, there 
exists x, say x = x’, such that for all y, P(x, y) is false. Choose 
y = y’ in the domain of discourse. Then P(x’, y’) is false. 
Therefore Vx Vy P(x, y) is false. 

Vx Vy P(x, y) must be false. Since 4x Vy P(x, y) is false, for ev- 
ery x there exists y such that P(x, y) is false. Choose x = x’ in 
the domain of discourse. For this choice of x, there exists y = 
y’ such that P(x’, y’) is false. Therefore Vx Vy P(x, y) is false. 


Vx Vy P(x, y) must be false. Since 4x dy P(x, y) is false, for 
every x and for every y, P(x, y) is false. Choose x = x’ and 
y = y’ in the domain of discourse. For these choices of x and 
y, P(x’, y’) is false. Therefore Vx Vy P(x, y) is false. 


dx (Vy P(x, y)) is not logically equivalent to —(Vx dy 
P(x, y)). Let P(x, y) denote the expression x<y. If the 
domain of discourse is Z x Z, 4x (Vy P(x, y)) is true; how- 
ever, — (Wx dy P(x, y)) is false. 

dxdy-P(x,y) is not logically equivalent to —(Vxdy 
P(x, y)). Let P(x, y) denote the expression x <y. If the do- 
main of discourse is Z x Z, 4x dy P(x, y) is true; however, 
= (Vx dy P(x, y)) is false. 


Ve > 0456 > OVx((0 < |x—a| < 5) > (|f() —L| < €)) 


Chapter 1 Self-Test 


1. 
2. 
3. 


Section 1.1] @ 
Section 1.2] False 


Section 1.3] If Leah gets an A in discrete mathematics, then 
Leah studies hard. 


Section 1.3] Converse: If Leah studies hard, then Leah gets 
an A in discrete mathematics. Contrapositive: If Leah does 
not study hard, then Leah does not get an A in discrete math- 
ematics. 


Section 1.1]A CB 
Section 1.1] Yes 


Section 1.4] The argument is invalid. If p and r are true and 
q is false, the hypotheses are true, but the conclusion is false. 


Section 1.3] True 
Section 1.4] Let 


p: The Skyscrapers win. 
q: Tlleat my hat. 
r: Vl be quite full. 


10. 


11. 


12. 
13. 
14. 


15. 


16. 


17. 
18. 


19. 


20. 
21. 


Then the argument symbolically is 

p>4 

q->r 

o> p 
The argument is invalid. If p and g are false and r is true, the 
hypotheses are true, but the conclusion is false. 


Section 1.5] The statement is not a proposition. The truth 
value cannot be determined without knowing what “the team” 
refers to. 

Section 1.5] The statement is a propositional function. When 
we substitute a particular team for the variable “team,” the 
statement becomes a proposition. 


Section 1.6] axVy-K (x, y) 
[Section 1.6] VxdyK (x, y); everybody knows somebody. 


[Section 1.1] Since |AJ = 3 and |P(A)| = 23 = 8, 
P(A) x Al = 8-3 = 24. 
[Section 1.2] 
p q r 7(pAQ Vv (py 7r) 
T T T T 
T T F T 
T F T T 
i F F aT 
F T T T 
F T F T 
F F T iT 
F F F 7 


Section 1.2] I take hotel management and either I do not take 
recreation supervision or I take popular culture. 


Section 1.2] p V (¢ A 71) 


Section 1.5] For all positive integers n, n and n+ 2 are prime. 
The proposition is false. A counterexample is n = 7. 


Section 1.5] For some positive integer n,n and n+ 2 are 
prime. The proposition is true. For example, ifn = 5,n and 
n-+ 2 are prime. 

Section 1.4] Hypothetical syllogism 

Section 1.4] Let 


p: The Council approves the funds. 

q: New Atlantic gets the Olympic Games. 
r: New Atlantic builds a new stadium. 

s: The Olympic Games are canceled. 


Then the argument symbolically is 
p> 4 
q->r 
a 


apVs 


From p — gq and q — r, we may use the hypothetical syllo- 
gism to conclude p —> r. From p — rand —r, we may use 


22. 


23. 


24. 


the modus tollens to conclude =p. We may then use addition 
to conclude =p V s. 


[Section 1.6] The statement is true. For every x, there exists y, 
namely the cube root of x, such that x = y*. In words: Every 
real number has a cube root. 


[Section 1.6] 
(xd yVzP(x, y, 2)) = dxa(AyVzP(x, y, 2) 
= Invy7(VzP(, y, z)) 
= AxVydz7P(, y, z) 


[Section 1.3] (=r V q) > 7q 


Section 2.1 Review 


1. 


A mathematical system consists of axioms, definitions, and 
undefined terms. 


2. An axiom is a proposition that is assumed to be true. 


. A definition creates a new concept in terms of existing ones. 


. An undefined term is a term that is not explicitly defined but 


rather is implicitly defined by the axioms. 


5. A theorem is a proposition that has been proved to be true. 


6. A proof is an argument that establishes the truth of a 


11. 


12. 


theorem. 


. A lemma is a theorem that is usually not too interesting in its 


own right but is useful in proving another theorem. 


. A direct proof assumes that the hypotheses are true and then, 


using the hypotheses as well as other axioms, definitions, and 
previously derived theorems, shows directly that the conclu- 
sion is true. 


. An integer n is even if there exists an integer k such that 


n = 2k. 


. An integer n is odd if there exists an integer k such that n = 


2k +1. 
Within a proof, a proof of an auxiliary result is called a 
subproof. 


To disprove the universally quantified statement VxP(x), we 
need to find one member x in the domain of discourse that 
makes P(x) false. 


Section 2.1 


1. 


10. 


If three points are not collinear, then there is exactly one plane 
that contains them. 


. If x is a nonnegative real number and n is a positive integer, 


x'/" is the nonnegative number y satisfying y” = x. 


. Let m and n be even integers. Then there exist k; and kz such 


that m = 2k; andn = 2k». Now 
m+n = 2k; + 2kp = 2(ky + ko). 


Therefore m + n is even. 


Let m and n be odd integers. Then there exist kj and kz such 
that m = 2k; + 1 andn = 2k) + 1. Now 


13. 


16. 


18. 


21. 


24. 


27. 


30. 


33. 


36. 


39. 


42. 


45. 


46. 
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mn = (2k, 4 


1) = 4kyko + 2k; 4 
= 2(2kjko + ky 


1) (2ka 4 


> 

3 
— 

_ 


Therefore mn is odd. 
Let x and y be rational numbers. Then there exist integers 
m,, Ny, M2, Nz such that x = m,/n, and y = m2/n2. Now 

my | m 


x+y= t = 
ny nz 


mn2 +mn, 
mn. 


Since mjnz + mzn, and njnz are integers, x + y is a rational 
number. 


If m = 3k, + 1 andn = 3k, + 1, then 
mn = (3k, + 1)(k2 + 1) = 9k ko + 3k) +3ko +1 
= 3(3k{ko +ki +ko) +1. 
From the definition of max, it follows that d > d, andd > db. 


Fromx > dandd > d, we may derive x > d; froma previous 
theorem (the second theorem of Example 2.1.5). From x > d 
and d > dz, we may derive x > d2 from the same previous 
theorem. Therefore, x > d, and x > do. 


Let x € XO Y. From the definition of “intersection,” we con- 
clude that x € X. Therefore XN Y C X. 


Let x € XM Z. From the definition of “intersection,” we con- 
clude that x € X andx € Z. Since X C Yandx € X,xe Y. 
Since x € Y and x € Z, from the definition of “intersection,” 
we conclude that x € YM Z. Therefore XN Z C YNZ. 


Let x € Y. From the definition of “union,” we conclude that 
xeXUY. Since X UY = X UZ, x € X UZ. From the defini- 
tion of “union,” we conclude that x € X or x € Z. If x € Z, we 
conclude that Y C Z. If x € X, from the definition of “‘inter- 
section,’ we conclude that x € XN Y. Since XN Y = XNZ, 
x € XM Z. Therefore x € Z, and again Y C Z. 

The argument that Z C Y is the same as that for Y C Z 
with the roles of Y and Z reversed. Thus Y = Z. 


Since X € P(X), X € P(Y). Therefore X C Y. 


False. If X = {1,2} and Y = {2, 3}, then X is not a subset of 
Y since 1 € X, but | ¢ Y. Also, Y is not a subset of X since 
3€Y,but3 ¢X. 


False. Let X = {l,a}, Y = {1,2,3}, and Z = {3}. Then 
Y—Z={l,2} and (X UY) — (X UZ) = {2}. 
False. Let X = Y = {1} and U = {1,2}. Then XN Y = {2}, 
which is not a subset of X. 
False. Let X = Y = {1} and U = {1,2}. ThenX¥ x Y = 
{(1, 2), (2, 1), (2, 2)} and X x Y = {(2, 2)}. 
False. Let X = {1,2}, Y = {1}, and Z = {2}. Then 
XN(Y x Z) = Sand (XN Y) x (XN Z) = {(1, 2)}. 
We prove only (AUB)UC = AU (BUC). Letx € (AUB)UC. 
Then x € AUBorx € C.Ifx € AUB, then x € Aor 
x € B. Thusx € Aorx € Borx € C.Ifx ée A, then 
x€AU(BUC).Ifx € Borx € C, thenx € BUC. Again, 
xE€AU(BUC). 

Now suppose that x € AU (BUC). Then x € A or 
xéBUC.Ifx e BUC,thenx € Borx e C.ThusxeA 
orx € Borx € C.Ifx € Aorx € B, thenx € AUB. Thus 
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x € (AUB) UC. Ifx € C, again x € (AU B) UC. Therefore 
(AUB)UC=AU(BUC). 
We prove only AU @ = A. Letx e AUS. Then x € A or 
xe SO. Butx¢d O,soxeAd. 

Now suppose that x € A. Then x € AU ©. Therefore 
AUS=A., 


We prove only A U U = U. By definition, any set is a subset 
of the universal set, so A UU C U. 

If x € U, thenx €e AUU. Thus U CAUU. Therefore 
AUU=U. 


We prove only @ = U. By definition, any set is a subset of the 
universal set, so @ C U. 

Now suppose that x € U. Thenx ¢ © (by the definition 
of “empty set”). Thus x € @ and U C @. Therefore @ = U. 


Letx € AAB. Thenx € AUBandx ¢ ANB. Since x € AUB, 
x €Aorxe B.Sincex €ANB,x ¢Aorx ¢ B.Ifx € A, then 
x € B. Thus x € A—B; hence x € (A— B) U(B—A). Ifx € B, 
then x ¢ A. Thus x € B—A and again x € (A — B) U(B—A). 
Therefore A A B C (A — B) U(B—A). 

Now suppose that x € (A—B)U(B—A). Thenx € A—B 
orx € B—A.Ifx € A—B, thenx € Aandx ¢ B. Thusx € AUB 
and x ¢ AM B. Therefore x € (AUB) — (ANB) =AAB. 
Ifx ¢€ B—A,thenx € Bandx ¢ A. Thenx € AUB 
andx ¢ AN B. Againx € (AUB)—- (ANB) =AAB. 
Therefore (A — B) U(B—A) CA AB. We have proved that 
(A —B)U(B—A)=AAB. 

False. Let A = {1, 2,3}, B = {2,3,4}, and C = {1, 2, 4}. 
Then A A (BUC) = {4} and (A AB) U(A AC) = {1, 4}U 
{3, 4} = {1, 3, 4}. 

True. Using Example 2.1.11,, we find that 


AN (BAC) =AN[(BUC)—- (BN O)] 
= [AN (BUC)] —- [AN (BN O)]. 


Using the distributive law and observing that (A N B) 
(ANC) =AN (BNC), we find that 


(ANB) A (ANC) = [(ANB)U(ANOC)]—[LANB)N(ANC)] 


= [AN (BUC) —-[AN (BN O)]. 
Therefore 


AN(BAC) = (ANB) A(ANO). 


Section 2.2 Review 


1. 


A proof by contradiction assumes that the hypotheses are 
true and that the conclusion is false and then, using the 
hypotheses and the negated conclusion as well as other ax- 
ioms, definitions, and previously derived theorems, derives a 
contradiction. 


. Example 2.2.1 


3. “Indirect proof” is another name for proof by contradiction. 


4. To prove p — gq, proof by contrapositive proves the equivalent 


statement -q — —p. 


. Example 2.2.5 


13. 
14. 


15. 


. Instead of proving 


(Pi Vv p2V +++ VY Dn) —> q, 
in proof by cases, we prove 


(pi > q) A (p2 > GQ) A+++ A (Pn > Q)- 


. Example 2.2.6 


. Proof of equivalence shows that two or more statements are 


all true or all false. 


. Example 2.2.11 


. If the statements are p, g, and r, we can show that they are 


equivalent by proving that p > g, gq > r, andr —> pare all 
true. 


. A proof of 4x P(x) is called an existence proof. 


. An existence proof of 4x P(x) that exhibits an element a of the 


domain of discourse that makes P(a) true is called a construc- 
tive proof. 

Example 2.2.12 

A proof of 4x P(x) that does not exhibit an element a of the 
domain of discourse that makes P(a) true, but rather proves 
dx P(x) some other way (e.g., using proof by contradiction), is 
called a nonconstructive proof. 


Example 2.2.14 


Section 2.2 


1. 


10. 


13. 


Suppose, by way of contradiction, that x is rational. Then there 
exist integers p and q such that x = p/g. Now x? = p?/q? is 
rational, which is a contradiction. 


. Suppose, by way of contradiction, that n is even. Then there 


exists k such that n = 2k. Now n? = 2(2k?); thus n2 is even, 
which is a contradiction. 


. Suppose, by way of contradiction, that \/2 is rational. Then 


there exist integers p and q such that \/2 = p/q. We assume 
that the fraction p/g is in lowest terms so that p and q are not 
both even. Cubing ./2 = p/q gives 2 = p>/q°, and multiply- 
ing by q° gives 2q* = p’. It follows that p* is even. An argu- 
ment like that in Example 2.2.1 shows that p is even. There- 
fore, there exists an integer k such that p = 2k. Substituting 
p = 2k into 2q? = p? gives 2q> = (2k)? = 8k. Canceling 2 
gives q> = 4k. Therefore q° is even and, thus, g is even. Thus 
p and gq are both even, which contradicts our assumption that 
p and q are not both even. Therefore, J/2 is irrational. 

Since the integers increase without bound, there exists n € Z 
such that 1/(b — a) < n. Therefore 1/n < b — a. Choose 
m € Zas large as possible satisfying m/n < a. Then, by the 
choice of m, a < (m+ 1)/n. Also 


1 1 
aE a4 <a+(b-—a)=b. 
n non 
Therefore x = (m+ 1)/n is a rational number satisfying 
a<x<b. 
True. Let a = b = 2. Then a and D are rational numbers 


and a’ = 4 is also rational. This is a constructive existence 
proof. 


16. 


19. 


22. 


25. 


27. 


A team can score any number of points greater than or equal 
to two. To prove this, let n be an integer greater than or equal 
to 2. If n is even, say n = 2k, if the team scores k safeties, 
its total score is 2k = n. If n is odd, say n = 2k + 1, if the 
team scores one field goal and k — 1 safeties, its total score is 
3+2(k—1) = 2k+1 =n. (Note that k > 1, so the prescribed 
scoring is possible.) 

True. We give a proof by contradiction. Suppose that (X— Y)N 
(Y —X) is nonempty. Then there exists x € (X —Y)N(Y —X). 
Thus x € X —Yandx € Y—X.Sincex € X-—Y,x eX 
and x ¢ Y. Since x € Y—X,x € Y andx ¢ X. We now 
have x € X and x ¢ X, which is a contradiction. Therefore 
(X-—Y)N(Y-X)=2. 

Suppose, by way of contradiction, that no two bags contain 
the same number of coins. Suppose that we arrange the bags 
in increasing order of the number of coins that they contain. 
Then the first bag contains at least one coin; the second bag 
contains at least two coins; and so on. Thus the total number 
of coins is at least 


14243+---4+9=45. 


This contradicts the hypothesis that there are 40 coins. Thus 
if 40 coins are distributed among nine bags so that each bag 
contains at least one coin, at least two bags contain the same 
number of coins. 
Let i be the greatest integer for which s; is positive. Since sj 
is positive and the set of indexes 1, 2,..., 7 is finite, such an 
i exists. Since s, is negative, i < n. Now s;+1 is equal to ei- 
ther s; + 1 or s; — 1. If 541 = 5; + 1, then 5,4) is a positive 
integer (since s; is a positive integer). This contradicts the fact 
that 7 is the greatest integer for which s; is positive. Therefore, 
Sit1 = 5; — 1. Again, if s; — 1 is a positive integer, we have 
contradiction. Therefore, s;,; = s; — 1 = 0. 
We use proof by contradiction and assume the negation of the 
conclusion 

adi(s; < A). 


By the generalized De Morgan’s laws for logic, this latter state- 
ment is equivalent to 


Vi(s; > A). 


Thus we assume 
sp >A 
52 >A 


Sn >A. 
Adding these inequalities yields 
Sj tsa +-+-++ 5) > NA. 
Dividing by n gives 


Sp +52 +++++ 5p 
n 


>A, 


which contradicts the hypothesis. Therefore, there exists i such 
that s; < A. 


30. 


32. 


35. 


37. 


39. 
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Since s; sj, either s; # A or s; 4 A. By changing the nota- 
tion, if necessary, we may assume that s; # A. Either s; < A 
or s; > A. If s; < A, the proof is complete; so assume that 
sj > A. We show that there exists k such that s, < A. Suppose, 
by way of contradiction, that s,, > A for all m, that is, 

sp >A 


so >A 


Sn >A. 
Adding these inequalities yields 
Sp+SQ +e +5 ++++ +5, > nA 
since s; > A. Dividing by n gives 


St +52 +--+ 5p 
nN 


>A, 


which is a contradiction. Therefore there exists k such that 
Sk <A. 


Notice that ifn > 2 andm> 1, 
2m + Sn >2m+5 > 20, 
so the only possible solution is if n = 1. However, ifn = 1, 
2m +5n? = 2m +5, 


which is odd being the sum of an even integer and an odd inte- 
ger. Thus this sum cannot equal 20. Therefore, 2m-+5n* = 20 
has no solution in positive integers. 


We claim that if m and n + 1 are consecutive integers, one is 
odd and one is even. Suppose that n is odd. Then there exists 
k such thatn = 2k+ 1. Nown+1 = 2kK+2 = 2(k+ 1), 
which is even. If 7 is even, there exists k such that n = 2k. Now 
n+1 = 2k-+1, which is odd. Since one of n andn+ 1 is even 
and the other is odd, their product is even (see Exercise 11, 
Section 2.1). 


We consider four cases: x > 0,y > 0; x < O,y > 0; 
x >0,y <0; andx <0,y <0. 

First assume that x > 0 and y > 0. Then xy > 0 and 
|xy| = xy = |x||y|. Next assume that x < 0 and y > 0. Then 
xy < Oand |xy| = —xy = (—x)(y) = |2||y|. Next assume that 
x > O and y < 0. Then xy < 0 and |xy| = —xy = (x)(—y) = 
|x||y|. Finally assume that x < 0 and y < 0. Then xy > 0 and 
Ixy| = xy = (—x)(—y) = [allyl 
We consider three cases: x > 0, x = 0, and x < 0. 

If x > 0, |x| = x and sgn(x) = 1. Therefore, 


|x] =x = 1-x = sgn(x)x. 
If x = 0, |x| = 0 and sgn(x) = 0. Therefore, 
|x] = 0 = 0-0 = sgn(x)x. 
If x < 0, |x| = —x and sgn(x) = —1. Therefore, 
|x| = —x = —1-+-x = sgn(x)x. 


In every case, we have |x| = sgn(x)x. 
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42. 


46. 


48. 


51. 


Hints and Solutions to Selected Exercises 


We consider two cases: x > y and x < y. 
Ifx>y, 

max{x, y} =x and min{x, y} = y. 
Therefore, 

max{x, y} + min{x, y} =x-+y. 

Ifx<y, 

max{x, y} =y and min{x, y} = x. 
Therefore, 


max{x, y} + min{x, y}=y+x=x+y. 
In either case, 
max{x, y} + min{x, y}=x-+y. 


We first prove that if n is even, then n + 2 is even. Assume 
that n is even. Then there exists k such that n = 2k. Now 
n+2=2k+2 = 2(k + 1) is even. 

We next prove that if m + 2 is even, then n is even. 
Assume that n + 2 is even. Then there exists k such that 
n+ 2= 2k. Nown = 2k — 2 = 2(k — 1) is even. 

We first prove that if A C B, then B C A. Assume that A C B. 
Let x € B. Then x ¢ B.If x € A, then x € B, which is not the 
case. Therefore x ¢ A. Therefore x € A. Thus BC A. 

We next prove that if B C A, then A C B. Assume that 
B CA. From the first part of the proof, we can deduce A C B. 
SinceA = AandB=B,ACB. 

We first show that if (a,b) = (c,d), then a = c and b = d. 
Assume that (a, b) = (c, d). Then 


{{a}, ta, b}} = {fc}, fe, ah}. (*) 


First suppose that a 4 b. Then the set on the left-hand side of 
equation (*) contains two distinct sets: {a} and {a, b}. Thus 
the set on the right-hand side of equation (*) also contains 
two distinct sets: {c} and {c, d}. Therefore c 4 d. (If c = d, 
{c, d} = {c, c} = {c}.) Since a 4 b and c # d, we must have 


{a} = {c} 


It follows that a = c and b = d. 
Now suppose that a = b. Then 


{{a}, {a, b}} = {fa}, ta, a}} = {fa}, {ah} = {fa}. 


Thus the set on the left-hand side of equation (*) contains one 
set. Therefore the set on the right-hand side of equation (*) 
also contains one set. We must have c = d; otherwise the set 
on the right-hand side of equation (*) would contain two dis- 
tinct sets. Thus 


and {a,b} = {c, d}. 


{{c}, fc, dh} = {{c}}. 
Equation (*) now becomes 
{{a}} = {{c}}. 


It follows that a = c and, hence, b = d. We have shown that 
if (a, b) = (c,d), thena = candb =d. 
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Ifa=candb =d, then 
(a, b) = {{a}, {a, b}} = {{c}, {c, d}} = (c,d). 


The proof is complete. 


[(a) — (b)] Assume that n is odd. Then there exists k’ such 
that n = 2k’ + 1. Since n = 2(k’ + 1) — 1, takingk =k +1, 
we have n = 2k — 1. 

[(b) — (c)] Assume that there exists k such that n = 
2k — 1. Then 


w+ = (2k—-1)* +1 = (4K? —4k +41) 41 = 202k? —2k4+1). 


Therefore n? + 1 is even. 

[(c) — (a)] We prove the contrapositive: If n is even, 
then n? + 1 is odd. 

Suppose that n is even. Then there exists k such that 
n = 2k. Now 


2 
n + 


1 = (2k)? +1 = 4k +1 = 2(2k7) +1. 


Therefore n? + 1 is odd. 


Section 2.3 Review 


1. 
2. 


3. 


PYG, —pVt/..qVvr 

A clause consists of terms separated by or’s, where each term 
is a variable or the negation of a variable. 

A proof by resolution proceeds by repeatedly applying the rule 
in Exercise | to pairs of statements to derive new statements 
until the conclusion is derived. 


Section 2.3 


1. 
P qd r PY s=pvi qvVvr 
T T T T T T 
T T F T F T 
T F T T T T 
T F F T F F 
F T T T T T 
F T F T T T 
F F T F T T 
F F F F T F 
2.1. apvqvr 
2. 7q 
3. ar 
4. apvr From | and 2 
5. =p From 3 and 4 
5. First we note that p > q is logically equivalent to sp Vv q. We 


now argue as follows: 
1. =pvq 
2. pVq 


3. q From | and 2 


7. 


(For Exercise 2) 


1. =spVqvVr_ Hypothesis 

2. 7q Hypothesis 

3. SF Hypothesis 

4. p Negation of conclusion 
5. =pvr From | and 2 

6. =p From 3 and 5 


Now 4 and 6 combine to give a contradiction. 


Section 2.4 Review 


1. 


Suppose that we have propositional function S(m) whose do- 
main of discourse is the set of positive integers. Suppose that 
S(1) is true and, for all n > 1, if S(n) is true, then S(n + 1) is 
true. Then S(n) is true for every positive integer n. 


. We first verify that $(1) is true (Basis Step). We then assume 


that S(n) is true and prove that $(n+ 1) is true (Inductive Step). 


3, n(n+ 1) 


2 


4. The geometric sum is the sum 


atar! tar? +---+ar". 


It is equal to 


a(r"t! — 1) 
r—1 
Section 2.4 
1. Basis Step 1 = 1? 
Inductive Step Assume true for n. 
L+---+Qn—-)+Qn4) =n? +2n+1=(n41) 
4, Basis Step 17 = (1-2-3)/6 


11. 


. Basis Step 


Inductive Step Assume true for n. 
1)(Qn+ 1 
Pps ages je Oe agape 
(n+ Ia + 2)(Qn + 3) 
~ 6 


1/(1-3) = 1/3 


Inductive Step Assume true for n. 
1 ; ; 1 1 
1-3) Qn—DQn+l) | Qn+DQn+3) 
n 1 n+1 
~In+1* Qn+1)\2Qn+3) 2n+3 
cos[(x/2) +2] sin(x/2) 


Basis Step cosx = sin(x/2) 


Inductive Step Assume true for n. Then 
cosx +---+cosnx + cos(n + 1)x 
cos[(x/2)(n + 1)] sin(nx/2) 


7 sin(x/2) + cos(n + 1)x. (*) 


13. 


16. 


19. 
22. 


25. 


28. 
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We must show that the right-hand side of (*) is equal to 
cos[(x/2)(n + 2)] sin[(m + 1)x/2] 
sin(x/2) , 


This is the same as showing that [after multiplying by the term 
sin(x/2)] 


x . nx we. 
cos [a + | sin 5 + cos(n + 1)xsin 3 
Ix 
2) sin 4 *) ; 
2 


1) and 6 = x/2, we must show that 


. 
= COs 5) n- 


If we let a = (x/2)(n 


cos a sin(a — B) + cos 2a sin B = cos(a + f) sina. 


This last equation can be verified by reducing each side to 
terms involving a and p. 


Basis Step 1/2 < 1/2 
Inductive Step Assume true for n. 
1-3-5---(2n—1)(2n+ 1) 2 1 2n+1 
2+4-6---(2n)(2n4+2) ~ 2n 2n+2 
2n+1 1 ss 1 
~  n- n+2~ n+2 
Basis Step(n = 4) 24= 16> 16=47 
Inductive Step Assume true for n. 
(n+ 1? =n? 4+2n41 < 2"4+2n41 
< 2"+4+2" by Exercise 15 
= gn+l 
1—,"tl 1 
Ceres a ease r’ = 
aes ° =r 7 l= 
Basis Step 7! — 1 = 6 is divisible by 6. 


Inductive Step Suppose that 6 divides 7” — 1. Now 


ql te=737" 1=7" 


1+6-7". 

Since 6 divides both 7” — | and 6-7", it divides their sum, 
which is 7"! — 1, 

Basis Step 3! +7! — 2 = 8 is divisible by 8. 

Inductive Step Suppose that 8 divides 3” + 7” — 2. Now 


antl n qe 


2= 33° +7 -2)+47" +). 


By the inductive assumption, 8 divides 3” + 7” — 2. We can 
use mathematical induction to show that 2 divides 7” + 1 for 
all n > 1| (the argument is similar to that given in the hint for 
Exercise 22). It then follows that 8 divides 4(7” + 1). Since 
8 divides both 3(3” + 7” — 2) and 4(7” + 1), it divides their 
sum, which is 3+! 4. 7"+! — 2, 


We prove the assertion using induction on n. The Basis Step 
is n = 1. In this case, there is one subset of {1} with an even 
number of elements, namely, @. Since gn-l — 2° = 1, the 
assertion is true when n = 1. 

Assume that the number of subsets of {1,..., 1} con- 
taining an even number of elements is 2”~!. We must prove 
that the number of subsets of {1, ..., 7-1} containing an even 
number of elements is 2”. 
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30. 


33. 


37. 


40. 


Hints and Solutions to Selected Exercises 


Let E),..., Eo»-1 denote the subsets of {1,2,...,n} 
containing an even number of elements. Since there are 2” 
subsets of {1,2,...,m} altogether and 2"-! contain an even 
number of elements, there are 2” — 2’—! = 2”—! subsets of 
{1,...,m} that contain an odd number of elements. Denote 
these as O1,..., Oo-1. Now E),..., Eon-1 are the subsets of 
{1,...,-+ 1} containing an even number of elements that do 
not contain n + 1, and 


O, U{n+ 1}, ..., Oo-1 U {n+ 1} 


are the subsets of {1, ..., 7-++ 1} containing an even number of 
elements that contain n + 1. Thus there are 2”~! +2”-! = 2” 
subsets of {1,...,-+ 1} that contain an even number of ele- 
ments. The Inductive Step is complete. 


At the Inductive Step when the (n+ 1)st line is added, because 
of the assumptions, the line will intersect each of the other n 
lines. Now, imagine traveling along the (n + 1)st line. Each 
time we pass through one of the original regions, it is divided 
into two regions. 


We denote the square in row i, column j by (i, /). Then, by 
symmetry, we need only consider 7 x 7 boards with squares 
(i, 7) removed where i < j < 4. The solution when square 
(1, 1) is removed is shown in the following figure. 


5x5 
3x2 


2X3 


Not all trominoes of the tiling are shown. By Exercise 36, the 
3x2 subboards have tilings. By Exercise 33, the 5 x 5 subboard 
with a corner square removed has a tiling. Essentially the same 
figure gives tilings if square (1, 2) or (2, 2) is deleted. 

A similar argument gives tilings for the remaining 
cases. 


Basis Step (7 = 1) The board is a tromino. 


Inductive Step Assume that any 2” x 2” deficient board can 
be tiled with trominoes. We must prove that any 2”+! x 2”+1 
deficient board can be tiled with trominoes. 

Givena2"t! x2"+! deficient board, we divide the board 
into four 2” x 2” subboards as shown in Figure 2.4.6. By 
the inductive assumption, we can tile the subboard contain- 
ing the missing square. The three remaining subboards form a 
2” x 2” L-shape, which may be tiled using Exercise 39. Thus, 
the 2”+! x 2"+! deficient board is tiled. The inductive step is 
complete. 


41. 


45. 


49. 


Number the squares as shown: 


Notice that each tromino covers exactly one 1, one 2, and 
one 3. Therefore, if there is a tiling, five 2’s are covered. Since 
five trominoes are required, the missing square cannot be a 2. 
Similarly the missing square cannot be a 3. 

The same argument applied to 


shows that the only possibility for the missing square is a cor- 
ner. Such a board can be tiled: 


We prove that pow = a’! is a loop invariant for the while 
loop. Just before the while loop begins executing, i = 1 and 
pow = 1, so pow = a!~!. We have proved the Basis Step. 

Assume that pow = a'~!. If i < n(so that the loop body 
executes again), pow becomes 


leaz=di, 


pow*xa= a 
and i becomes i+ 1. We have proved the Inductive Step. There- 
fore pow = a’~! is an invariant for the while loop. 


The while loop terminates when i = n + 1. Because 


pow = a’! is an invariant, at this point pow = a". 
(a) 3) =0 42; 
2+---+2n+2(n4+1) = S,+2n+2 
= (n+ 2)(n— 1) +2n+2 
= (n+ 3)n = Sn+1 : 


(b) We must have Si, = S’_, + 2n; thus 


53. 


56. 


58. 


61. 


63. 
66. 


69. 


70. 


73. 


S;, = Si_1 +2n 
SIS 3 teen 
= S)_5 +2n+2(n— 1) 


= S\_3+2n+2(n— 1) +2(n— 2) 
= Sj} +2[In+(n—1) +--+ +2] 
45 “Ss 1) | 

2 
=w+nt+C. 


If n = 2, each person throws a pie at the other and there are 
no survivors. 


The statement is false. In 


— ae —y — 


1 2 3 4 


1 and 5 are farthest apart, but neither is a survivor. 


Let x and y be points in X MN Y. Then x is in X and y is in X. 
Since X is convex, the line segment from x to y is in X. Simi- 
larly, the line segment from x to y is in Y. Therefore, the line 
segment from x to y is in XM Y. Thus, XM Y is convex. 


Let x1, ...,X, denote the n points, and let X; be the circle of 
radius | centered at x;. Apply Helly’s Theorem to X1,..., Xn. 


1 


Basis Step (i = 1) Since 2 is eliminated, 1 survives. Thus 
JQ) = 1. 

Inductive Step Assume true for i. Now suppose that 2'+! 
persons are arranged in a circle. We begin by eliminating 
2, Fy 65065 2'+1_ We then have 2! persons arranged in a cir- 
cle, and, beginning with 1, we eliminate the second person, 
then the fourth person, and so on. By the inductive assump- 
tion, 1 survives. Therefore, 


Jt!) = 724) = 1. 


The greatest power of 2 less than or equal to 100,000 is 2!°. 
Thus, in the notation of Exercise 67, n = 100,000, i = 16, and 


2' = 100,000 — 2!° = 100,000 — 65,536 = 34,464. 


f=Sn 
By Exercise 67, 


J(100,000) = J(n) = 27 + 1 = 2+34,464 + 1 = 68,929. 


by + by +++ + bn = (a2 — ay) + (a3 — a2) 
tee e+ (Qn41 — ay) 
= —a, + An41 = 4n41 — a 
since a2, ..., Gp, cancel. 
Let 


Sle 
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Then 
x 1 1 -1 
an =a an = -= i: 
7 a "“n+tlon n(n + 1) 
Let b, = Aa,. By Exercise 70, 
iin ea altace dee 
1-2 n(n + 1) 
=b, +-+-+b), 
= 4n+1 — 41 
1 —n 
~ ntl ~ n+l" 


Multiplying by —1 yields the desired formula. 


Section 2.5 Review 


1. Suppose that we have a propositional function $(”) whose do- 
main of discourse is the set of integers greater than or equal 
to no. Suppose that S(no) is true; and for all n > no, if S(k) is 
true for all k, ng < k <n, then S(n) is true. Then S(n) is true 
for every integer n > no. 

2. Every nonempty set of nonnegative integers has a least 
element. 


3. If d and n are integers, d > 0, there exist unique integers q 
(quotient) and r (remainder) satisfying n = dq+r, 0 <r <d. 


Section 2.5 


1. Basis Steps (n = 6,7) We can make six cents postage by 

using three 2-cent stamps. We can make seven cents postage 
by using one 7-cent stamp. 
Inductive Step We assume that n > 8 and postage of 
k cents or more can be achieved by using only 2-cent and 
7-cent stamps for 6 < k < n. By the inductive assumption, 
we can make postage of n — 2 cents. We may add a 2-cent 
stamp to make n cents postage. 


4. Basis Step (7 = 4) We can make four cents postage by us- 
ing two 2-cent stamps. 
Inductive Step We assume that we can make n cents 
postage, and we prove that we can make n + | cents postage. 
If among the stamps that make n cents postage there 
is at least one 5-cent stamp, we replace one 5-cent stamp by 
three 2-cent stamps to make n + | cents postage. If there are 
no 5-cent stamps among the stamps that make n cents postage, 
there are at least two 2-cent stamps (because n > 4). We re- 
place two 2-cent stamps by one 5-cent stamp to make n + | 
cents postage. 


7. Inthe Inductive Step, we must have k = [n/2]| > 3. Since this 
inequality fails for n = 4, 5, the Basis Steps are n = 3, 4, 5. 


10. co = 4, c3 = 9, c4 = 20, c5 = 29 

12. co = 2, c3 = 3,c4 = 12, c5 = 13 

15. Notice that 
co = 0 
cp =ot+3=3 
aQ=ct3= 
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17. 
20. 


23. 
27. 


Hints and Solutions to Selected Exercises 


o=cp+3=6 
4 =024+3=9. 


Thus the assertion c, < 2n fails forn = 4. 

In the Inductive Step, we must have k = |n/2]| > 3. 
Since this inequality fails for n = 4,5, the Basis Steps are 
n = 3,4, 5. In the fallacious proof, only the case n = 3 was 
proved in the Basis Steps. In fact, since the statement is false 
for n = 4, the Basis Steps n = 3, 4, 5 cannot be proved. 


q=5, r=2 


q=-l,r=2 
5 1 oi 1 1 1 


6 2°3 3°4' DD 
We may assume that p/q > 1. Choose the largest integer n 


satisfying 


4 if Se 1 op 
1 22 n” q 
(The previous Problem-Solving Corner shows that the sum 
: ae ; she ak : 
1 2 n 


is unbounded; so such an n exists.) If we obtain an equality, 
p/q is in Egyptian form, so suppose that 


Set 


Clearly, D > 0. Since n is the largest integer satisfying (+), 


pea a) 
1. 2 n 


1 1 
p=2-(T4-42) 
qd 1 n 
1 i 1 1 1 
< Basa fee 
~\d n n+l 1 n 
1 


n+1° 


1 ah. 
n+17 q 


Thus 


In particular, D < 1. By Exercise 25, D may be written in 
Egyptian form: 


where the n; are distinct. Since 


1 1 
—<D< 


i fori=1,...,k, 
Nj l+n 


n<n+1<n;fori=1,...,k. It follows that 


1,2 2085 Mig any ME 


28. 


are distinct. Thus 


is represented in Egyptian form. 


In this solution, the induction is over the set X of odd integers 
n > 5, where 3 divides n? — 1. Such an induction can be jus- 
tified by considering the first statement to be about the small- 
est integer in X, the second statement to be about the second 
smallest integer in X, and so on. 

Basis Steps (7 = 7,11) Exercise 37, Section 2.4, gives a 
solution if n = 7. 

If n = 11, enclose the missing square in a corner 7 x 7 
subboard (see the following figure). Tile this subboard using 
the result of Exercise 37, Section 2.4. Tile the two 6 x 4 sub- 
boards using the result of Exercise 36, Section 2.4. Tile the 
5 x 5 subboard with a corner square missing using the result 
of Exercise 33, Section 2.4. Thus the 11 x 11 board can be tiled. 


11 


6x4 
7X7 


11 


6x4 aes 


Inductive Step Suppose that n > 11 and assume that if 
k <n, kis odd, k > 5, and 3 divides k? — 1, thenak x k 
deficient board can be tiled with trominoes. 

Consider an n x n deficient board. Enclose the missing 
square in a corner (n — 6) x (n—6) subboard. By the inductive 
assumption, this board can be tiled with trominoes. Tile the 
two 6 x (n — 7) subboards using the result of Exercise 36, 
Section 2.4. Tile the deficient 7 x 7 subboard using the result 
of Exercise 37, Section 2.4. The n x n board is tiled, and the 
inductive step is complete. 


waoxGse “Pow? 


6% G7) ae 


31. 


36. 


Let X be a nonempty set of nonnegative integers. We must 
prove that X has a least element. Using mathematical induc- 
tion, we prove that for all n > 0, if X contains an element less 
than or equal to n, then X has a least element. Notice that this 
proves that X has a least element. (Since X is nonempty, X 
contains an integer n. Now X contains an element less than or 
equal to n; so it follow that X has a least element.) 

Basis Step (7 = 0) If X contains an element less than or 
equal to 0, then X contains 0 since X consists of nonnegative 
integers. In this case, 0 is the least element in X. 

Inductive Step Now we assume that if X contains an ele- 
ment less than or equal to n, then X has a least element. We 
must prove that if X contains an element less than or equal to 
n+ 1, then X has a least element. 

Suppose that X contains an element less than or equal 
to n + 1. We consider two cases: X contains an element less 
than or equal to n, and X does not contain an element less than 
or equal to n. If X contains an element less than or equal to n, 
by the inductive assumption, X has a least element. If X does 
not contain an element less than or equal to n, since X contains 
an element less than or equal to n + 1, X must contain n + 1, 
which is the least element in X. The inductive step is complete. 


Chapter 2 Self-Test 


1. 


[Section 2.1] Axioms are statements that are assumed to be 
true. Definitions are used to create new concepts in terms of 
existing ones. 


. [Section 2.2] In a direct proof, the negated conclusion is not 


assumed, whereas in a proof by contradiction, the negated 
conclusion is assumed. 


. [Section 2.2] Suppose that if four teams play seven games, no 


pair of teams plays at least two times; or, equivalently, if four 
teams play seven games, each pair of teams plays at most one 
time. If the teams are A, B, C, and D and each pair of teams 
plays at most one time, the most games that can be played are: 


AandB; AandC; AandD; BandC; BandD; 
C and D. 
Thus at most six games can be played. This is a contradiction. 


Therefore, if four teams play seven games, some pair of teams 
plays at least two times. 


. [Section 2.1] Since x and y are rational numbers, there exist 


integers m,, 1, m2, Nz such that x = m,/n, and y = m2/n2. 
Since y 4 0, m2 4 0. Now 


x  m/n mn2 


yo m/ng nym” 


Since x/y is the quotient of integers, it is rational. 


. [Section 2.2] We consider two cases: a < banda > b. Ineach 


of these two cases, we consider the two cases: b < candb > c. 
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First suppose that a < b. If b < c, then 


min{min{a, b}, c} = min{a, c} = a = min{a, b} 


min{a, min{b, c}}. 
If b > c, then 
min{min{a, b}, c} = min{a, c} = min{a, min{b, c}}. 
In either case, 
min{min{a, b}, c} = min{a, min{d, c}}. 
Now suppose that a > b. If b < c, then 


min{min{a, b}, c} = min{b, c} = b = min{a, b} 


= min{a, min{d, c}}. 
If b > c, then 


min{min{a, b}, c} = min{b, c} = c = min{a, c} 


= min{a, min{d, c}}. 
In either case, 
min{min{a, b}, c} = min{a, min{d, c}}. 
Therefore, for all a, b, c, 


min{min{a, b}, c} = min{a, min{d, c}}. 


In Exercises 6-9, only the Inductive Step is given. 


6. [Section 2.4]2 +4+.---+2n+ 2(n+ 1) =n(n+ 1) + 
2(n+ 1) =(n + Dn + 2) 
7. [Section 2.4] 
P44 st... + Qn)? + 2+ DP = aaa 
+ 2Q(n+ DP = 2(n + I)(n4 ae + 1) + J] 
8. [Section 2.4] 
1 2 n n+l 
21° 31°" @+D! @+2)! 
1 n+1 1 1 
(n+1)! (n+2)! (n+ 2)! 


10. 


11. 


. [Section 2.4] 


art? = 9.2" < 271 4+ n+ 12") = 24+ (n+ 1)2""! 
1+ Tt@¢b2"t}] 

14721 4 m+412"")] 

1+ (n4+2)2"+! 


A 


[Section 2.1] Suppose that m and m—n are odd. Then there ex- 
ist integers k; and kz such that m = 2k;+1 andm—n = 2k7+1. 
Now 


n=m-—(m—n) = (2k, +1) — (2ko +1) = 2(ky — kp). 


Therefore n is even. 


[Section 2.2] [(a) — (b)] We prove the contrapositive: If 
ANB F @, then A is not a subset of B. Since ANB F @, 
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12. 


13. 
14. 
15. 


16. 


17. 


18. 


19. 
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there exists x with x € A and x € B. Thus there exists x with 
x €Aand x ¢ B. Therefore A is not a subset of B. 

[(b) > (c)] If x € B, then x € AUB. Therefore 
BCAUB. 

Let x €¢ AU B. We must show that x € B. Nowx EA 
or x € B. Ifx € B, this part of the proof is complete; so sup- 
pose that x € A. Since ANB = @, x ¢ B. Thus x € Band 
AUB CB. Therefore AU B = B. 

[(c) > (a)] Letx € A. Then x € AUB. Since AUB = 
B,x € B. Therefore A C B. 


[Section 2.1] We first prove that X C Z. Let x € X. Since 
XCY,x eY. Since Y C Z,x € Z. Therefore X C Z. 

We now show that X is a proper subset of Z. Since 
Y CZ, there exists z € Z such that z ¢ Y. Now z ¢ X, because 
if it were, we would have z € Y. Therefore X C Z. 
[Section 2.5] g = 9,r =2 
[Section 2.5] c2 3, ¢4 
[Section 2.5] Basis Step (n = 1) 
Inductive Step 


8,¢e5 = 9. 
c) =0<0=1lgl1 


2, €3 


Cn = 2c\nj2j +n 
< 2|n/2} lg\n/2| +n 
< 2(n/2)1g(n/2) +n 
= n(lgn—1)+n=nIlgn 


[Section 2.5] Let X be a nonempty set of nonnegative integers 
that has an upper bound. We must show that X contains a 
largest element. 

Let Y be the set of integer upper bounds for X. By 
assumption, Y is nonempty. Since X consists of nonnega- 
tive integers, Y also consists of nonnegative integers. By the 
Well-Ordering Property, Y has a least element, say n. Since 
Y consists of upper bounds for X, k < n for every k in X. 
Suppose, by way of contradiction, that n is not in X. Then 
k <n-—1 for every k in X. Thus, n — | is an upper bound for 
X, which is a contradiction. Therefore, n is in X. Since k < n 
for every k in X, n is the largest element in X. 


[Section 2.3] 
(PY g>r=7PVaQvr 
=—7p-qvr 
= (pv r)(-qv r) 
[Section 2.3] 
(pV 7q) > 7rs = 7(p V 7q) V ars 
= 7pq V —rs 
= (pV mr) (7pVv s\(qv 7r)(GV 8) 
[Section 2.3] 
l. spvq 
2. =qV -r 
3. pVv-r 
4. =pV-r From | and 2 
5. =r From 3 and 4 


20. 


[Section 2.3] 

l. spvq 

.aqV7r 

.pVvrr 

r Negation of conclusion 
From | and 2 

From 3 and 5 


Now 4 and 6 give a contradiction. 


=pV -r 


DAR WN 


=r 


Section 3.1 Review 


1. 


Let X and Y be sets. A function f from X to Y is a subset of 
the Cartesian product X x Y having the property that for each 
x € X, there is exactly one y € Y with (x, y) € f. 


. In an arrow diagram of the function f, there is an arrow from 


itojif (i,j) ef. 


. The graph of a function f, whose domain and codomain are 


subsets of the real numbers, consists of the points in the plane 
that correspond to the elements in f. 


. A set S of points in the plane defines a function when each 


vertical line intersects at most one point of S. 


. The remainder when x is divided by y 


6. A hash function takes a data item to be stored or retrieved 


10. 


11. 


12. 


13. 


14. 


and computes the first choice for a location for the item. 


. Acollision occurs for a hash function H if H(x) = H(y) but 


xA#y. 


. When a collision occurs, a collision resolution policy deter- 


mines an alternative location for one of the data items. 


. Pseudorandom numbers are numbers that appear random 


even though they are generated by a program. 


A linear congruential random number generator uses a for- 
mula of the form 


Xn = (aX,»_1 +c) mod m. 


Given the pseudorandom number x,—1, the next pseudoran- 
dom number x, is given by the formula. A “seed” is used as 
the first pseudorandom number in the sequence. As an exam- 
ple, the formula 


Xn = (7Xn-1 + 5) mod 11 


with seed 3 gives a sequence that begins 3, 4,0,5,.... 


The floor of x is the greatest integer less than or equal to x. It 
is denoted |x]. 


The ceiling of x is the least integer greater than or equal to x. 
It is denoted [x]. 


A function f from X to Y is said to be one-to-one if for each 
y € Y, there is at most one x € X with f(x) = y. The function 
{(a, 1), (b, 3), (c, 0)} is one-to-one. If a function from X to 
Y is one-to-one, each element in Y in its arrow diagram will 
have at most one arrow pointing to it. 


A function f from X to Y is said to be onto Y if the range of f 
is Y. The function {(a, 1), (b, 3), (c, 0)} is onto {0, 1, 3}. Ifa 


15. 


16. 


17. 


18. 


19. 


function from X to Y is onto Y, each element in Y in its arrow 
diagram will have at least one arrow pointing to it. 


A bijection is a function that is one-to-one and onto. The 
function of Exercises 13 and 14 is one-to-one and onto 
{0, 1, 3}. 

If f is a one-to-one, onto function from X to Y, the inverse 
function is 


f' ={0,91@») ef} 
If f is the function of Exercises 13 and 14, we have 


f7! ={A,a), G,b), (0, 0}. 


Given the arrow diagram for a one-to-one, onto function f 
from X to Y, we can obtain the arrow diagram for f—! by 
reversing the direction of each arrow. 


Suppose that g is a function from X to Y and f is a function 
from Y to Z. The composition function from X to Z is de- 
fined as 


fog ={(x, z)| (x,y) € g and (y, z) € f for some y € Y}. 
If g = {C, 2), (2,2)} and f = {2,a}, fog = {U.a), 
(2, a)}. Given the arrow diagrams for functions g from X to 
Y andf from Y to Z, we can obtain the arrow diagram of f 0 g 
by drawing an arrow from x € X to z € Z provided that there 
are arrows from x to some y € Y and from y to z. 

A binary operator on X is a function from X x X to X. The 
addition operator + is a binary operator on the set of integers. 


A unary operator on X is a function from X to X. The minus 
operator — is a unary operator on the set of integers. 


Section 3.1 


1. 
4. 
7. 


11. 
13. 


Valid 
Invalid. The check digit should be 5. 


When 82 is transposed to 28, the sum becomes 


8+6+9+04+6454+4+8+4+24345+4+5+8+64+1 = 76 


and the check digit becomes 4. 


. Itis a function from X to Y; domain = X, range = {a, b, c}; 


it is neither one-to-one nor onto. Its arrow diagram is 


It is not a function (from X to Y). 


16. 


17. 


20. 


23. 


26. 


29. 


30. 
33. 
36. 


39. 
42. 


43. 
46. 
49. 
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€ 
a 1 1 2 


The function f is both one-to-one and onto. To prove that f is 
one-to-one, suppose that f(n) = f(m). Thenn + 1 =m+1. 
Thus n = m. Therefore f is one-to-one. 

To prove that f is onto, let m be an integer. Then 
f(m— 1) = (m— 1) +1 =m. Therefore f is onto. 
The function f is neither one-to-one nor onto. Since f(—1) = 
|-— 1] = 1 = fC), f is not one-to-one. Since f(n) > 0 for 
alln € Z, f(n) # —1 for alln € Z. Therefore f is not 
onto. 


The function f is not one-to-one, but it is onto. Since 
f2,1)=2-1=1=3-2=f(3, 2), f is not one-to-one. 
Suppose that k € Z. Then f(k, 0) = k —- 0 = k. Therefore f 
is onto. 


The function f is neither one-to-one nor onto. Since f(2, 1) = 
P+pPra +P a Ff, 2), f is not one-to-one. Since 
f (m,n) => 0 for all m,n € Z, f(m,n) 4 —1 for all m,n € Z. 
Therefore f is not onto. 

Suppose that f(a, b) = f(c, d). Then 243" = 2°34, We claim 
that a = c. If not, either a > c ora < c. We assume that 
a > c.(The argument is the same if a < c.) We may then can- 
cel 2° from both sides of 273? = 2°34 to obtain 27-°3? = 34, 
Since a — c > 0, 24~©3? is even. Since 34 is odd, we have a 
contradiction. Therefore a = c. 

We may now cancel 2? from both sides of 273? = 2°34 
to obtain 3? = 3¢. An argument like that in the preceding 
paragraph shows that b = d. Since a = c and b = d,f is 
one-to-one. 

Since f(m,n) 4 5 for all m,n € Z*, f is not onto. 
[Note that f(m, n) > 6 for all m,n € Z*.] 


f is both one-to-one and onto. 
f is both one-to-one and onto. 
Define a function f from {1, 2, 3, 4} to {a, b, c, d, e} as 


f = {C, a), 2, ¢), 3, b), 4, a}. 


Then f is one-to-one, but not onto. 


Wx1Vx2((f 1) = f2)) > (1 = x2) 

a(Vy € Yax € X(f(x*) = y)) = Ay € Ya(ax € X(f(®) = y)) 
= dye Y Vx e X7(f(x) = y) 
= dye YVx Ee X(f(x) Fy) 

f'0) =0-2/4 

£7") = 1/3) 


fog ={d,x), (2,2), 3,x)} 


652 


52. 


54. 
57. 
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63. 
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Ff of) = 212x], (go g)@) = x4, Fo g)(a) = 122’), 
(g of)(x) = [2x]? 

Let g(x) = log, x and A(x) = x* +2. Then f(x) = (g0h)(x). 
Let g(x) = 2x and A(x) = sinx. Then f(x) = (g 0 h)(x). 

f = {(-5, 25), (—4, 16), (—3, 9), (—2, 4), (-1, 1), (0, 0), 
C1, 1), (2, 4), (3, 9), (4, 16), (5, 25)}. f is neither one-to-one 
nor onto. We omit the arrow diagram of f. 

Ff = {00, 0), (1, 4), (2, 3), (3, 2), (4, 1)}; f is one-to-one and 
onto. The arrow diagram of f is 


YN 
i 
x xX 
66. The ISBN’s check digit is invalid. It should be 8. 
69. Valid 
72. Ifiis an untripled digit, i contributes the value i to the sum. 
Thus if an untripled digit is changed, the sum will change 
by an amount a satisfying 0 < |a| < 10. It follows that the 
check digit will change. If a tripled digit is changed, the fol- 
lowing table shows that the sum mod 10 will again change 
by an amount a satisfying 0 < |a| < 10: 
Digit Digit Times 3 Contribution to Sum mod 10 
0 0 0 
1 3 3 
2, 6 6 
3 9 9 
+ 12 2 
5 15 5 
6 18 8 
7 21 1 
8 24 4 
9 27 7 


Again, the check digit will change. 


In the solutions to Exercises 73 and 76, a: b means “store item a 
in cell b.” 


73 


» 53:2 9,13 22,281 2 6,743 > 7,377 ¢ 3,20: 10,10: 9, 


796: 4 


76. 


79. 


80. 
83. 


86. 


88. 


93. 


96. 


99. 


101. 


104. 


107. 


109. 
112. 
114. 
117. 


714 : 0,631 : 6,26: 5, 373 : 1,775 : 8, 906: 13, 

509 : 2, 2032 : 7,42: 4,4:3, 136: 9, 1028: 10 

During a search, if we stop the search at an empty cell, we 
may not find the item even if it is present. The cell may be 
empty because an item was deleted. One solution is to mark 
deleted cells and consider them nonempty during a search. 
False. Take g = {(1, a), (2, b)} and f = {(a, z), (b, 2}. 
True. Let z € Z. Since f is onto, there exists y € Y such 
that f(y) = z. Since g is onto, there exists x € X such that 
g(x) = y. Now f(g(x)) = f(y) = z. Therefore f o g is onto. 


True. Suppose that g(x;) = g(x2). Then f(g(11)) =f(g(2)). 
Since f og is one-to-one, x; = x2. Therefore, g is one-to-one. 


g(S) = {a}.g(T) = {a,c}.g (UU) = {I}, g (VY) = 
{1, 2, 3} 


No. Let f(x) =x and g(x) = x*. Then 
E\(f) =f() = 1 = g() = Ei(g). 
101 


Suppose that $(Y1) = s1s52s3 = S(¥2). Nowa € Y if and 
only if s; = 1 if and only if a € Y2. Alsob € Y, if and only 
if sy = 1 if and only if b € Y2. Alsoc € Y if and only if 
53 = 1 if and only if c € Yp. It follows that Y; = Y2 and S is 
one-to-one. 


Ifxe XO Y,Cyny@®) =1=1-1 = Cy@)Cy(x). If x ¢ 
XY, then Cyny(x) = 0. Since either x ¢ X or x ¢ Y, either 
Cx (x) = 0 or Cy(x) = 0. Thus Cy (x)Cy(x) = 0 = Cyny(4). 
If x € X — Y, then 

Cx-y(x) = 1 = 1-[1 — 0] = Cy@[1 — Cy@)]. 
Ifx ¢ X — Y, then either x ¢ X orx € Y. Incase x ¢ X, 

Cx-y(x) = 0 =0-[1 — Cy@)] = Cxy@1 — Cy(@)]. 

In case x € Y, 


Cx-y(Q%) =0 = CxQ@)[1 


1] = Cx(@)[1 
Thus the equation holds for all x € U. 

f is onto by definition. Suppose that f(X) = f(Y). Then 
Cx(x) = Cy(x), for all x € U. Suppose that x € X. Then 
Cx(x) = 1. Thus Cy(x) = 1. Therefore, x € Y. This argu- 
ment shows that X C Y. Similarly, Y C X. Therefore X = Y 
and f is one-to-one. 


Cy(x)]. 


f is a commutative, binary operator. 
f is not a binary operator since f(x, 0) is not defined. 
g(x) = —x 


The statement is true. The least integer greater than or equal 
to x is the unique integer k satisfying 


k-1l<x<k. 
Now 
k4+2<x4+3<k4+3. 


Thus, k + 3 is the least integer greater than or equal to x + 3. 
Therefore, k + 3 = [x +3]. Since k = [x], we have 


[x +3] =k+3 = [x] +3. 


120. 


If n is an odd integer, n = 2k + 1 for some integer k. Now 


nm (2k+1)P 42 +4+4k+1 
4 4 - 4 


1 
=P+k4 
ale 


Since k? + kis an integer, 


rn 
—/=R +k. 
[z]="+ 


The result now follows because 
2k(2k + 2) 


n—-1\ /n+1 (2k+1)—1] [2Qk+1) +1 
( 2 )( 2 )=| l| | 
4 


2 2 
_ 4k? + 4k 


=P +k. 


123. Let k = [x]. Then k —1 < x < k and 2x < 2k. Thus 


[2x] < 2k = 2[x]. Now [x] = k < x +41. Therefore 
2[x] < 2x+2 < [2x] + 2, so 2[x] — 2 < [2x]. Therefore 
2[x] — 1 < [2x]. 


126. April, July 


Section 3.2 Review 


1. 


15. 


16. 


© dm + Amt) +++ + an 


. A string over X is a finite sequence of elements from X. 


A sequence is a function in which the domain is a subset of 
integers. 


. If s, denotes element n of the sequence, we call n the index 


of the sequence. 


. A sequence s is increasing if for all i and j in the domain of 


s, if i <j, then s; < sj. 


. A sequence s is decreasing if for all i and j in the domain of 


s, if i <j, then s; > s;. 


. A sequence s is nonincreasing if for all 7 and j in the domain 


of s, if i <j, then s; = 5;. 


. A sequence s is nondecreasing if for all i andj in the domain 


of s, ifi <j, then s; < sj. 


. Let {s,} be a sequence defined forn = m,m-+1,..., and let 


nj, nz, ... be an increasing sequence whose values are in the 
set {m,m+1,...}. We call the sequence {s,,} a subsequence 
of {sp}. 


9. Am4m+1°°* an 


. The null string is the string with no elements. 
. X* is the set of all strings over X. 
. X* is the set of all nonnull strings over X. 


. The length of a string @ is the number of elements in a. It is 


denoted |a|. 


The concatenation of strings w and f is the string consisting 
of a followed by £. It is denoted af. 


A string 6 is a substring of the string a if there are strings y 
and 6 witha = yBé. 
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Section 3.2 


lc 2c 

3. cddcdc 

5. Increasing, decreasing, nonincreasing, nondecreasing 
8. None of (a)—(d) 31. 52 

32. 52 33. No 

34. No 35. No 

36. Yes 45. 12 

46. 23 47.7 

48. 46 49. 1 

50. 3 51. 3 

52. 21 53. No 

54. No 55. No 

56. Yes 73. 15 

74. 155 75. 2n + 3(n— 1)n/2 
76. Yes 77. No 

78. No 79. Yes 

88. 2/9 

91. 74 = a3 +a4 = 2/9 + 3/64 = 155/576 

94. Yes. We must show that 


97. 
100. 
101. 
105. 


109. 
111. 
117. 
119. 


n—-1 n 
"= Din—-2?” n+ 12n— 12 


The preceding inequality is equivalent, successively, to 


a for alln > 3. 


= an+1 


(n— 12+ 12 (n— 1) > 2) 
(n? — 1° (n— 1) > W(n—2)° 
(n* — 2n? + lh(n-1) > w(n7 —4n+4) 
3n* + 2n? +n—1 > 6n°. 


Since 3n* > 6n? and 2n? +n— 1 > 0 forall n > 3, 


3n* + 2n? +n—1>3n' > 6n° foralln > 3. 


By plugging in n = 3 into the inequality 3n* + 2n? +n—1 > 
6n°, we can verify that the inequality also holds for n = 3. 
Thus 


3n* + 2n? +n—1> 6n? for all n > 3 


and hence the equivalent inequality a, > a,+1 also holds for 
all n > 3. Therefore a is decreasing. 

Since a; > O for alli, z,41 > Zp for alln. Thus zis increasing. 
Yes, z is nondecreasing. See the solution to Exercise 97. 

We have n < 4n and n-4n is a perfect square. 


Since 6 = 2 -3, the a-sequence must contain a multiple of 3. 
The smallest multiple of 3 that can contribute to a perfect 
square is 4-3 = 12. Since 6 < 8 < 12 and6-8-I2isa 
perfect square, sg = 12. 


1,355 7,.9).11,.13 110. 1,5, 9, 13, 17, 21, 25 
ny = 2k -1 112. sy, = 4k — 3 

88 118. 1140 

48 120. 3168 


654 


137. 
140. 


143. 
146. 


149. 


151. Since xj <x <%, |x—x1| =x 
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b, = 1, bp = 2, b3 
Let so = 0. Then 


n n 
> abe = > (sk = Se—1) bE 
k=l k=l 
n n 
= > sebe — Sy SK—1DK 
k=l k=l 


n n 
= > SEDk ) SOKA + SnPn41 
k=1 k=1 
n 


= S 8K(Da = Devi) + Snbnzi- 
k=l 


3, bg = 4, bs = 5S, be = 126 


00, 01, 10, 11 


000, 010, 001, 011, 100, 110, 101, 111, 00, 01, 11, 10, 0, 
1A 
Basis Step (n = 1) In this case, {1} is the only nonempty 
subset of {1}, so the sum is 

1 

-=l=n. 


1 


Inductive Step Assume that the statement is true for n. We 
divide the subsets of 


{l,...,n,n+ 1} 


into two classes: 


Ci 
C2 


class of nonempty subsets that do not contain n + 1 


class of subsets that contain n + 1. 


By the inductive assumption, 


1 
ieee ee 


Cc 


Since a set in C2 consists of n + 1 together with a subset 


(empty or nonempty) of {1,..., 7}, 
1 1 1 1 
> =a aioe 
a (n+ 1)ny +++ ng n+l n+1 G Ny ++ Ng 


[The term 1/(n + 1) results from the subset {n + 1}.] By the 
inductive assumption, 


1 1 1 1 1 
= | n=l. 
a n+1 n+1 . 
1 


¢, 
Therefore, 
1 

nr 

G (n+ I)ny +++ ng 
Finally, 

1 1 1 

» 7 x | > 1 =ntl. 
cue, 7°" Cc Ayo Nk C2 (n+ )ny +++ Ng 


x, and |x — x,| = %,—-~x. 
Thus 


n n-1 
So be mil = be ail + D0 be — ail + bx 
i=2 


i=l 
n-1 


x+ Sob 
i=2 
n-1 
= Xj] + Qn — x1). 
i=2 


= (x Xi| Si (Xn Xx) 


154. Using Exercise 4, Section 2.4, we have 


n n n 


n n n 
= 2D 
i=l j=l 


i=1 j=l i=1 j=l 


n n n n n n 
=D 2 
j=l i=l i=l j=l i=1 j=l 
pea aa 2 [mero 
~ 6 2 


J= 
n 


; Ss n(n + 1)(2n + 1) 
6 


i=l 
; = 1)(Qn+ 2] 
~ 6 
“ 1)Qn+ 2] 
+ n 


n(n + 1)? 
2 


6 
[“ +1)Qn+ 2] 
= 2n 
6 
L[2Q2n 


n(n +1)? 
2 
1)-—3(n+ 1] 


n(n 


6 
rint+l[n—1] n(n? - 1) 


6 6 


155. The function is one-to-one. Suppose that f(a) = f(f). Then 


aab = Bab. Thus a = B. 
The function f is not onto. Since |f(@)| > 2 for all 
a €X*, f(a) #A for all a € X*. 


158. Leta =A. Thena € Land the first rule states that ab = aab € 


L. Now B =ab € Land the first rule states that aabb = aBb € 
L. Now y=aabb e€ L and the first rule states that 
aaabbb=ayb € L. 


161. We use strong induction on the length n of a to show that if 


a € L,a has an equal number of a’s and b’s. 

The Basis Step is n = 0. In this case, @ is the null 
string, which has an equal number of a’s and b’s. 

We turn now to the Inductive Step. We assume any 
string in L of length k < n has an equal number of a’s and 
b’s. We must show that any string in L of length n has an 
equal number of a’s and b’s. Let aw € L and suppose that 
|a| =n > 0. Nowa is in L because of either rule 1 or rule 2. 

Suppose that @ is in L because of rule 1. In this case, 
a = apb ora = bfa, where B € L. Since |B| < n, by the 
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inductive hypothesis 6 has an equal number of a’s and b’s. is 
Since a = aBb or a = ba, a also has an equal number of 
a’s and b’s. Roo R; = {(1, 1), C, 3), 2, D, (2, 3)}- 


Suppose that @ is in L because of rule 2. In this case, 
a = By, where B € Landy é€ L. Since |B| < nand|y| <n, Section 3.3 


by the inductive hypothesis 6 and y each have equal numbers : F 
of a’s and b’s. Since a = By, @ also has an equal number of 1. {(8840, Hammer), (9921, Pliers), (452, Paint), (2207, 


a’s and b’s. The proof by induction is complete. Carpe) 
4. {(a, a), (b, b)} 
Section 3.3 Review 5. , s 

1. A binary relation from a set X to a set Y is a subset of the b 2 
Cartesian product X x Y. a 1 

2. Inadigraph of a relation on X, vertices represent the elements c 1 
of X and directed edges from x to y represent the elements 
(x, y) in the relation. ‘ 

3. A relation R on a set X is reflexive if (x,x) € R for ev- : Mercury 1 
ery x € X. The relation {(1, 1), (2, 2)} is a reflexive relation Venus 2 
on {1,2}. The relation {(1, 1)} is not a reflexive relation on Earth 3 
{1,2}. Mars 4 

4. A relation R on a set X is symmetric if for all x,y € X, if Jupiter 5 
(x,y) € R, then (y,x) € R. The relation {(1, 2), (2, 1} is Saturn 6 
a symmetric relation on {1, 2}. The relation {(1, 2)} is not a Uranus 7 
symmetric relation on {1, 2}. Neptune 8 

5. A relation R on a set X is antisymmetric if for all x,y € 
X, if (x,y) € R and (y,x) € R, then x = y. The re- 9 
lation {(1, 2)} is an antisymmetric relation on {1,2}. The ; 
relation {(1,2),(2,1)} is not an antisymmetric relation 
on {1, 2}. pa é 

6. A relation R on a set X is transitive if for all x, y,z € X, if 
(x, y) and (y, z) € R, then (x,z) € R. The relation {(1, 2), 12. 


(2, 3), C1, 3)} is a transitive relation on {1, 2, 3}. The relation 
{(1, 2), (2, 1)} is not a transitive relation on {1, 2}. 


7. A relation R on a set X is a partial order if R is reflexive, 
antisymmetric, and transitive. The relation 


{C1, 1), (2, 2), (3, 3), C, 2), 2, 3), CL, 3)} 


is a partial order on {1, 2, 3}. 


8. If R is a relation from X to Y, the inverse of R is the relation 


from Y to X: 
- 13. {(a, b), (a,c), (b, a), (b, d), (c, c), (c, d)} 
R = {(,2)|@y) € R}. 
16. {(b,c), (c, b), (d, d)} 
The inverse of the relation {(1, 2), (1, 3)} is {(2, 1), G3, 1}. 17. (Exercise 1) {(Hammer, 8840), (Pliers, 9921), (Paint, 452), 
9, Let Rj be a relation from X to Y and R> be a relation from Y (Carpet, 2207)} 
to Z. The composition of R; and R2 is the relation from X to 18. {(1, 1), (1, 4), (2, 2), (2, 5), (3, 3), (4, 1), (4,4), 6, 2), 
Zz (5,5)} 


20. R=R! ={(1, 1), (1, 2), (1,3), 0.4), 5), 2, D, (2, 2), 
2,3), 2,9, GB, D, GB, 2), GB, 3), 4 1, 4,2), 6, D} 


The composition of the relations 23. Antisymmetric 


R2 oR, = {(x, z) | (x, y) € Ri and (y, z) € Ro for some y € Y}. 


R, = {(1, 2), C1, 3), (2, 2)} 24. Symmetric, antisymmetric, transitive 
27. Antisymmetric 
ani 30. Reflexive, symmetric, antisymmetric, transitive, partial order 


Ro = {@2, 1), (2, 3), d, 4} 33. Antisymmetric 
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35. 
38. 


40. 
43. 
45. 


48. 


51. 


54. 
57. 


59. 
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Reflexive, antisymmetric, transitive, partial order 
Reflexive: Suppose that (x;,.x2) is in X; x Xz. Since R; is 
reflexive, x; Rix, and x2R2x2. Thus (x1, x2)R(x1, x2). 

Antisymmetric: Suppose that (x), x2)R(x, x4) and 
(x), x4)R(@1, x2). Then x1 Rix, and x, R1x1. Since Rj is anti- 
symmetric, x} = x}. Similarly, x2 = x4. Therefore (x1, x2) = 
(x, x4) and R is antisymmetric. 

Transitivity is proved similarly. 
{(, D, (2, 2), 3, 3), 4,4), C, 2), 2, 3), 2, 1), 3, 2)} 
{, 1), CL, 2), 2, 1), (2, 2)} 
True. Let (x, y), (y, z) € R7!. Then (z, y), (y, x) € R. Since 
R is transitive, (z, x) € R. Thus (x, z) € R7!. Therefore, R7! 
is transitive. 
True. We must show that (x,x) € Ro S for all x € X. Let 
x € X. Since R and S are reflexive, (x, x) € R and (x, x) € S. 
Therefore, (x, x) € Ro Sand Ro S is reflexive. 
True. Let (x, y) € RNS. Then (x, y) € Rand (x, y) € S. Since 
Rand S are symmetric, (y, x) € R and (y, x) € S. Therefore, 
(y, x) € RNS and RN S is symmetric. 
False. Let R = {(1, 2)}, § = {(2, D}. 
True. Suppose that (x, y), (y, x) € R7!. Then (y, x), (x, y) € 
R. Since R is antisymmetric, y = x. Therefore R~! is anti- 
symmetric. 
Ris reflexive and symmetric. R is not antisymmetric, not tran- 
sitive, and not a partial order. 


Section 3.4 Review 


1. 


An equivalence relation is a relation that is reflexive, sym- 
metric, and transitive. The relation 


{C1 1), (2, 2), (3, 3), CL, 2), 2, D} 
is an equivalence relation on {1, 2, 3}. The relation 
{(, 1), (3, 3), CL, 2), (2, 1} 


is not an equivalence relation on {1, 2, 3}. 


. Let R be an equivalence relation on X. The equivalence 


classes of X given by R are sets of the form 
{x € X | xRa}, 


where a € X. 


. If R is an equivalence relation on X, the equivalence classes 


partition X. Conversely, if S is a partition of X and we define 
xRy to mean that for some S € S, both x and y belong to S, 
then R is an equivalence relation. 


Section 3.4 


1. 


4. 


7. 


Equivalence relation: [1] = [3] = {1, 3}, [2] = {2}, [4] = 
{4}, [5] = {5} 

Equivalence relation: [1] = [3] = [5] = {1,3,5}, [2] = 
{2}, [4] = {4} 

Not an equivalence 
reflexive) 


relation (neither transitive nor 


10. 


11. 
14. 


17. 


20. 


24. 
26. 


28. 
31. 


34. 


37. 


40. 
43. 


46. 


49. 


53. 


Equivalence relation: [1] = [3] = [5] = {1,3,5}, [2] = 

[4] = {2, 4} 

The relation is an equivalence relation. 

The relation is not an equivalence relation. It is neither re- 

flexive nor symmetric. 

{(, 1), C, 2), 2, 1), (2, 2), 3, 3), 3, 4), (4, 3), 4, 9}, 

(1] = [2] = {1, 2}, [3] = [4] = (3, 4} 

{(, 1), C, 2), C, 3), 2, 1), 2, 2), (2, 3), (3, 1), G, 2), 

(3, 3), (4, 4)}, 

(1) = [2] = 13] = (1, 2, 3}, [4] = {4} 

{1}, (1, 3}, {1, 4}, {1, 3, 4} 

[Part (b)] 

{San Francisco, San Diego, Los Angeles}, {Pittsburgh, 

Philadelphia}, {Chicago} 

R= {(x, x) | x € X} 

Suppose that x R y. Since R is reflexive, yR y. Taking z = yin 

the given condition, we have y R x. Therefore R is symmetric. 

Now suppose that x R y and y Rz. The given condition 
tells us that zRx. Since R is symmetric, x R z. Therefore R is 

transitive. Since R is reflexive, symmetric, and transitive, R 

is an equivalence relation. 

[Part (b)] 

Cd, 1), C1, 2), C1, 3), 4), C1, 5), CL, 6), C1, 7), 

(1, 8), (1, 9), CL, 10), (2, 1), (3, 1), 4, 1), 6, 1), 

(6, 1), (7, D, (8, D, (9, D, (10, 1) 

(a) We show symmetry only. Let (x,y) € Ry M Ro. Then 
(x, y) € R; and (x,y) € Ro. Since R; and R2 are sym- 
metric, (y, x) € Ry and (y, x) € Ro. Thus (y, x) € RENR2 
and, therefore, R; M R2 is symmetric. 

(b) A is an equivalence class of Rj M Ro if and only if there 
are equivalence classes A; of R; and A2 of R2 such that 
A=A,NAp2. 

[Part (b)] Torus 

If x € X, then x € f—!(f({x})). Thus U{S | S € S} = X. 

Suppose that 


aef—' (fy) Of7! (zh) 


for some y, z € Y. Then f(a) = y and f(a) = z. Thus y = z. 
Therefore, S is a partition of X. The equivalence relation that 
generates this partition is given in Exercise 41. 

Suppose, by way of contradiction, that a € [b]. Then (a, b) € 
R. Since R is symmetric, (b,a) € R. Since R is transitive, 
(b, b) € R, which is a contradiction. Therefore [b] = @. 
Since R is not transitive, there exist (a, b), (b,c) € R, but 
(a,c) € R. Then a € [b], b € [c], anda ¢ [c]. Since R is 
reflexive, b € [b]. Therefore [b] N [c] 4 2, but [b] ¥ [c]. 
Thus the collection of pseudo equivalence classes does not 
partition X. 


p(R1) ={(1, 1), (2, 2), (3, 3), (4,4), C, 2), 3.4), 4, 2)} 
o(R1) ={, 1), 2, 1), C, 2), G4), 4, 3), 4, 2), 2,9) 
(Ri) = {C, 1), C, 2), GB, 4), 4, 2), G, 2)} 
t(a(p(R1))) = {(%, y) | x,y € {1, 2, 3, 4}} 


56. 


59. 


60. 


63. 
66. 


67. 


Let (x, y), (x, z) € T(R). Then (x, y) € R” and (y, z) € R". 
Thus (x,z) € R*”. Therefore, (x, z) € T(R) and 1(R) is 
transitive. 
Suppose that R is transitive. If (x, y) € T(R) = U{R"}, then 
there exist x = x9,...,%, = y € X such that (xj;-1,x;) € R 
fori =1,...,n. Since R is transitive, it follows that (x, y) € 
R. Thus R > 1(R). Since we always have R C 1(R), it fol- 
lows that R = t(R). 

Suppose that t(R) = R. By Exercise 56, t(R) is tran- 
sitive. Therefore, R is transitive. 


True. Let D = {(x, x) | x € X}. Then, by definition, o(R) = 
RUD, where R is any relation on X. Now 


p(R U Ro) = (Rj UR2) UD = (Ri UD) U (RUD) 
= p(R1) U p(R2). 
False. Let R} = {(1, 2), (2, 3)}, Ro = {(1, 3), (3, 4)}. 
True. Using the notation and hint for Exercise 60, 
p(t(R1)) = (Ri) UD 
and 
t(p(Ri)) = t(Ri UD). 


So we must show that t(R;) UD = t(R; UD). 

We first note that if A C B, then t(A) C Tr(B). 
Now R; C R; UD. Therefore, t(R}) © t(R; U D). Also, 
DC R, UD. Therefore, D = t(D) € t(R; UD). It follows 
that t(R;) UD C t(R; UD). 

Since Rj C t(R,),R; UD C t(R1) UD. By the note in 
the preceding paragraph, we have t(Rj UD) € t(t(R,) UD). 
Since t(R;) U D is transitive, tT(t(R,) U D) = t(R}) UD 
(Exercise 59). Therefore, t(R; UD) C t(R1) UD. 


A set is equivalent to itself by the identity function. 

If X is equivalent to Y, there is a one-to-one, onto func- 
tion f from X to Y. Now f~! is a one-to-one, onto function 
from Y to X. 

If X is equivalent to Y, there is a one-to-one, onto func- 
tion f from X to Y. If Y is equivalent to Z, there is a one-to- 
one, onto function g from Y to Z. Now g of is a one-to-one, 
onto function from X to Z. 


Section 3.5 Review 


1. 


To obtain the matrix of a relation from X to Y, we label the 
rows with the elements of X and the columns with the ele- 
ments of Y. We then set the entry in row x and column y to 1 
if xRy and to 0 otherwise. 


. A relation is reflexive if and only if its matrix has 1’s on the 


main diagonal. 


. Arelation is symmetric if and only if its matrix A satisfies the 


following: For all i and j, the ijth entry of A is equal to the 
Jjith entry of A. 


4. See the paragraph following the proof of Theorem 3.5.6. 


5. The matrix of the relation R2 o R; is obtained by replacing 


each nonzero term in AjA2 by 1. 
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Hints and Solutions to Selected Exercises 


Section 3.5 


1. 


14. 


16. 


19. 


21. 


25. 


a p XY 6 
1/0 0 1 
211 0 1 °0 
3\0 1 1 0 

12 3 4°55 
1/0 1 0 0 0 
210 0 1 0 0 
3/0 00 1 0 
4;0 00 0 1 
5 \0 0 0 0 0 


» R= {(a,w), Gy), (cy), d,w), (d, x), (dy), (d, 2)} 
11. 


The test is, whenever the ijth entry is 1, i  j, then the jith 
entry is not 1. 


(For Exercise 8) 


| 


a= 
( 


NS = 
Ooroo°o.sa 
eee Dd 


(b) Az = 


12 1 
(c) AJA, =] 0 1 O 
0 1 0 


(d) We change each nonzero entry in part (c) to 1 to obtain 


1 11 
AjA2=[{0 1 O 
0 1 0 


(e) {U, 5), , a4), C, 0), (2,5), GB, b)} 
Each column that contains 1 in row x corresponds to an ele- 
ment of the equivalence class containing x. 


Suppose that the ijth entry of A is 1. Then the jth entry of 
either A; or Az is 1. Thus either (i, 7) € R, or (i,j) € Ro. 
Therefore, (i,j) € Ry, U Rx. Now suppose that (i,j) € 
R, U R>. Then the ith entry of either A; or Az is 1. There- 
fore, the ijth entry of A is 1. It follows that A is the matrix of 
Ri UR>. 

Each row must contain exactly one | for the relation to be a 
function. 


Section 3.6 Review 


1. 
2. 


An n-ary relation is a set of n-tuples. 


A database management system is a program that helps users 
access information in a database. 


. A relational database represents data as tables and provides 


ways to manipulate the tables. 
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. Asingle attribute or combination of attributes for a relation is 


a key if the values of the attributes uniquely define an n-tuple. 


5. A query is a request for information from a database. 


6. The selection operator chooses certain n-tuples from a rela- 


tion. The choices are made by giving conditions on the at- 
tributes (see Example 3.6.3). 


. The project operator chooses specified columns from a re- 


lation. In addition, duplicates are eliminated (see Example 
3.6.4). 


. The join operation on relations R; and R2 begins by examin- 


ing all pairs of tuples, one from R and one from Ro. If the 
join condition is satisfied, the tuples are combined to form 
a new tuple. The join condition specifies a relationship be- 
tween an attribute in R; and an attribute in Ro (see Example 


3.6.5). 


Section 3.6 


1. 


11. 


14. 


17. 


22. 


{(1089, Suzuki, Zamora), (5620, Kaminski, Jones), (9354, 
Jones, Yu), (9551, Ryan, Washington), (3600, Beaulieu, Yu), 
(0285, Schmidt, Jones), (6684, Manacotti, Jones)} 


. EMPLOYEE [Name] 


Suzuki, Kaminski, Jones, Ryan, Beaulieu, 
Schmidt, Manacotti 


. BUYER [Name] 


United Supplies, ABC Unlimited, JCN Electronics, 
Danny’s, Underhanded Sales, DePaul University 


TEMP := BUYER [Part No = 20A8] 

TEMP [Name] 

Underhanded Sales, Danny’s, ABC Unlimited 
TEMP! := BUYER [Name = Danny’s] 

TEMP2 := TEMP! [Part No = Part No] SUPPLIER 
TEMP? [Dept] 

04, 96 

TEMP! := BUYER [Name = JCN Electronics] 
TEMP2 := TEMP! [Part No = Part No] SUPPLIER 
TEMP3 := TEMP2 [Dept = Dept] DEPARTMENT 
TEMP4 := TEMP3 [Manager = Manager] EMPLOYEE 
TEMP4 [Name] 

Kaminski, Schmidt, Manacotti 


Let R; and R2 be two n-ary relations. Suppose that the set of 
elements in the ith column of R; and the set of elements in 
the ith column of Ry come from a common domain for i = 
1,...,. The union of R, and R2 is the n-ary relation Rj UR2. 


TEMPI := DEPARTMENT [Dept = 23] 

TEMP2 := DEPARTMENT [Dept = 96] 

TEMP3 := TEMP1 union TEMP2 

TEMP4 := TEMP3 [Manager = Manager] EMPLOYEE 
TEMP4 [Name] 


Kaminski, Schmidt, Manacotti, Suzuki 


Chapter 3 Self-Test 


1. 
2. 


ont nan & WwW 


10. 


11. 


12. 


13. 


14. 


15. 


16. 


[Section 3.1] f is not one-to-one. f is onto. 
[Section 3.2] 

(a) bs = 35, bio = 120 

(b) (n+1)?-1 

(c) Yes 

(d) No 


. [Section 3.3] Reflexive, symmetric, transitive 

. [Section 3.3] Symmetric 

. [Section 3.4] Yes. It is reflexive, symmetric, and transitive. 
. [Section 3.4] [3] = {3, 4}. There are two equivalence classes. 
. [Section 3.1] x = y = 2.3 

. [Section 3.2] 


(a) 14 
(b) 18 
(c) 192 
(d) an, = 4k 


. [Section 3.1] Define f from X = {1,2} to {3} by f(1) = 


f(2) = 3. Define g from {1} to X by g(1) = 1. 

[Section 3.2] 

(a) ccddccccdd 

(b) cccddccddc 

(c) 5 (d) 20 

[Section 3.1] (a : b means “store item a in cell b.”) 1:1, 
784: 4, 18:5, 329:6, 43:7, 281:8, 620:9, 1141: 10, 
31:11, 684: 12 

[Section 3.4] {(a, a), (b, b), (b, d), (b, e), (d, b), (d, d), (d, e), 
(e, b), (e, d), (e, e), (c, c)} 


[Section 3.3] All counterexample relations are on {1, 2, 3}. 
(a) False. R = {(1, 1)}. 
(b) True (c) True 
(d) False. R = {(1, 1}. 
n—2 
[Section 3.2] Ss (n-—k— 2) rk+2 


k=-1 
[Section 3.3] R = {(1, 1), (2, 2), @G, 3), (4,4), C, 2), (2, 1), 
(2, 3)} 
[Section 3.4] 


(a) R is reflexive because any eight-bit string has the same 
number of zeros as itself. 

Ris symmetric because, if s; and s2 have the same 
number of zeros, then s2 and s; have the same number 
of zeros. 

To see that R is transitive, suppose that s; and s2 
have the same number of zeros and that s2 and s3 have 
the same number of zeros. Then s; and s3 have the same 
number of zeros. Therefore, R is an equivalence relation. 


(b) There are nine equivalence classes. 


17. 


18. 


19. 


20. 


21. 


22. 


23. 


24. 


(c) 11111111, 01111111, 00111111, 00011111, 00001111, 
00000111, 00000011, 00000001, 00000000 


10 
[Section 3.5] | 1 1 
01 
; 110 
[Section 3.5 (; 0 ') 
110 
[Section 3.5] | 211 
101 
1.10 
[Section 3.5] | 1 11 
101 
[Section 3.6] 


ASSIGNMENT [Team] 
Blue Sox, Mutts, Jackalopes 


[Section 3.6] 

PLAYER [Name, Age] 

Johnsonbaugh, 22; Glover, 24; Battey, 18; Cage, 30; 
Homer, 37; Score, 22; Johnsonbaugh, 30; Singleton, 31 
[Section 3.6] 

TEMP! := PLAYER [Position = p] 

TEMP? := TEMP! [ID Number = PID] ASSIGNMENT 
TEMP2 [Team] 

Mutts, Jackalopes 


[Section 3.6] 

TEMP! := PLAYER [Age > 30] 

TEMP? := TEMP! [ID Number = PID] ASSIGNMENT 
TEMP2 [Team] 

Blue Sox, Mutts 


Section 4.1 Review 


1. 
2. 


An algorithm is a step-by-step method of solving a problem. 


Input—the algorithm receives input. Output—the algorithm 
produces output. Precision—the steps are precisely stated. 
Determinism—the intermediate results of each step of execu- 
tion are unique and are determined only by the inputs and the 
results of the preceding steps. Finiteness—the algorithm ter- 
minates; that is, it stops after finitely many instructions have 
been executed. Correctness—the output produced by the al- 
gorithm is correct; that is, the algorithm correctly solves the 
problem. Generality—the algorithm applies to a set of inputs. 


. A trace of an algorithm is a simulation of execution of the 


algorithm. 


. The advantages of pseudocode over ordinary text are that 


pseudocode has more precision, structure, and universality. It 
is often readily converted to computer code. 


. An algorithm is made up of one or more pseudocode 


functions. 
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Section 4.1 


3. 


9. 


12. 


15. 


The algorithm does not receive input (but, logically, it needs 
none). If some even number greater than 2 is not the sum of 
two prime numbers, the algorithm will stop and output “no”. 
If every even number greater than 2 is the sum of two prime 
numbers, lines 2 and 3 become an infinite loop—in this case, 
the algorithm will not terminate and, therefore, will not pro- 
duce output. The algorithm lacks precision; in order to execute 
line 2, we need to know how to check whether n is the sum of 
two primes. The algorithm does have the determinism prop- 
erty. The algorithm may lack the finiteness property. We have 
already noted that if every even number greater than 2 is the 
sum of two prime numbers (which is currently unsettled), the 
algorithm will not terminate. The algorithm is not general; that 
is, it does not apply to a set of inputs. Rather, it applies to one 
set of inputs—namely, the empty set. 


Input: s,n 


Output: small, the smallest value in the sequence s 


min(s,n) { 
small = s1 
fori=2ton 
if (s; < small) // smaller value found 
small = s; 
return small 


} 
Input: s,n 
Output: small (smallest), large (largest) 


small_large(s, n, small, large) { 
small = large = s; 
fori=2ton{ 
if (s; < small) 


small = s; 
if (s; > large) 
large = sj 
} 
} 
Input: s,n 
Output: sum 


seq_sum(s, n) { 
sum = 0 
fori=lton 
sum = sum + Ss; 
return sum 


} 
Input: s,n 
Output:  s (in reverse) 
reverse(s,n) { 
i=1 
jan 


while (i < j) { 
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18. 


21. 


24. 
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swap (Sj, Sj) 


i=it+l 
j=j-1 
} 
} 
Input: A (ann x n matrix of a relation R), n 
Output: true, if R is reflexive; false, if R is not reflexive 
is_reflexive(A, n) { 
fori=l1ton 
if (Aj == 0) 
return false 
return true 
} 
Input: A (ann x n matrix of a relation R), n 
Output: true, if R is antisymmetric; false, if R is not 
antisymmetric 
is_reflexive(A, n) { 
fori=l1ton—1 
forj=i+lton 
if (Ajj == 1 A Aji == 1) 
return false 
return true 
} 
Input: A (anmxk matrix of a relation R,), B (ak xn matrix 
of a relation R2), m, k,n 
Output: C (the m x n matrix of the relation R2 o R)) 


comp-relation(A, B, m, k,n, C) { 
// first compute the matrix product AB 
fori=ltom 
forj = | ton{ 
Cj =0 
fort=1tok 
C, ij = c ij + Ait By 
} 
// replace each nonzero entry in C by 1 
fori=1ltom 
forj = 1ton 
if (Cj > 0) 
Cy =1 


Section 4.2 Review 


1. 


Finding web pages containing key words is a searching prob- 
lem. The programs that perform the search are called search 
engines. Finding medical records in a hospital is another 
searching problem. Such a search may be carried out by peo- 
ple or by a computer. 


We are given text f and we want to find the first occurrence of 
pattern p in ¢t or determine that p does not occur in ¢. 


We use the notation in the solution to Exercise 2. Determine 
whether p is in ¢ starting at index 1 in ¢. If so, stop. Otherwise, 


12. 


13. 


determine whether p is in ¢ starting at index 2 in t. If so, stop. 
Continue in this way until finding p in ¢ or determining that 
p cannot be in ¢. In the latter case, the search can be termi- 
nated when the index in f is so large that there are not enough 
characters remaining in ¢ to accommodate p. 


Sorting a sequence s means to rearrange the data so that s is in 
order (nonincreasing order or nondecreasing order). 


The entries in a book’s index are sorted in increasing order, 
thus making it easy to quickly locate an entry in the index. 


To sort s|,..., S, using insertion sort, first insert s2 in s; so 
that s,, so is sorted. Next, insert s3 in 51, 52 so that 5), s2, 53 is 
sorted. Continue until inserting s, in 51,...,5,—1 so that the 
entire sequence s1,..., 5, is sorted. 


The time required by an algorithm is the number of steps to ter- 
mination. The space required by an algorithm is the amount of 
storage required by the input, local variables, and so on. 


Knowing or being able to estimate the time and space required 
by an algorithm gives an indication of how the algorithm will 
perform for input of various sizes when run on a computer. 
Knowing or being able to estimate the time and space required 
by two or more algorithms that solve the same problem makes 
it possible to compare the algorithms. 


Many practical problems are too difficult to be solved effi- 
ciently, and compromises either in generality or correctness 
are necessary. 


. When a randomized algorithm executes, at some points it 


makes random choices. 


The requirement that the intermediate results of each step of 
execution be uniquely defined and depend only on the inputs 
and results of the preceding steps is violated. 


To shuffle s1,...,8,, first swap s; and a randomly chosen 
element in s;,...,5,. Next, swap s2 and a randomly chosen 
element in 52, ..., S,. Continue until swapping s,_; and a ran- 


domly chosen element in s,—1, Sp. 


We might generate random arrangements of sequences to use 
as input to test or time a sorting program. 


Section 4.2 


1. 


First i and j are set to 1. The while loop then compares 
t) ---t4 = “bala” with p = “bala”. Since the comparison suc- 
ceeds, the algorithm returns i = | to indicate that p was found 
in ¢ starting at index 1 in t¢. 


First 20 is inserted in 


34 


Since 20 < 34, 34 must move one position to the right 


34 


Now 20 is inserted 


20 | 34 


11. 


13. 


Since 144 > 34, it is immediately inserted to 34’s right 


20 | 34 | 144 


Since 55 < 144, 144 must move one position to the right 


20 | 34 144 
Since 55 > 34, 55 is now inserted 
20 | 34 | 55 | 144 


The sequence is now sorted. 


. Since each element is greater than or equal to the element to 


its left, the element is always inserted in its original position. 


. We first swap a; and aj, where i = | andj = rand(1,5) = 5S. 


After the swap we have 


135| 57 | 72 |101] 34 
t t 
i j 


We next swap a; and aj, where i = 2 and j = rand(2,5) = 4. 
After the swap we have 


135101} 72 | 57 | 34 


ro 
l J 
We next swap a; and aj, where i = 3 and j = rand(3, 5) = 3. 
The sequence is unchanged. 
We next swap a; and aj, where i = 
rand(4, 5) = 5. After the swap we have 


4 andj = 


135101} 72 | 34 | 57 


td 
ij 


Yes. The following values for rand sort the input in increasing 
order: 


rand(1,5)=3, rand(2,5) =2, 


rand(4,5) = 5. 


rand(3,5) = 5, 


The while loop tests whether p occurs at index i in ¢. If p 
does occur at index i in ¢, fj4;-1 will be equal to p; for all 
j = 1,...,m. Thus j becomes m + 1 and the algorithm re- 
turns i. If p does not occur at index i in #, t;4;-1 will not be 
equal to p; for some j. In this case the while loop terminates 
(without executing return /). 

Now suppose that p occurs in ¢ and its first occurrence 
is at index i in t. As noted in the previous paragraph, the al- 
gorithm correctly returns i, the smallest index in t where p 
occurs. 


16. 


19. 


22. 
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If p does not occur in ¢, then the while loop terminates 
for every i and i increments in the for loop. Therefore, the for 
loop runs to completion, and the algorithm correctly returns 0 
to indicate that p was not found in ¢. 


Input: 
Output: 


s (the sequence 5;,..., 5,), m, and key 


i (the index of the last occurrence of key ins, or 0 if 
key is not in s) 
reverse_linear_search(s, n, key) { 
i=n 
while (i > 1) { 
if (s; == key) 
return i 
i=i-l 
} 
return 0 


} 


We measure the time of the algorithm by counting the number 
of comparisons (¢;4;-1 == pj) in the while loop. 

No comparisons will be made if n — m+ 1 < 0. In the 
remainder of this solution, we assume that n — m+ 1 > 0. 

If p is in ft, m comparisons must be performed to verify 
that p is, in fact, in t. We can guarantee that exactly m compar- 
isons are performed if p is at index | in f. 

If p is not in ft, at least one comparison must be per- 
formed for each i. We can guarantee that exactly one compar- 
ison is performed for each i if the first character in p does not 
occur in ¢. In this case, n — m + 1 comparisons are made. 

Ifm <n—m-+1, the best case is that p is at index 1 in 
t.Ifn—m-+1 <™, the best case is that the first character in 
p does not occur in ¢. If m = n — m+ 1, either situation is the 
best case. 


Input: s (the sequence s1,...,5 Sn) and n 


Output: — s (sorted in nondecreasing order) 


selection_sort(s, n) { 
fori=1ton—1{ 

// find smallest in s;,... 
small_index = i 
forj=i+lton 

if (8; Ssmall_index) 

small_index = j 

swap (Sj, Ssmall_index) 


Sn 


} 
} 


Section 4.3 Review 


1. 


2. 


Analysis of algorithms refers to the process of deriving esti- 
mates for the time and space needed to execute algorithms. 


The worst-case time for input of size n of an algorithm is the 
maximum time needed to execute the algorithm among all in- 
puts of size n. 


. The best-case time for input of size n of an algorithm is the 


minimum time needed to execute the algorithm among all in- 
puts of size n. 
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. The average-case time for input of size n of an algorithm is the 


average time needed to execute the algorithm over some finite 
set of inputs all of size n. 


. f(n) = O(g(n)) if there exists a positive constant C; such that 


\f(n)| < Ci|g(n)| for all but finitely many positive integers n. 
This notation is called the big oh notation. 


. Except for constants and a finite number of exceptions, f is 


bounded above by g. 


» f(r) = Q(g(n)) if there exists a positive constant C2 such that 


\f(n)| > C2|g(n)| for all but finitely many positive integers n. 
This notation is called the omega notation. 


. Except for constants and a finite number of exceptions, f is 


bounded below by g. 


. f(n) =O(g(n)) if f(n) = O(g(n)) and f(n) = Q(g(n)). This 


notation is called the theta notation. 


Except for constants and a finite number of exceptions, f is 
bounded above and below by g. 


Section 4.3 
1. O(n) 4. @(n?) 7. O(n’) 
10. O(n) 13. O(nlgn) 14. @(n’) 
17. O(n) 20. @(n?) 23. @(n?) 
26. O(nign) 29. @(1) 
32. When n = 1, we obtain 

1=A+B+C. 


34. 
37. 


40. 


When n = 2, we obtain 


3=4A+2B+C. 


When n = 3, we obtain 


6=9A+3B+C. 


Solving this system for A, B, C, we obtain 


1 
A=B=., c=, 
2 
We obtain the formula 
2 
1424 see ae _naty 
2 2 z ° 


which can be proved using mathematical induction (see Sec- 
tion 2.4). 

n! =n(n—1)---2-1<nen---n=n" 

Since n = 2'8", n't! = (qisnyrt+! — 2@+Dle” Thus, it suf- 
fices to show that (n + 1) Ign < n? for all n > 1. A proof by 
induction shows thatn < 2”~! for alln > 1. Thus, Ign <n-1 


for all n > 1. Therefore, 
(n+ lIign< (n+ lIa— D=anr-1<r for alln > 1. 


Since f(n) = O(g(n)), there exist constants C’ > 0 and N 
such that 


f(n) < C'g(n) foralln>N. 


43. 


46. 
49. 
52. 


54. 
57. 
60. 
62. 
65. 
68. 
69. 


72. 


Let 
C = max{C’, f(1)/g(1), f(2)/g(2), -.-.F(N)/g(N)}. 
Forn < N, 
f(n)/g(n) < max{f(1)/g(1), f(2)/82), .--.f(N)/8(N)} 
= C. 
Forn > N, 


f(n) < C’g(n) < Cg(n). 


Therefore, f(n) < Cg(n) for all n. 

False. If the statement were true, we would have n” < C2” for 
some constant C and for all sufficiently large n. The preceding 
inequality may be rewritten as 


n\n 
= aC 
(3) = 
for some constant C and for all sufficiently large n. Since 
(n/2)" becomes arbitrarily large as n becomes large, we can- 
not have n” < C2” for some constant C and for all sufficiently 


large n. 
True 


True 
False. If n! > C(n + 1)! for all n > K, then, canceling n!, 
1 > C(n+ 1) for alln > K. But 1 > C(n + 1) is false if 
n > max{1/C — 1, K}. 
True 
True 
True 
False. A counterexample is f(n) = n and g(n) = 2n. 
True 
False. A counterexample is f(n) = 1 and g(n) = I/n. 
f(n) # O(g(n)) means that for every positive constant C, 
\f(2)| > Cl|g(n)| for infinitely many positive integers n. 
We first find nondecreasing positive functions fo and go 
such that for infinitely many n, fo(n) =n? and go(n) =n. 
This implies that fo(n) A O(go(n)). Our functions also satisfy 
fo(n) =n and go(n) = n? for infinitely many n [obviously dif- 
ferent n than those for which fo(n) = rn? and go(n) =n]. This 
implies that go(n) 4 O(fo(n)). If we then set f(n) =fo(n) +n 
and g(n) = go(n) +n, we obtain increasing positive functions 
for which f(n) # O(g(n)) and g(n) 4 O(f(n)). 

We begin by setting fo(2) = 2 and go(2) = 27. Then 


fo(n) =n, go(n) = nr, ifn =2. 


Because go is nondecreasing, the least n for which we may 
have go(n) = nisn = 22. So we define fo(2?) = 24 and 
g0(27) = 2?. Then 


fo(n) = nr, go(n) =n, ifn = 22. 


The preceding discussion motivates defining 


if k is even 


9k 2 
Pate 
fe") 2 "Gf kis odd 


9k 2" if kis even 
a a= { 2 if kis odd. 
Suppose that n = 2 If kis odd, fo(n) = n2 and go(n) =n; if 
kis even, fo(n) = nand go(n) = nr. Now fo and go are defined 
only for n = 2", but they are nondecreasing on this domain. 
To extend their domains to the set of positive integers, we may 
simply define fo(1) = go(1) = 1 and make them constant on 
sets of the form {i | 2 <i< gy. 


76. No 
78. (a) The sum of the areas of the rectangles below the curve is 
equal to 
: + : tet : 
2 3 n 


80. 


83. 


This area is less than the area under the curve, which is 


equal to 
rH 1 
| —dx = log, n. 
1x 


The given inequality now follows immediately. 
(b 


= 


The sum of the areas of the rectangles whose bases are on 
the x-axis and whose tops are above the curve is equal to 


if ne 
2 n—1 


Since this area is greater than the area under the curve, the 
given inequality follows immediately. 


(c) Part (a) shows that 


ww 


nae se 
2 n 


Since log, n = O(1gn) (see Example 4.3.6), 


O(log, n). 


1 : tee ; = O(lgn). 
2, n 
Similarly, we can conclude from part (b) that 
1 ! ooo : = Q(lgn). 
2 n 
Therefore, 
1+ . : = O(lgn). 
2 n 


Replacing a by b in the sum yields 


qitl a 


=a <u 


oe 


(n+ Db". 


ey 
i=0 


By Exercise 81, the sequence {(1 + 1/n)"}°° | is increasing. 


Therefore 
1\! 1\” 
= (142) <(+2) 
1 n 


86. 


89. 


91. 


94, 
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for every positive integer n. Exercise 82 shows that 


1 n 
(145) <4 
n 


for every positive integer n. Taking logs to the base 2, we 
obtain 


1 n 
1=1g2<1e(14+ >) <lg4=2. 
n 


Since 
1" 1 1 
ie(14 ) =mg (14 ) =e (“4 ) 
n n n 
= vllg(n+ 1) — Ign], 
we have 


1 < n{lg(n+ 1) — Ign] < 2. 
Dividing by n gives the desired inequality. 
Replacing b by a in the sum yields 


ae qitl n 


=D > ae i 
= is = (n+ lja". 
i=0 


By Exercise 88, the sequence {(1 + Liner ye 
Since (1 + 1/n)"*! = 4, whenn = 1, 


1 n+l n+1 n+l 
4>{1+- = . 
ae) He 


Taking logs to the base 2, we obtain 
1 
Dlg (* = ) 
n 


n+1 
2=1g4>1g(“**) =(n 
n 
Ign + 1) 


=(n 
Dividing by n + | gives the desired inequality. 


~ , is decreasing. 


Ign]. 


True. Since limy-.o0 f(n)/g(n) = 0, taking ¢ = 1, there exists 
N such that 


fn) 
g(n) 
Therefore, for alln>N, |f(n)| < |g(n)| and f(n) = O(g(n)). 


True. Let d = |c|. Since limy-. oo [f(”) |/|g(2)| = d > 0, tak- 
ing e = d/2, there exists N such that 


<1, foralln>N. 


if()| 
lg(”)| 


¢ <d/2, foralln>N. 


This last inequality may be written 


d 
< Fm! , foralln>VN, 
2 |g(n)| 2 
or 
d d 
— < If! < saad for alln > N, 
2 |g@| 2 
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or the center as in Figure 2.4.6. It considers each of the squares 
covered by the center tromino as missing. It then tiles the four 
subboards, and, by the inductive assumption, these subboards 
are correctly tiled. Therefore, the n x n board is correctly tiled. 
The Inductive Step is complete. The algorithm is correct. 


d 3d 
3 !8@)| < |f(n)| < FZ ls, for alln > N. 


Therefore, f(n) = O(g(n)). 


99. Multiply both sides of the inequality in Exercise 98 by lg e and 7 


. The proof is by strong induction on n. The Basis Steps 
use the change-of-base formula for logarithms. 


(n = 1, 2) are readily verified. 

Assume that the algorithm is correct for all k < n. We 
must show that the algorithm is correct forn > 2. Sincen > 2, 
the algorithm executes the return statement 


Section 4.4 Review 
1. An algorithm that contains a recursive function 


2. A function that invokes itself return walk(n — 1) + walk(n — 2) 


factorial(n) { 
if (@ == 0) 
return | 
return n * factorial(n — 1) 


} 


. The original problem is divided into two or more subproblems. 
Solutions are then found for the subproblems (usually by fur- 
ther subdivision). These solutions are then combined in order 
to obtain a solution to the original problem. 


10. 


By the inductive assumption, the values of walk(n — 1) and 
walk(n — 2) are correctly computed by the algorithm. Since 


walk(n) = walk(n — 1) + walk(n — 2), 
the algorithm returns the correct value of walk(n). 
(a) Input: n 

Output: 2+4+4---+2n 


1. sum(s, n) { 
2, if (n == 1) 


. Ina base case, a solution is obtained directly, that is, without 3. return 2 
a recursive call. 4. return sum(n — 1) + 2n 
; : ‘ ' Si 
. If a recursive function had no base case, it would continue to , } : 
call itself and never terminate. (b) Basis Step (n = 1) If n is equal to 1, we correctly re- 
turn 2. 


7. fl =1.f2 = 1, fn =fn-1 + fn—2 forn > 3 


Inductive Step Assume that the algorithm correctly 


8B fi=Lh=LA=2,f4 =3 computes the sum when the input is n — 1. Now suppose 
that the input to this algorithm is n > 1. At line 2, since 
Section 4.4 n # 1, we proceed to line 4, where we invoke this algo- 


1. (a) Atline 2, since 4 4 0, we proceed to line 4. The algorithm 


is invoked with input 3. 


(b) At line 2, since 3 4 0, we proceed to line 4. The algorithm 
is invoked with input 2. 


(c) Atline 2, since 2 4 0, we proceed to line 4. The algorithm 


rithm with input n — 1. By the inductive assumption, the 
value returned, sum(n — 1), is equal to 


Q4---+2(n—1). 


At line 4, we then return 


is invoked with input 1. sum(n — 1) +2n=2+---+2(n—1)+2n, 
(d) At line 2, since 1 # 0, we proceed to line 4. The algorithm Shichasthe:contectvalue: 
is invoked with input 0. 
: : 13. Input: The sequence sj,..., 5 s, and the length n of the 
(e) At lines 2 and 3, since 0 = 0, we return 1. sequence 
Execution resumes in part (d) at line 4 after computing . . 
0! (= 1). We return 0!-1 = 1. Output: The maximum value in the sequence 


Execution resumes in part (c) at line 4 after computing 
1! (= 1). We return 1!-2 = 2. 


find_max(s, n) { 


Execution resumes in part (b) at line 4 after computing : a ; ; 
2! (= 2). We return 2!-3 = 6. ee ake n—1) 
Execution resumes in part (a) at line 4 after computing eas a) , 
3! (= 6). We return 3! -4 = 24. peal 
. We use induction on i, where n = 2'. The Basis Step is i = 1. else 
In this case, the board is a tromino T. The algorithm correctly return sp 


tiles the board with T and returns. Thus the algorithm is correct 
fori = 1. 

Now assume that ifn = 2', the algorithm is cor- 
rect. Let n=2't!. The algorithm divides the board into four 
(n/2) x (n/2) subboards. It then places one right tromino in 


We prove that the algorithm is correct using induction 
on n. The base case isn = 1. If n = 1, the only item in the 
sequence is s; and the algorithm correctly returns it. 


16. 


18. 


21. 


Assume that the algorithm computes the maximum for 
input of size n — 1, and suppose that the algorithm receives 
input of size n. By assumption, the recursive call 


x = find_max(s,n — 1) 


correctly computes x as the maximum value in the sequence 
S],--.,S,—1. Ifx is greater than s,,, the maximum value in the 
sequence s1,..., 5 Sy is x—the value returned by the algorithm. 
If x is not greater than s,,, the maximum value in the sequence 
S|, +--+, Sn iS Sy—again, the value returned by the algorithm. 
In either case, the algorithm correctly computes the maximum 
value in the sequence. The Inductive Step is complete, and we 
have proved that the algorithm is correct. 


To list all of the ways that a robot can walk n meters, set s to 
the null string and invoke this algorithm. 

Input: 
Output: 


n, s (a string) 


All the ways the robot can walk n meters. Each 
method of walking n meters includes the extra 
string s in the list. 


list_walk1(n, s) { 


if (n == 1) { 
printin(s + “take one step of length 1’’) 
return 

} 

if (n == 2) { 
println(s + “take two steps of length 1”) 
println(s + “take one step of length 2”) 


return 
} 
s’ = 5 + “take one step of length 2” // concatenation 
list_-walk1(n — 2, s’) 
s’ = 5 + “take one step of length 1” // concatenation 
list_walkl(n — 1, s’) 


After one month, there is still just one pair because a pair 
does not become productive until after one month. There- 
fore, a; = 1. After two months, the pair alive in the beginning 
becomes productive and adds one additional pair. Therefore, 
az = 2. The increase in pairs of rabbits a, — a,—; from month 
n — | to month n is due to each pair alive in month n — 2 
producing an additional pair. That is, a, — dy—| =dy—2. Since 
{dy} satisfies the same recurrence relation as {f,}, a, =o, and 
a2 =f3, dn =fn4i,n = I. 


Basis Step (n = 2) 


ff =1=1-2-1=fifh+ Cy? 


Inductive Step 


Safina + (-V)"? = fnlfng1 thn) + (-D"? 
= fini tf + (-1y"? 
= fifo thrust CD + Cnr? 
= frida tfa-1) =f 


24. 


27. 


30. 
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Basis Step(n=1) f?=17=1=1-l=fif 
n+l n 

Inductive Step Sof =) fe thn =ffnt thu 
k=1 k=1 


= fri Gn + fre) = fn+ifnt2 
We use strong induction. 
Basis Steps (n= 6,7) fs =8 > 7.59 = (3/2)°. ff = 13 > 
11.39 = (3/2). 
Inductive Step 


n-2 
Sn =fi-1 + fn—2 > (5) + 
3 n-1 3 -1 
-(3) [@) 
_ 3 n-1 16 3 n-1 
-(3) [5]>G) 


We use strong induction on n. 

l=fi 

Inductive Step Suppose that n > 2 and that every posi- 
tive integer less than n can be expressed as the sum of distinct 
Fibonacci numbers, no two of which are consecutive. Let fi, 
be the largest Fibonacci number satisfying n > fy,. Ifn = fx,, 
then n is trivially the sum of distinct Fibonacci numbers, no 
two of which are consecutive. Suppose that n > f;,. By the 
inductive assumption, n — f;, can be expressed as the sum of 
distinct Fibonacci numbers fz, > fk, > +--+ > fk,» no two of 
which are consecutive: 


m 
n — fe, = he 
i=2 


Now n is expressed as the sum of Fibonacci numbers: 


Basis Step (n = 1) 


m 


n= ye 
i=l 


We next show that f;, > f;,, so that, in particular, n is the sum 
of distinct Fibonacci numbers. 

Notice that f;, <n. Since fy, is the largest Fibonacci 
number satisfying n > fi fis <fix+ Hfig = fix 


(«) 


n = fk, + Sie > Shy + fi =I = fi +1: 


This last inequality contradicts the choice of f;, as the largest 
Fibonacci number satisfying n > f;,. Therefore f;, > fy, 

The only Fibonacci numbers in the sum (x) that might 
be consecutive are f;, and f;,. If they are consecutive, we may 
also write () as 


m 
n=) fi 
i=l 
m 


= fa + fin + > fs 


i=3 
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m 
= fi, +fn—1 + > Sig 
i=3 
m 
= fon +S fic- 
i=3 
Now fi, 41 < nand fk, 41 > fx, This contradicts the choice of 
Jk, as the largest Fibonacci number satisfying n > fi,. The 
inductive step is complete. 


Using the formula fi fx2 —fe. = (—1)*! from Exercise 21, 
we obtain 


pe" pga ei 
fi fet <= fic fictt 
S  fit2 a) 
= 1 
> (i Stk 
=1 


(2 £).+(4 &) 
fh fl \A A 
+4 (2 EH) 


fn+l In 
= 1s fn fr — Int 
Snot ft fat 


Basis Step (n = 4) 
4 
SCD ii = fi + 2 — 3fs + 4fa = -142-6412=7 


k=1 


and 


(-1)*4h +f) -2=8+1-257. 


Inductive Step 


2 


n+l n 

SoC fe = SOD if + (HD + D frst 

k=1 k=1 
= (-D" (frat + fr—3) — 2+ (HD fing 
= (-1)" Lt Dfnet — fn—1 — fn—3] — 2 
=D" [a+ Dh thre) — ft —fn—3] 
= (-D"" [a+ Df + fr-1 — fr-3] — 2 
= (-1)"""[(a + Df + fr-2] — 2. 


The last step is justified by the equality f,-1 = fr—2 + fn—3. 


Chapter 4 Self-Test 


1. 


[Section 4.1] At line 2, we set large to 12. At line 3, since 
b > large (3 > 12) is false, we move to line 5. At line 5, since 
c > large (0 > 12) is false, we move to line 7, where we return 
large (12), the maximum of the given values. 


. [Section 4.1] If the set S is an infinite set, the algorithm will not 


terminate, so it lacks the finiteness and output properties. Line 
1 is not precisely stated since how to list the subsets of S and 
their sums is not specified; thus the algorithm lacks the preci- 
sion property. The order of the subsets listed in line 1 depends 
on the method used to generate them, so the algorithm lacks 
the determinism property. Since line 2 depends on the order 


of the subsets generated in line 1, the determinism property is 
lacking here as well. 


. [Section 4.2] The while loop first tests whether “110” occurs 


in ¢t at index 1. Since “110°” does not occur in ¢ at index 1, 
the algorithm next tests whether “110” occurs in ¢ at index 2. 
Since “110” does occur in f at index 2, the algorithm returns 
the value 2. 


. [Section 4.2] First 64 is inserted in 


44 


Since 64 > 44, it is immediately inserted to 44’s right 


44 | 64 


Next 77 is inserted. Since 77 > 64, it is immediately inserted 
to 64’s right 


44 | 64 | 77 


Next 15 is inserted. Since 15 < 77,77 must move one position 
to the right 


44 | 64 77 


Since 15 < 64, 64 must move one position to the right 


44 64 | 77 


Since 15 < 44, 44 must move one position to the right 


44 | 64 | 77 


Now 15 is inserted 


15 | 44 | 64 | 77 


Finally 3 is inserted. Since 3 < 77, 77 must move one position 
to the right 


15 | 44 | 64 77 


Since 3 < 64, 64 must move one position to the right 


15 | 44 64 | 77 


Since 3 < 44, 44 must move one position to the right 


15 44 | 64 | 77 


Since 3 < 15, 15 must move one position to the right 


15 | 44 | 64 | 77 


Now 3 is inserted 


3 | 15 | 44 | 64 | 77 


The sequence is sorted. 


5. [Section 4.2] We first swap a; and aj, where i = 1 and 
j = rand(1, 5) = 1. The sequence is unchanged. 
We next swap a; and aj, where i = 2 andj = 


rand(2, 5) = 3. After the swap we have 


5 | 2 | 51 | 44 | 96 


t 

ij 
We next swap a; and aj, where i = 3 and j = rand(3,5) = 5. 
After the swap we have 


5 | 2 | 96 | 44 | 51 


t t 
i j 


We next swap a; and aj, where i = 4 andj = rand(4,5) = 5. 
After the swap we have 


6. [Section 4.1] 


sort(a, b,c, x,y, Z) { 

x=a 

y=b 

Z=Cc 

if (vy < x) 
swap(x, y) 

if (¢ < x) 
swap(x, Z) 

if (z < y) 
swap(y, 2) 


7. [Section 4.2] 


repeaters(s,n) { 
pa. 
while (i < n) { 
if (5; == sis) 
println(s;) 
// skip to next element not equal to 5; 
ye 
while (i < nA 5; == sj) 
i=i+l1 


8. 


9. 


10. 
11. 
12. 
13. 


14. 
15. 


16. 
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[Section 4.1] 


test_distinct(a, b, c) { 
if (a bva 
return false 
return true 


cVb c) 


} 
[Section 4.3] 
Input: A and B (n x n matrices) and n 
Output: true (if A = B); false (if A # B) 


equal_matrices(A, B, n) { 
fori=lton 
forj = 1 ton 
if (Ajj = By) 
return false 
return true 


} 


The worst-case time is @(n”). 

[Section 4.3] @(n?) 

[Section 4.3] @(n*) 

[Section 4.3] @(n?) 

[Section 4.4] Since n 4 2, we proceed immediately to line 6, 
where we divide the board into four 4 x 4 boards. At line 7, we 
rotate the board so that the missing square is in the upper-left 
quadrant. At line 8, we place one tromino in the center. We 
then proceed to lines 9-12, where we call the algorithm to tile 
the subboards. We obtain the tiling: 


[Section 4.4] t4 = 3, ts = 5 


[Section 4.4] 
Input: n, an integer greater than or equal to | 
Output: ft, 
tribonacci(n) { 
1. if (n lvn 2vn 3) 
return | 
A. return tribonacci(n — 1) + tribonacci(n — 2) 


+ tribonacci(n — 3) 


} 


[Section 4.4] Basis Steps (n = 1, 2, 3) Ifn = 1, 2,3, at 
lines | and 2 we return the correct value, 1. Therefore, the al- 
gorithm is correct in these cases. 
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Inductive Step Assume that n > 3 and that the algorithm 
correctly computes t,, if k < n. Since n > 3, we proceed to 
line 3. We then call this algorithm to compute ¢,—1, tn—-2, and 
t,-3. By the inductive assumption, the values computed are 
correct. The algorithm then computes t,—1 + tp—2 + t,—3. But 
the formula shows that this value is equal to ¢,. Therefore, the 
algorithm returns the correct value for f,,. 


Section 5.1 Review 


1. 


We say that d divides n if there exists an integer q satisfying 
n= dq. 


. If d divides n, we say that d is a divisor of n. 


3. If d divides n, n = dq, we call g the quotient. 


4. Aninteger greater than | whose only positive divisors are itself 


and | is called prime. 


5. An integer greater than | that is not prime is called composite. 


6. If nis composite, it must have a divisor d satisfying 2 < d < 


15. 


L./n| (see Theorem 5.1.7). 


. Algorithm 5.1.8 does not run in time polynomial in the size of 


the input. 


. Any integer greater than 1 can be written as a product of 


primes. Moreover, if the primes are written in nondecreasing 
order, the factorization is unique. 


. See the proof of Theorem 5.1.12. 


. A common divisor of m and n, not both zero, is an integer that 


divides both m and n. 


. The greatest common divisor of m and n, not both zero, is the 


largest common divisor of m and n. 


. See Theorem 5.1.17. 


. A common multiple of m and n is an integer that is divisible 


by both m and n. 


. The least common multiple of m and n is the smallest positive 


common multiple of m and n. 


See Theorem 5.1.22. 16. gcd(m, n) -lem(m, n) =mn 


Section 5.1 


1. 


First d is set to 2. Since n mod d = 9 mod 2 = 1 is not equal 
to 0, d is incremented and becomes 3. 

Now n mod d = 9 mod 3 equals 0, so the algorithm re- 
turns d = 3 to indicate that n = 9 is composite and 3 is a 
divisor of 9. 


. When d is set to 2,..., 6, n mod d is not equal to zero. How- 


ever, when d becomes 7, n mod d = 637 mod 7 equals 0, so 
the algorithm returns d = 7 to indicate that n = 637 is com- 
posite and 7 is a divisor of 637. 


. First d is set to 2. Since n mod d = 3738 mod 2 equals 0, the 


algorithm returns d = 2 to indicate that n = 3738 is composite 
and 2 is a divisor of 3738. 


9. 47 12. 17 15. 1 
18. 20 21. 13 24, 32-73-11 
25. (For Exercise 14) 25 


28. 


31. 


36. 


39. 


40. 


43. 
46. 


49. 


Since d divides m, there exists g such that m = dq. Multi- 
plying by n gives mn = d(qn). Therefore, d divides mn (with 
quotient gn). 


Since a divides b, there exists g, such that b = aq. Since b 
divides c, there exists gz such that c = bq2. Now 


c¢ = bqz = (aqi)q2 = a(q192). 
Therefore, a divides c (with quotient g1q2). 


Basis Step(n=1) [J)_) Fi = Fo =3=5-2=Fi -2. 
Inductive Step Assume true for n. Now 
n n—1 
[[*= (i n) = F,(Fn — 2) 
i=0 i=0 
= (2° 4+ DO” —)p= 2?" —-1= Fy, —2. 


We use the identity 


c-1=(C-1(Ch! +087 ++ + CHD. 
Now suppose that m = ab,a > 1 and b > 1. Taking C = 2° 
and n = b, we find that 


(2%)? — te (24 — 12%)?! (2%)? pete tOe ot 1). 


Thus 2% — 1 divides (22) —1 = 2” — 1. Since a > 1, 
2 — 1 > 1. Therefore 2” — | is composite. 

Since 2 is not the product of two or more elements in E, it is 
E-prime. 

Since 12 = 2-6, it is E-composite. 

Notice that 2 +m, where m is odd, is E-prime since it is not the 
product of two or more elements in E. It follows that the set of 
E-primes is infinite. 

The E-prime 18 divides 6-6 (36 = 2-18), but 18 does not 
divide 6. 


Section 5.2 Review 


1 


n 

F So glo! 
i=0 
n 

. S hile! 
i=0 


. Perform the computation )77_9 5;2! in decimal. 


n 
2. S752! 
i=0 


4. [1+1gn] 


. Divide the number to be converted to binary by 2. The remain- 
der gives the 1’s bit. Divide the quotient by 2. The remainder 
gives the 2’s bit. Continue. 


7. Perform the computation )7'__9 h;16! in decimal. 


8. Divide the number to be converted to hexadecimal by 16. The 


10. 


remainder gives the number of 1’s. Divide the quotient by 16. 
The remainder gives the number of 16’s. Continue. 


. Use the ordinary algorithm for adding decimal numbers to add 
binary numbers—except replace the decimal addition table by 
the binary addition table. 


Use the ordinary algorithm for adding decimal numbers to add 
hexadecimal numbers—except replace the decimal addition 
table by the hexadecimal addition table. 


11. 


12. 


Let 


m 


n= Ss bj2! 
i=0 


be the binary expansion of n. Using repeated squaring, com- 
pute a!,a’,a*,a’,...,a’". Then 


m 

m pai i 

a qui-obi2 = A Eas . 
i=0 


Proceed as described in the solution to Exercise 11, only use 
the formula 


ab mod z = [(a mod z)(b mod z)] mod z. 


Section 5.2 


1. 
10. 
17. 
26. 
35. 
38. 
43. 


45. 
51. 
57. 


59. 


62. 


6 4.7 7. 1585 

163 11. 9 14. 32 

100010 20. 110010000 23. 11000 

1001000 29. 58 32. 2563 

(For Exercise 11) 9 

FE 41. 3DBF9 

2010 cannot represent a number in binary because 2 is an ille- 


gal symbol in binary. 2010 could represent a number in either 
decimal or hexadecimal. 


51 48. 4570 
(For Exercise 11) 11 54, (For Exercise 45) 33 


9450 cannot represent a number in binary because 9, 4, and 5 
are illegal symbols in binary. 9450 cannot represent a number 
in octal because 9 is an illegal symbol in octal. 9450 represents 
a number in either decimal or hexadecimal. 


The algorithm begins by setting result to 1 and x to a. Since 
n = 16 > 0, the body of the while loop executes. Since 
n mod 2 is not equal to 1, result is not modified. x becomes 
a’, and n becomes 8. 

Since n = 8 > 0, the body of the while loop executes. 
Since n mod 2 is not equal to 1, result is not modified. x be- 
comes a’, and n becomes 4. 

Since n = 4 > 0, the body of the while loop executes. 
Since n mod 2 is not equal to 1, result is not modified. x be- 
comes a®, and n becomes 2. 

Since n = 2 > 0, the body of the while loop executes. 
Since n mod 2 is not equal to 1, result is not modified. x be- 
comes a!®, and n becomes 1. 

Since n = 1 > O, the body of the while loop exe- 
cutes. Since n mod 2 is equal to 1, result becomes result * x = 
1 x a!® = a!®. x becomes a*2, and n becomes 0. 

Since n = 0 is not greater than 0, the while loop termi- 
nates. The algorithm returns result, which is equal to a!®. 


The algorithm begins by setting result to | and x toa mod z = 
5 mod 21 = 5. Since n = 10 > 0, the body of the while loop 
executes. Since n mod 2 is not equal to 1, result is not mod- 
ified. x is set to x * x mod z = 25 mod 21 = 4, and n is set 
to 5. 


65. 
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Since n = 5 > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result is set to (result*x) mod z = 
4 mod 21 = 4. x is set tox * x mod z = 16 mod 21 = 16, and 
nis set to 2. 

Since n = 2 > 0, the body of the while loop executes. 
Since n mod 2 is not equal to 1, result is not modified. x is set 
to x * x mod z = 256 mod 21 = 4, and nis set to 1. 

Since n = | > 0, the body of the while loop executes. 
Since n mod 1 is equal to 1, result is set to (result*x) mod z = 
16 mod 21 = 16. x is set to x * x mod z = 16 mod 21 = 16, 
and nis set to 0. 

Since n = 0 is not greater than 0, the while loop 
terminates. The algorithm returns result, which is equal to 
a" mod z = 5!° mod 21 = 16. 


If m, is the highest power of 2 that divides m, then m = 2p, 
where p is odd. Similarly, if ng is the highest power of 2 that 
divides n, then n = 2g, where q is odd. Now mn = 2+" pq. 
Since pq is odd, mg +n, is the highest power of 2 that divides 
mn, and the result follows. 


Section 5.3 Review 
1. See Algorithm 5.3.3. 


. If a is a nonnegative integer, b is a positive integer, and r = 


amod J, then gced(a, b) = gced(b, r). 


» a> fn42 and b > fn41 
+ logs; 2m/3 


. Write the nonzero remainders as found by the Euclidean algo- 


rithm in the form 


r=n—dq 


in the order in which the Euclidean algorithm computes them. 
Substitute the formula for the next-to-last remainder into the 
last equation. Call the resulting equation E;. Substitute the 
second-to-last formula for the remainder into FE. Call the re- 
sulting equation £2. Substitute the third-to-last formula for the 
remainder into E>. Continue until the first formula for the re- 
mainder is substituted into the last E; equation. 


6. sis the inverse of n mod z if ns mod z = 1. 


7. Find numbers s’ and ¢’ such that s’n + ¢@ = 1. Sets = 


s’ mod ¢. 


Section 5.3 


1. 
4. 


7. 


10. 


90 mod 60 = 30; 60 mod 30 = 0; so gcd(60, 90) = 30. 


825 mod 315 = 195; 315mod195 = 120; 195 mod 
120 = 75; 120 mod 75=45; 75 mod 45=30; 45 mod 
30 = 15; 30 mod 15 =0; so ged(825, 315) = 15. 


4807 mod 2091=625; 2091 mod 625=216; 625 mod 
216= 193; 216 mod 193=23; 193 mod 23=9; 23 mod 
9=5; 9mod5=4; Smod4=1; 4mod1=0; _ so 
gcd(2091, 4807) = 1. 

490256 mod 337 = 258; 337 mod 258=79; 258 mod 79 = 


21; 79 mod 21=16; 21 mod 16=5; 16 mod5=1; 5 mod 
1=0; so ged(490256, 337) = 1. 
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(For Exercise 10) The nonzero remainders in the order they 
are computed by the Euclidean algorithm are 


490256 mod 337 = 258 
337 mod 258 = 79 
258 mod 79 = 21 


79 mod 21 = 16 
21 mod 16 = 5 
16 mod 5 = 1. 


Writing these equations in the form r = n — dq, where r is the 
remainder and gq is the quotient, yields 


258 = 490256 — 337-1454 
79 = 337 — 258-1 
21 = 258 — 79-3 


16 = 79 — 21-3 
5 = 21- 16-1 
1 = 16—5-3. 


Substituting the next-to-last formula for 5 into the last equa- 
tion yields 


1 = 16— (21 — 16-1)-3 = 16-4—21-3. 


Substituting the second-to-last formula for 16 into the previ- 
ous equation yields 


1 = (79 — 21-3)4— 21-3 = 79-4 —21-15. 


Substituting the third formula for 21 into the previous equation 
yields 


1=79-4 


(258 — 79 +3)15 = 79-49 — 258-15. 


Substituting the second formula for 79 into the previous equa- 
tion yields 


1 = (337 


258)49 — 258-15 = 337-49 — 258 - 64. 


Finally, substituting the first formula for 258 into the previous 
equation yields 
1 = 337-49 — (490256 — 337 - 1454)64 
= 337-93105 — 490256 - 64. 


Thus, if we set s = —64 and t = 93105, 
5 +490256 + +337 = gcd(490256, 337) = 1. 


gcd_recurs(a, b) { 
make a largest 
if (a < b) 
swap(a, b) 
return gcd_recurs1(a, b) 


} 


gcd_recurs1(a, b) { 
if (b == 0) 
return a 
r=amodb 
return gcd_recurs1(b, r) 


} 


19. 


22. 


25. 


29. 


30. 


33. 


35. 


gcd_subtract(a, b) { 
while (true) { 
// make a largest 
if (a < b) 
swap(a, b) 
if (b==0) 
return a 
a=a-b 
} 
} 


By Theorem 5.3.5, a pair a, b, a > b, would require n modu- 
lus operations when input to the Euclidean Algorithm only if 
a > fn42 and b > fn4i. Now fog = 514229, fo = 832040, 
and f3; = 1346269. Thus, no pair can require more than 
28 modulus operations in the worst case because 29 modulus 
operations would require one member of the pair to exceed 
1000000. The pair 514229, 832040 itself requires 28 modulus 
operations. 


We prove the statement by induction on n. 
Basis Step (n= 1) gcd(fi, 2) = ged(1, 1) = 1 
Inductive Step Assume that gced(f,, fr41) = 1. Now 


gcd(fn+t,fn42) = gced(fnti. fn rit fn) = ged(fn risfn) = 1. 


We use Exercise 18 with a = fr41 +f and b = f;,41 to justify 
the second equality. 


If m = 1, the result is immediate, so we assume that m > 1. 
Suppose that f is one-to-one and onto. Since m > 1, 
there exists x such that f(x) =nx mod m= 1. Thus there ex- 
ists q such that 
nx =mq+l. 
Let g be the greatest common divisor of m and n. Then g di- 
vides both m and n and also nx — mg = 1. Therefore, g = 1. 
Now suppose that gcd(m,n)=1. By Theorem 5.3.7, 
there exist s and ¢ such that 
1=sm-+ tn. 
Let k € X. Then 
k = msk + ntk. 


Therefore, 


(ntk) mod m = (k — msk) mod m = k mod m = k. 


We may argue as in the Computing an Inverse Modulo an In- 
teger subsection that if we set x = tk mod m, then f(x) = 
(ntk) mod m. Therefore, f is onto. Since f is a function from 
X to X, f is also one-to-one. 


Ifa £40,a=1-a+0-b > 0. In this case, a € X. Similarly, 
ifbA0,bEX. 

Suppose that g does not divide a. Thena = qg+r, 0<r<g. 
Since g € X, there exist s and ¢ such that g = sa + tb. Now 
qg =a tb) = (1 — qs)a + (—qt)b. 
Therefore r € X. Since g is the least element in X and 0 < r < 


g, we have a contradiction. Therefore, g divides a. Similarly, 
g divides b. 


gcd(3, 2) = ged(2, 1) = ged, 0) = 1,s = 2 


r=a q(sa 4 


38. gcd(47, 11) = ged(11, 3) = ged(3, 2) = ged(2, 1) = ged(1, 0) 
=1,9=30 


41. gcd(243, 100) = ged(100, 43) = ged(43, 14) = ged(14, 1) = 
gcd(1, 0) = 1, s=226 


42. We argue by contradiction. Suppose that 6 has an inverse mod- 
ulo 15; that is, suppose that there exists s such that 6s mod 

15 = 1. Then there exists g such that 

15 — 6sq = 1. 


Since 3 divides 15 and 3 divides 6sq, 3 divides 1. We have 
obtained the desired contradiction. Thus, 6 does not have an 
inverse modulo 15. 

That 6 does not have an inverse modulo 15 does not con- 
tradict the result preceding Example 5.3.9. In order to guar- 
antee that n has an inverse modulo @, the result preceding 
Example 5.3.9 requires that gcd(n, @) = 1. In this exercise, 
gcd(6, 15) = 3. 


Section 5.4 Review 
1. Cryptology is the study of systems for secure communications. 
2. A cryptosystem is a system for secure communications. 


3. To encrypt a message is to transform the message so that only 
an authorized recipient can reconstruct it. 


4. To decrypt a message is to transform an encrypted message so 
that it can be read. 


5. Compute c = a” mod z and send c. 


6. Compute c* mod z. z is chosen as the product of primes p and 
q. 8 satisfies ns mod (p — 1)(q— 1) = 1. 


7. The security of the RSA encryption system relies mainly on 
the fact that at present there is no efficient algorithm known 
for factoring integers. 


Section 5.4 
1. FKKGEJAIMWQ 
4. BUSHWHACKED 
7. 53 
9. z= pq = 17-23 = 391 
12. c =a" mod z= 1017! mod 391 = 186 
14. z= pq = 59-101 = 5959 
17. c =a" mod z = 5844! mod 5959 = 3237 


Chapter 5 Self-Test 


1. [Section 5.1] Ford = 2,..., 6,539 mod d is not equal to zero, 
so d increments. When d = 7, 539 mod d equals zero, so the 
algorithm returns d = 7 to indicate that 539 is composite and 
7 is a divisor of 539. 

2. [Section 5.2] 150 

3. [Section 5.1] 539 = 77-11 

4. [Section 5.3] gcd(480, 396) = gcd(396, 84) = gcd(84, 60) = 
gcd(60, 24) = gcd(24, 12) = gced(12, 0) = 12 


rrntnnm 


10. 
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. [Section 5.1] 77-13? 

. [Section 5.1] 2-57-74 -134-17 
. [Section 5.2] 110101110, LAE 
. [Section 5.3] Since 


2(100,000,000) 


2 
5 = log3/ 1004 + log3;2 5 


1 
0983/2 3 


= (41o0g3/7 100) — 1 
= 4(11.357747) — 1 = 44.430988, 
an upper bound for the number of modulus operations re- 


quired by the Euclidean algorithm for integers in the range 0 
to 100,000,000 is 44. 


. [Section 5.2] The algorithm begins by setting result to | and 


x to a. Since n = 30 > 0, the body of the while loop exe- 
cutes. Since n mod 2 is not equal to 1, result is not modified. 
x becomes a”, and n becomes 15. 

Since n= 15 > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result becomes result*xx = 
1 «a? = a2. x becomes a’*, and n becomes 7. 

Since n=7 > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result becomes result *x = ax 
a’ = a°. x becomes a®, and n becomes 3. 

Since n =3 > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result becomes result * x = a® x 
a = a'*. x becomes a6, and n becomes 1. 

Since n= 1> 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result becomes result * x = 
a'4 x q!® = a9. x becomes a*”, and n becomes 0. 

Since n = 0 is not greater than 0, the while loop 
terminates. The algorithm returns result, which is equal 
to a”, 


[Section 5.2] The algorithm begins by setting result to | and x 
to a mod z = 50 mod 11 = 6. Since n = 30 > 0, the body of 
the while loop executes. Since n mod 2 is not equal to 1, result 
is not modified. x is set to x * x mod z = 36 mod 11 = 3, and 
nis set to 15. 

Since n = 15 > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result is set to (result*xx) mod z = 
3 mod 11 = 3. xis set tox * x mod z= 9 mod 11 = 9, andn 
is set to 7. 

Since n = 7 > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result is set to (result*x) mod z = 
27 mod 11 = 5. xis set tox *x mod z = 81 mod 11 = 4, and 
nis set to 3. 

Since n = 3 > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result is set to (result*x) mod z = 
20 mod 11 = 9. x is set tox *x mod z = 16 mod 11 =5, and 
nis set to 1. 

Since n = | > 0, the body of the while loop executes. 
Since n mod 2 is equal to 1, result is set to (result*x) mod z = 
45 mod 11 = 1. xis set tox * x mod z = 25 mod 11 = 3, and 
nis set to 0. 

Since n = 0 is not greater than 0, the while loop 
terminates. The algorithm returns result, which is equal to 
a” mod z = 50°" mod 11 = 1. 
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[Section 5.3] The nonzero remainders in the order they are 
computed by the Euclidean algorithm are 
480 mod 396 = 84 
396 mod 84 = 60 
84 mod 60 = 24 
60 mod 24 = 12. 


Writing these equations in the form r = n — dq, where r is the 
remainder and gq is the quotient, yields 


84 = 480 — 396-1 
60 = 396 — 84-4 
24 = 84— 60:1 
12 = 60 — 24-2. 


Substituting the next-to-last formula for 24 into the last equa- 
tion yields 


12 = 60 — 24-2 = 60 


(84 — 60) -2 = 3-60 — 2-84. 


Substituting the second formula for 60 into the previous equa- 
tion yields 


12 = 3-(396 — 84-4) — 2-84 = 3-396 — 14-84. 


Finally, substituting the first formula for 84 into the previous 
equation yields 


12 = 3-396 


14- (480 — 396) = 17-396 — 14-480. 


Thus, if we set s = 17 andt = —14, 
s +396 + t-480 = gcd(396, 480) = 12. 


[Section 5.3] The nonzero remainders in the order they are 
computed by the Euclidean algorithm are 


425 mod 196 = 33 
196 mod 33 = 31 
33 mod 31 = 2 
31 mod 2 = 1. 
Writing these equations in the form r = n — dq, where r is the 
remainder and gq is the quotient, yields 
33 = 425 — 196-2 
31 = 196 — 33-5 
2 = 33— 31-1 
1 = 31—2-15. 


Substituting the next-to-last formula for 2 into the last equa- 
tion yields 


=i 


(33 — 31) -15 = 16-31 — 15-33. 


Substituting the second formula for 31 into the previous equa- 
tion yields 


1 = 16-(196 — 33-5) — 15-33 = 16-196 — 95-33. 


13. 


14. 
15. 
16. 


Finally, substituting the first formula for 33 into the previous 
equation yields 


1 = 16-196 — 95 - (425 — 196-2) = 206-196 — 95-425. 
Thus, if we set s’ = 206 and r’ = —95, 
s' 196 + ¢ -425 = gcd(196, 425) = 1. 


Thus s = s’ mod 425 = 206 mod 425 = 206. 


[Section 5.4] z = pq = 13-17 = 221, 6=(p—Vi(q-NY= 
12-16 = 192 
[Section 5.4] s = 91 


[Section 5.4] c = a” mod z = 144!° mod 221 = 53 
[Section 5.4] a = c’ mod z = 28”! mod 221 = 63 


Section 6.1 Review 


1. If an activity can be constructed in ¢ successive steps and 
step | can be done in n; ways, step 2 can be done in nz ways, 
.., and step ¢ can be done in n; ways, then the number of 
different possible activities is nj -n2---n;. As an example, if 
there are two choices for an appetizer and four choices for a 
main dish, the total number of dinners is 2-4 = 8. 


2. Suppose that X;,..., X; are sets and that the ith set X; has 
n; elements. If {X,,..., X;} is a pairwise disjoint family, the 
number of possible elements that can be selected from X or 
X2 or... or X; is my +n2 +---+7;. As an example, suppose 
that within a set of strings, two start with a and four start with 
b. Then 2 + 4 = 6 start with either a or b. 


3. |X UY| = |X| + |Y¥| — |X 7 Y|. An example of the use of 
the Inclusion-Exclusion Principle for two sets is provided by 
Example 6.1.14. 


Section 6.1 


1. 2-4 4. 8-4-5 7. 6 
10. 5-4-3 13. 6+124+9 16. m+n 
19. 141 


22. There are 5-4-3 strings that begin with AC and 5-4-3 


strings that begin with DB. Thus the answer is 5-4-3 + 
5-4-3. 


25. Since there are three kinds of cabs, two kinds of cargo beds, 


and five kinds of engines, the correct number of ways to per- 
sonalize the big pickups is 3 - 2 - 5 = 30, not 32. 


26. 3: (1,3), (2, 2), (3, 1), where (b, r) means the blue die shows 


b and the red die shows r. 


29. 6: (2,1), (2,2), (2,3), (2,4), (2,5), (2,6), where (b, r) 


means the blue die shows b and the red die shows r. 


32. Since each die can show any one of five values, by the Mul- 


34. 10-5 


tiplication Principle there are 5 -5 outcomes in which neither 
die shows 2. 


37. 24 40. 8 


43. 2°. (Once the first four bits are assigned values, the last four 


bits are determined.) 


44. 
53: 
61. 
64. 


67. 
70. 


77. 
83. 


86. 


89. 


92. 


95. 


5-43 47. 3-443 50. 5° 
4.3 56. 5 — 4 58. 200-S+1 
40 


One one-digit number contains 7. The distinct two-digit 
numbers that contain 7 are 17,27,...,97 and 70,71,..., 
76, 78,79. There are 18 of these. The distinct three-digit 
numbers that contain 7 are 107 and Ixy, where xy is one of 
the two-digit numbers listed above. The answer is 1+ 18+19. 


5+(84+7+--+)D+(7+6+-:-+1) 
10! 73. BY(S)Q2VB) 
210 80. 2!4(2!6 — 2) 
We count the number of n x n matrices that represent sym- 
metric relations on an n-element set. Example 6.1.7 showed 
that there are n? — n entries off the main diagonal. Of these, 
half, (n2 —n)/2, are above the main diagonal. Since there are 
n entries on the main diagonal, there are 

r—n r+_n 


- t= "3 

entries on or above the main diagonal. These entries can be 
assigned values arbitrarily and there are gin’-+n)/2 ways to as- 
sign these values. Because the relation is symmetric, once 
these values are assigned, the values below the main diago- 
nal are determined. (Entry ij is 1 if and only if entry ji is 1.) 
Therefore there are 20” +”)/2 symmetric relations on an n- 
element set. 


We count the number of 1 xn matrices that represent reflexive 
and antisymmetric relations on an n-element set. 

Since the relation is reflexive, the main diagonal must 
consist of 1’s. For i and j satisfying 1 < i < j < n, we can 
assign the entries in row i, column j and row j, column i in 
three ways: 


Row i, Column j Row j, Column i 


0 0 
1 0 
0 1 


Since there are (n* — n)/2 values of i and j satisfying 
1 <i <j <n, we can assign the off-diagonal values in 
inn) /2 ways. Therefore there are 3-0/2 reflexive and 
antisymmetric relations on an n-element set. 


The truth table of an n-variable function has 2” rows since 
each variable can be either T or F. Each row of the table can 
assign the function the value T or F. Since there are 2” rows, 
the function value assignments can be made in 2?" ways. 
Therefore there are 2’ truth tables for an n-variable function. 


By the Inclusion-Exclusion Principle, the total number of 
possibilities = number of strings that begin 100 + number of 
strings that have the fourth bit | — number of strings that be- 
gin 100 and have the fourth bit 1, so the answer is 254.9794. 


By the Inclusion-Exclusion Principle, the total number of 
possibilities = number in which Connie is chairperson + 
number in which Alice is an officer — number in which 


99. 


102. 
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Connie is chairperson and Alice is an officer, so the answer 
is5-44+3-5-4—2-4. 

Let F be the set of students taking French, let B be the set of 
students taking business, and let M be the set of students tak- 
ing music. We are given that |FOBNM| = 10, |FNB| = 36, 
|F A M| = 20, |BN M| 18, |F| 65, |Bl 76, and 
|M| = 63. By Exercise 98, 


|FUBUM| = |F|+|B| +|M| —|F OB) —|FNM| 
—|BOM|+|FABNM| 
= 65+ 76 + 63 — 36 — 20 


18 + 10 = 140. 


Thus 140 students are taking French or business or music. 
Since there are 191 students, 191 — 140 = 51 are not any of 
the three courses. 
Let X be the set of integers between | and 10,000 that are 
multiples of 3, let Y be the set of integers between 1 and 
10,000 that are multiples of 5, and let Z be the set of integers 
between 1 and 10,000 that are multiples of 11. 

A multiple of 3 is of the form 3k for some integer k, 
so a multiple of 3 between | and 10,000 satisfies 


1 < 3k < 10,000. 


Dividing by 3, we obtain 


= .9993.333 cis 


Thus the multiples of 3 between 1 and 10,000 correspond to 
the values k = 1, 2,..., 3333. Therefore there are 3333 mul- 
tiples of 3 between | and 10,000. Similarly, there are 2000 
multiples of 5 between | and 10,000 and 909 multiples of 11 
between | and 10,000. Therefore |X| = 3333, |Y| = 2000, 
and |Z| = 909. 

A number that is a multiple of 3 and 5 is a multiple 
of 15. Arguing as in the previous paragraph, we find that 
there are 666 multiples of 3 and 5 between | and 10,000. 
Similarly, there are 303 multiples of 3 and 11 between | and 
10,000, there are 181 multiples of 5 and 11 between | and 
10,000, and there are 60 multiples of 3, 5, and 11 between 
1 and 10,000. Therefore |X NM Y| = 666, |X M Z| = 303, 
|YO Z| = 181, and | XN YM Z| = 60. By Exercise 98, 


IXUYUZ| = [X|+|¥|+ |Z] -|XNY|-|xnZ| 
-|YAZ|+1xn YNZ 
= 3333 + 2000 + 909 — 666 — 303 — 181 


+60 = 5152. 


Therefore there are 5152 integers between | and 10,000 that 
are multiples of 3 or 5 or 11 or any combination thereof. 


Section 6.2 Review 


1. 
2. 


An ordering of x}, ..., Xn 


There are n! permutations of an n-element set. There are 
n ways to choose the first item, n — | ways to choose the 
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second item, and so on. Therefore, the total number of per- 
mutations is 


n(n—1)---2-l=nl. 


. An ordering of r elements selected from x1, ..., Xn 


. There are n(n — 1)---(n — r + 1) r-permutations of an n- 


element set. There are n ways to choose the first item, n — 1 
ways to choose the second item, ..., and nm — r + 1 ways 
to choose the rth element. Therefore, the total number of 
r-permutations is 


n(n—1)---(n—r+1). 


5. P(n, r) 
6. An r-element subset of {x1, ..., Xn} 
7. There are 

n! 


8. 


r-combinations of an n-element set. 

There are P(n, r) ways to select an r-permutation of 
an n-element set. This r-permutation can also be constructed 
by first choosing an r-combination [C(n, r) ways] and then 
ordering it [r! ways]. Therefore, P(n, r) = C(n, r)r!. Thus 


P(n, r) n(n—1)---(n—r+1) 


r! r! 
n!} 


~ (a—nirl’ 


C(n, r) = 


C(n, r) 


Section 6.2 


1. 
4. 


16. 


19. 
21. 
24. 


» P(1,3) = 11-10-9 
13. 


4!=24 


abc, acb, bac, bea, cab, cba, abd, adb, bad, bda, dab, dba, 
acd, adc, cad, cda, dac, dca, bcd, bdc, cbhd, cdb, dbc, dcb 


10. 3! 


4! contain the substring AE and 4! contain the substring EA; 
therefore, the total number is 2 - 4!. 


We first count the number JN of strings that contain either the 
substring AB or the substring BE. The answer to the exercise 
will be: Total number of strings — N or 5! — N. 

According to Exercise 71, Section 6.1, the number of 
strings that contain AB or BE = number of strings that con- 
tain AB + number of strings that contain BE — number of 
strings that contain AB and BE. A string contains AB and BE 
if and only if it contains ABE and the number of such strings 
is 3!. The number of strings that contain AB = number of 
strings that contain BE = 4!. Thus the number of strings that 
contain AB or BE is 4!+4!—3!. The solution to the exercise is 


5!— (2-4! — 3). 


8!P(9, 5) = 81(9-8-7-6-5) 
10! 


Fix a seat for a Jovian. There are 7! arrangements for the 
remaining Jovians. For each of these arrangements, we can 
place the Martians in five of the eight in-between positions, 


25. 
31. 
33. 
36. 


39. 
42. 


43. 


46. 


49. 


which can be done in P(8, 5) ways. Thus there are 7!P(8, 5) 
such arrangements. 


C(4,3) =4 28. C(11, 3) 
C(17, 0) + C7, 1) + C17, 2) + C7, 3) +4 
C(13, 5) 


A committee that has at most one man has exactly one man 
or no men. There are C(6, 1)C(7, 3) committees with exactly 
one man. There are C(7, 4) committees with no men. Thus 
the answer is C(6, 1)C(7, 3) + C(7, 4). 


C(10, 4)C(12, 3)C(4, 2) 


First, we count the number of eight-bit strings with no two 
0’s in a row. We divide this problem into counting the num- 
ber of such strings with exactly eight 1’s, with exactly seven 
1’s, and so on. 

There is one eight-bit string with no two 0’s in a row 
that has exactly eight 1’s. Suppose that an eight-bit string 
with no two 0’s in a row has exactly seven 1’s. The 0 can 
go in any one of eight positions; thus there are eight such 
strings. Suppose that an eight-bit string with no two 0’s ina 
row has exactly six 1’s. The two 0’s must go in two of the 
blanks shown: 


-1_1_1_1_1_1_- 
Thus the two 0’s can be placed in C(7, 2) ways. Thus there 
are C(7, 2) such strings. Similarly, there are C(6, 3) eight-bit 
strings with no two 0’s in a row that have exactly five 1’s and 
there are C(5, 4) eight-bit strings with no two 0’s in a row 
that have exactly four 1’s in a row. If a string has less than 
four 1’s, it will have two 0’s in a row. Therefore, the number 
of eight-bit strings with no two 0’s in a row is 
1+8+C(7, 2) + C(6, 3) + C(5, 4). 
Since there are 2° eight-bit strings, there are 
28 —[1+8+C(7, 2) +C(, 3) + C(5, 4)] 


eight-bit strings that contain at least two 0’s in a row. 


1-48 (The four aces can be chosen in one way and the fifth 
card can be chosen in 48 ways.) 


First, we count the number of hands containing cards in 
spades and hearts. Since there are 26 spades and hearts, there 
are C(26, 5) ways to select five cards from among these 26. 
However, C(13, 5) contain only spades and C(13, 5) contain 
only hearts. Therefore, there are 


C(26, 5) — 2C(13, 5) 


ways to select five cards containing cards in spades and 
hearts. 

Since there are C(4, 2) ways to select two suits, the 
number of hands containing cards of exactly two suits is 


C(4, 2)[C(26, 5) — 2C(13, 5)]. 


There are nine consecutive patterns: A2345, 23456, 34567, 
45678, 56789, 6789T, 789TJ, 89TJQ, 9TIQK. Correspond- 
ing to the four possible suits, there are four ways for each 
pattern to occur. Thus there are 9-4 hands that are consecu- 
tive and of the same suit. 


52. 
55. 


58. 


60. 
68. 


72. 


73. 


78. 


85. 
87. 


92. 
95. 


C(52, 13) 


1 - C(48, 9) (Select the aces, then select the nine remaining 
cards.) 


There are C(13, 4)C(13, 4)C(13, 4)C(13, 1) hands that con- 
tain four spades, four hearts, four diamonds, and one club. 
Since there are four ways to select the three suits to have four 
cards each, there are 4C(13, 4)?C(13, 1) hands that contain 
four cards of three suits and one card of the fourth suit. 


210 63. 2° 65. C(50, 4) 


C(50, 4) — C(46, 4) (Total number — number with no defec- 
tives) 


Order the 2n items. The first item can be paired in 2n — 1 
ways. The next (not yet selected) item can be paired in 2n— 3 
ways, and so on. 


A list of votes where Wright is never behind Upshaw and 
each receives r votes is a string of r W’s and r U’s where, 
reading the string from left to right, the number of W’s is 
always greater than or equal to the number of U’s. Such a 
string can also be considered a path of the type described in 
Example 6.2.23, where W is a move right and U is a move up. 
Example 6.2.23 proved that there are C, such paths. There- 
fore, the number of ways the votes can be counted in which 
Wright is never behind Upshaw is C,. 

By Exercise 77, k vertical steps can occur in C(k, [k/2]) 
ways, since, at any point, the number of up steps is greater 
than or equal to the number of down steps. Then, n — k hor- 
izontal steps can be inserted among the k vertical steps in 
C(n, k) ways. Since each horizontal step can occur in two 
ways, the number of paths containing exactly k vertical steps 
that never go strictly below the x-axis is 


Cik, [k/21) Cn, K)2"*. 


Summing over all k, we find that the total number of paths is 


n 
So Ck, [k/21)C(n, K)2"*. 
k=0 
The solution counts ordered hands. 


Once—when we choose the five slots with 0’s and 1’s for the 
remaining slots. 


Use Theorems 3.4.1 and 3.4.8. 


Note that 
n—-i_ on 
aS 
k-i7k 
fori=0,1,...,k— 1. Therefore, 
Cin, = n! _ nal @-k+D 
(n — k)!k! k(k—1)--+1 
nn—1 n—-k+1 
kk-1 1 
nn n 
= ee 
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Also, 
n(n—1)---(n—k+1) — nn-een rk 
os kl iy amas Te 
Section 6.3 Review 
1. n!/(n1!--+-n;!). The formula derives from the Multiplication 


Principle. We first assign positions to the 1; items of type 1, 
which can be done in C(n, n;) ways. Having made these as- 
signments, we next assign positions to the 2 items of type 2, 
which can be done in C(n — nj, nz) ways, and so on. The 
number of orderings is then 


C(n, ny)C(n — ny, n2)---C(n— ny — +--+ — m1, Nz), 


which, after applying the formula for C(n, k) and simplifica- 
tion, gives n!/(n,!---n,!). 


. C(k+t— 1,t—- 1). The formula is obtained by considering 


k+t —1 slots and k +t — 1 symbols consisting of k x’s and 
t—1|’s. Each placement of these symbols into the slots deter- 
mines a selection. The number 7; of x’s between the first and 
second | represents n; copies of the first element in the 
set. The number nz of x’s between the second and third | 
represents n2 copies of the second element in the set, and 
so on. Since there are C(k + t — 1,t — 1) ways to select 
the positions of the |’s, there are also C(k + ¢ — 1,t — 1) 
selections. 


Section 6.3 


1. 
7. 


10. 
13. 


17. 
18. 
24. 


25. 
31. 


36. 
45. 
51. 


5! 4. 6!/(4!2!) 
Permute one token with four S’s and other tokens with one 
letter each from among ALEPERON, which can be done in 
91/2! ways. 
c(6+6-—1,6-1) 
Each such route can be designated by a string of i X’s, j Y’s, 
and k Z’s, where an X means move one unit in the x-direction, 
a Y means move one unit in the y-direction, and a Z means 
move one unit in the z-direction. There are 

G+j+b! 

iljtk! 

such strings. 
10!/(5! +3! +2!) 
c(10 +3 — 1, 10) 21. C(9+2-1,9) 
Four, since the possibilities are (0, 0), (2, 1), (4, 2), and 
(6, 3), where the pair (7, g) designates r red and g green balls. 
c(i5 +3 — 1, 15) 28. C(13 + 2 — 1, 13) 


C(12+4—1, 12) 

— (C7 +4-1,7)+C(6+4-1,6) 
+C(3+4-1,3)+C2+4-1,2) 
—Cc’+4-1,1] 

521/(13!)4 39. C(20, 5) 42. C(20, 5)? 


C15 + 6— 1, 15) 48. C(10 + 12 — 1, 10) 


Apply the result of Example 6.3.9 to the inner k — 1 
nested loops of that example. Next, write out the number 
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of iterations for i; = 1; then i; = 2; and so on. By Example 
6.3.9, this sum is equal to C(k +n — 1,k). 


Section 6.4 Review 


1. 


Let a = s;---Sp and B = t---t, be strings over {1, 
2,...,n}. Then @ is lexicographically less than 6 if either 
p< qand s; = ¢; for alli = 1,..., p; or for some i, 5; 4 fj 
and for the smallest such i, we have s; < tj. 


. Given a string s;...s,-, which represents the r-combination 


{s1,...,5,}, to find the following string t, ...t,, find the 
rightmost element s,, that is not at its maximum value. (s,s 
maximum value is n, s,_;’s maximum value is n — 1, etc.) 
Then set ¢; = s; fori = 1,...,m— 1; set ty = Sm + 1; and 
set tm41 +++ t = (Sm + 2)(Sm +3) +++. Begin with the string 
12---r. 


. Given a string s, which represents a permutation, to find the 


following string, find the rightmost digit d of s whose right 
neighbor exceeds d. Find the rightmost element r that satis- 
fies d < r. Swap d and r. Finally, reverse the substring to the 
right of d’s original position. Begin with the string 12---n. 


Section 6.4 


1. 1357 4. 12435 
7. (For Exercise 1) At lines 8-12, we find the rightmost s,, not 
at its maximum value. In this case, m = 4. At line 14, we 
increment s,,. This makes the last digit 7. Since m is the 
rightmost position, at lines 16 and 17, we do nothing. The 
next combination is 1357. 
9. 123, 124, 125, 126, 134, 135, 136, 145, 146, 156, 234, 235, 
236, 245, 246, 256, 345, 346, 356, 456 
12... 12,21 14. 12379 
17. None, since each combination has the digits in increasing 


24. 


order. 


. 123456897 


. A permutation that ends 1234 begins with a permutation of 


{5, 6, 7, 8, 9}, and there are 5! of these; therefore, the answer 
is 5!. 


Input: r,n 


Output: A list of all r-combinations of {1, 2,... 


increasing lexicographic order 


,n}in 


r_comb(r, n) { 
so=—l 
fori=ltor 
si = i 
printin(s,,...,5 Sn) 
while (true) { 
m=r 
max_val=n 
while (sy, == max_val) { 
m=m—-1 
max_val = max_val — | 


if (m == 0) 
return 

Sm = Sm + 1 

forj=m+ltor 
ss=s-ittil 

printin(s,,..., Sp) 

} 
} 


27. Input: s;,..., 5, (an r-combination of {1,..., 7}), 


r,andn 


Output: 1, ..., 8, the next r-combination (The first 


r-combination follows the last r-combination.) 


next_comb(s, r, n) { 
so =n+1// dummy value 
m=r 
max_val =n 
// loop test always fails if m = 0 
while (sy, == max_val) { 


// find rightmost element not at its maximum value 


m=m—1 
max_val = max_val — | 
} 
if (m == 0) { // last r-combination detected 
5, =0 
m= 1 
} 
// increment rightmost element 
Sm =Smt+1 
// rest of elements are successors of sy», 
forj=m+ltor 


Sj = Sj-1 +1 
} 
29. Input: s;,..., 8, (an r-combination of {1,..., 7}), 
r,andn 
Output: 1, ..., s;, the previous r-combination 


(The last r-combination precedes the first 
r-combination.) 


prev_comb(s, r,n) { 
so =n // dummy value 
// find rightmost element at least 
// 2 larger than its left neighbor 


m=r 

// loop test always fails if m = 1 

while (Sm — Sn—1 == 1) 
m=m—1 


Sm =Sm—1 
if (m== 1 As, == 0) 
m=0 
// set elements to right of index m to max values 
forj=m+ltor 
Sj=n+j-r 


31. 


Input: 
Output: 


I, Sky Skt, +++, Sp, a String a, k, andn 


A list of all r-combinations of {s,, 5x41, .--, Sn} 
each prefixed by a [To list all r-combinations of 
{s1, 52, ..., 8}, invoke this function as 

r_comb2(r,s,1,n, 4), where A is the null string.] 


r_comb2(r, s, k,n, a) { 
if (r == 0) { 
println(a) 
return 
} 
if (k == n) { 
println(a, Sy) 
return 
} 
Beat "+s 
// print r-combinations containing s, 
r_comb2(r — 1, s,k + 1,n, B) 
// print r-combinations not containing sx 
if(r<n—k) 
r_comb2(r, s,k + 1,n,@) 


Section 6.5 Review 


1. 
2. 


An experiment is a process that yields an outcome. 


An event is an outcome or combination of outcomes from an 
experiment. 


. The sample space is the event consisting of all possible 


outcomes. 


. The number of outcomes in the event divided by the number 


of outcomes in the sample space 


Section 6.5 


1. 


(H, 1), (H, 2), (H, 3), (H, 4), (H, 5), (A, 6), (T, 1), (T, 2), 
(T, 3), (T, 4), (T, 5), (T, 6) 


4. (H, 1), (H, 2), (H, 3) 
5. (1, 1, C, 3), (1, 5), (2, 2), (2, 4), (2, 6), (3, 1), (3, 3), 


. Three dice are rolled. 
. 1/52 

. C(90, 4)/C(100, 4) 
. 1/103 

. 1/[C(50, 5) +36] 


(3, 5), (4, 2), (4, 4), (4, 6), (5, 1), (5, 3), (5, 5), (6, 2), 
(6, 4), (6, 6) 


11. 1/6 
17. 4/36 


4-C(13, 5) -3-C(13, 4)C(13, 2)? 
C(52, 13) 


. 1/210 

. C(10, 5/2! 
‘ 210 7310 

. 1/5! 

. 10/C(12, 3) 


41. 18/38 


44. 
49. 
52. 


55. 
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2/38 45. 1/3 
1/4 
The possibilities are: A (correct), B (incorrect), C (incor- 


rect); and A (incorrect), B (correct), C (incorrect). In the 
first case, if the student stays with A, the answer will be 
correct; but if the student switches to B, the answer will 
be incorrect. In the second case, if the student stays with A, 
the answer will be incorrect; but if the student switches to B, 
the answer will be correct. Thus, the probability of a correct 
answer is 1/2. 
There are C(10+3-— 1, 3— 1) ways to distribute 10 compact 
discs to Mary, Ivan, and Juan. If each receives at least two 
compact discs, we must distribute the remaining six discs, 
and there are C(4 + 3 — 1,3 — 1) ways to do this. Thus the 
probability that each person receives at least two discs is 
c(44+3-1,3-1) 
c(l0+3-—1,3-1)' 


Section 6.6 Review 


1. 


A probability function P assigns to each outcome x in a 


sample space S a number P(x) so that 
0 <P) <1, forallxe S 


and 


Pay =. 


xeS 


2. P(x) = 1/n, where n is the size of the sample space. 
3. The probability of E is 


NNHnwm & 


PE) = >_ PQ). 


xeE 


. P(E) + P(E) = 1 

. FE; or E> (or both) 

. Ey and £2 

~ P(E| U Eo) = P(E)) + P(E2) — P(E 9 Ep). P(E1) + P(E2) 


equals P(x) for all x € E; plus P(x) for all x € E>, which is 
equal to P(E,) + P(E), except that P(x), for x € E, M Ep, is 
counted twice. The formula now follows. 


8. Events FE; and £2 are mutually exclusive if E) 0 E, = ©. 


9. If we roll two dice, the events “roll doubles” and “‘the sum is 


10. 


11. 
12. 
13. 
14. 
15. 


16. 


odd” are mutually exclusive. 


P(E, UE2) = P(E,) + P(E2). This formula follows from the 
formula of Exercise 7 because P(E; Ez) = 0. 


E given F is the event E given that event F occurred. 
E|F 

P(E|F) = P(EN F)/P(F) 

Events E and F are independent if P(E MN F) = P(E)P(F). 


If we roll two dice, the events “get an odd number on the 
first die” and “get an even number on the second die” are 
independent. 

Pattern recognition places items into various classes based 
on features of the items. 
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Suppose that the possible classes are C),..., Cy. Suppose 
further that each pair of classes is mutually exclusive and 
each item to be classified belongs to one of the classes. For 
a feature set F, we have 


PF | Cj) P(C) 
ier PR CPCCA) 


P(G)|F) = 


The equation 


P(G:NF) _ PIC)P(G) 

P(F) ———P(F) 
follows from the definition of conditional probability. The 
proof is completed by showing that 


P(C||F) = 


n 
POF) = S| PF | C)P(Ci), 
i=l 
which follows from the fact that each pair of classes is mu- 
tually exclusive and each item to be classified belongs to one 
of the classes. 


Section 6.6 


1. 


23. 
25. 


1/8 
. P(1) = P(3) = 3/10, P(2) = P(4) = P(5) = P(6) = 1/10 
. 1—3/10 =7/10 
. P(2) = P(4) = P(6) = 1/12. P(1) = P(3) = P(5) = 3/12. 
. 1— (1/4) 


. 3(1/12)? + 33/12)? 


. Let E denote the event “sum is 6,” and let F denote the event 


“at least one die shows 2.” Then 


1\? 2 
P(ENF) = P((2, 4)) + P((4, 2)) = 2 (3) = a 


and 


PF) = PCL, 2)) + P(2, 1) + P(Q, 2)) + P((2, 3)) 
+ P((2,4)) + P(2, 5)) + P(2, 6)) + P((3, 2)) 
+ P(A, 2)) + P((S, 2)) + P((6, 2)) 


= (in) (ta) * (is) (aa) * Ge) 
6G)" 
(3) 
(ia) (is) + (is) = aa 


Therefore, 
2 
P(E|F) = ae - Ta _ 2 
(F) aa 23 
20. (T,1), (7,2), (7,3), (7,4), (T,5), (1,6), (H,3) 
Yes 
C(90, 6)/C(100, 6) 


28. 


31. =— 


34. 


37. 
40. 
43. 


46. 


47. 


50. 


53. 


57. 


59. 


1/24 
os 
cae — 15 
Let £, denote the event “children of both sexes,” and let E2 


denote the event “at most one boy.” Then 


Pai. Aes d PUE\N&)=2 
=_, =—, an =—. 
U6 2 16 ee 16 


Now 
P(E, 0 Ep) = # = = P(E) )P(E). 
4° 128 
Therefore, the events E; and £2 are not independent. 
1/210 
1— (1/2!) 


Let E denote the event “four or five or six heads,” and let F 
denote the event “at least one head.” Then 


cd, 4) | C(10,5) , C(10, 6) 
PEN F) = 210 } 210 : 210 
210 + 252 + 210 
= i = 0.65625. 
Since P(F) = 1 — (1/2!°) = 0.999023437, 
P(E|F) SIRES = 0.656891495 
~ P(F) — 0.999023437 : 


Ifk > n/2, the probability is 1. (If none of your numbers was 
drawn, there are k numbers drawn and, together with your 
k numbers that are distinct from those drawn, we account for 
k +k > n numbers, which is impossible.) 

Suppose then that k < n/2. The probability that none 
of your numbers was drawn is C(n — k, k)/C(n, k). Thus the 
probability that at least one of your numbers was drawn is 
1—C(n—k,k)/C(n, k). 

The probability that neither Maya or Chloe wins a prize is 
C(68, 20)/C(70, 20). Thus the probability that at least one 
of them wins a prize is | — C(68, 20)/C(70, 20). 


The numbers drawn include Maya’s number, but not Chloe’s. 
Assigning Maya’s number and leaving out Chloe’s leaves 
68 remaining numbers from which to draw 19 numbers. 
Thus the answer is C(68, 19)/C(70, 20). 


Let E be the event “at least one person has a birthday on 
April 1.” Then F is the event “no one has a birthday on April 
1” Now 


P(E) =1 


= 364 -364--- 364 364\" 
P(E) = 1 ( ) 


365 +365---365 365 


Let E; denote the event “over 350 pounds,” and let Ez denote 
the event “bad guy.” Then 


P(E, U Ez) = P(E)) + P(E2) — P(E, 9 E2) 
35 20 15 40 


90° 90 90° 90° 
P(A) = 0.55, P(D) = 0.10, P(N) = 0.35 


62. 


63. 


66. 


69. 


P(B) = P(B|A)P(A) + P(B|D)P(D) + P(B|N)P(N) 
= (0.10)(0.55) + (0.30)(0.10) + (0.30)(0.35) = 0.19 
We require that 
(0.95) P(H) 
(0.95) P(H) + (0.02)(1 — P(A))” 
Solving for P(H) gives P(H) = .0206. 


Yes. Suppose that E and F are independent, that is, 
P(E)P(F) = P(EN F). Now 


P(H | Pos) = 0.5 = 


P(E)P(F) = (1 — P(E))( — P(F)) 
1 — P(E) — P(F) + P(E)P(F) 
= 1— P(E) — P(F) + P(EN F). 


By De Morgan’s law for sets, 


ENF=EUF; 
thus, 
P(ENF) = P(EUF) 
= 1— P(EUF) 
= 1—-[P(£) + P(F) — P(EN F)] 
= 1—- P(E) — P(F)+ P(EN F). 
Therefore, 


P(E)P(F) = P(ENF), 


and E and F are independent. 


Let E; be the event “runner completes the marathon on 
attempt 7.” The error in the reasoning is assuming that 
P(E2) = 1/3 = P(E3). In fact, P(Ex) # 1/3 # P(E3) 
because, if the runner completes the marathon, it is not run 
again. Although P(E;) = 1/3, 


P(E2) = P(fail on attempt | and succeed on attempt 2) 
= P(fail on attempt 1)P(succeed on attempt 2) 


2° <b. 2 
oe es 
Similarly, 
Piya f te", 
33-3 27 


Thus, the probability of completing the marathon is 


P(E, UE) U E3) = P(E\) + P(E2) + P(E3) 
1 2 4 19 
= | = — = 0.704, 
3 = 9 27 27 
which means that there is about a 70 percent chance that the 
runner will complete the marathon—not exactly a virtual 


certainty! 


Section 6.7 Review 


1. 


If a and b are real numbers and 7 is a positive integer, then 


n 
(a+by"= s C(n, Ka" *b*. 
k=0 


2. 


4. 
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In the expansion of 


(a+ b)" =(a+b)(a+b)---(a+b), 
eS 
n factors 


the term a’~*b* arises from choosing b from k factors and a 
from the other n — k factors, which can be done in C(n, k) 


ways. Summing over all k gives the Binomial Theorem. 


. Pascal’s triangle is an arrangement of the binomial coeffi- 


cients in triangular form. The border consists of 1’s, and any 
interior value is the sum of the two numbers above it: 


C(n,0) = C(n,n) = 1, for alln > 0; and C(n+1,k) = 
C(n,k — 1) + C(n,k), forall l <k<n 


Section 6.7 


1. 
3. 
9. 


10. 
17. 


xt + Arty + 6x7 y? + day? + yt? 
C(I, 7)x4y? 6. 5,987,520 
C(7, 3) + C(5, 2), since 


(at Jax +x)? (a+x) =[(a+x) + VaxP (a +x) 
= (a+x)’ +2/Sax(a+x)° +ax(a +x). 


C10 +3 — 1, 10) 13. 1 8 2856 70 56 28 8 1 


[Inductive Step only] Assume that the theorem is true 
for n. 


(a4 py"! = (a bya n b)" 
n 
= (a+b) >> Cn, ba" *ok 
k=0 
n 
= ie C(n, Katt !—kpk 
k=0 
n 
4. 2 cin, kal * pk! 
k=0 
n 
=> Cama 
k=0 


n+l 
+ Soca, k= Da" kok 
k=1 


n 
= C(n, 0)a"t! 9 + > C(n, katt !—kpk 
k=1 
+ C(n, nyap"t! 
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23. 
26. 


32. 


35. 
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n+l —k pk 


+ SoCa,k- Da 
k=1 


= C(n+1,0)a"t!p° 


n 

+ Soice, k) + C(n, k — D]a"t kok 
k=1 

C(n4 1a°p"*! 

= C(n+1, 0)a"t!p? 


n 


+ Socat1, batt kok 


ln 


k=l 
+Cin+1,n+ 1a°p"t! 
n+l 
= So C(nt 1, bat! *ok 
k=0 


The number of solutions in nonnegative integers of 
xy +x0 +--+ + x42 =n—k 


is C(kk+2+n—k—1,n—k) = C(n+1,k+1). The number 
of solutions is also the number of solutions C(k+1+n—k-—1, 
n—k) = C(n, k) with x,42 = 0 plus the number of solutions 


Ck+l+n—k—1-1,n—k-1)=C—1,k) 


with xx42 = | plus: -- plus the number of solutions C(k+1+ 
0-1, 0) = C(k, k) with xx42 = n—k. The result now follows. 


Take a = | and b = 2 in the Binomial Theorem. 
4 3xry + 3x2z + Bxy* + Gxyz + 3x2? + y? + 3y%z 
+3y7 +2 


Set a = | and b = x and replace n by n — | in the Binomial 
Theorem to obtain 
n—-1 


=> ca-1, bx". 
k=0 


Now multiply by n to obtain 


+a" 


n—1| 
n(l+x)"! = ny) C(n— 1, k)x* 


k=0 


n 
= n> C(n —1,k- Dp! 
k=l 
n 


“3 n(n — 1)! ye 
a (n—k)!(k-—1)! 


n 


=> n! bk! 
<j (n— Wk! 


= 5 C(n, Dkk !, 


k=1 


The solution is by induction on k. We omit the Basis Step. 
Assume that the statement is true for k. After k iterations, we 
obtain the sequence defined by 


k-1 
a= > oe 
= WT 5n° 
i=0 


38. 


41. 


Let Bo, ..., Bi, denote the row after Bo, ..., Bg—1 in Pascal’s 
triangle. Smoothing a’ by c to obtain a” yields 


1 
54% +ai41) 


k-1 k-2 
1 
= Saal bp ai4jBi + S ait 
i=0 i=0 
k-1 k-2 
= one (1% +)> aj4jBi +)> Gi4j41Bi + sti 
i=0 


k=l k-1 
1 
= Fe (s+ ai+jBit)_ Gi4jBi-1 + sti 
i=l 


= Sa («i+ + s ai4jBy + aut) 


k 
= n+ is 


i=0 


and the Inductive Step is complete. 


[Inductive Step only] Notice that 


nor 


cat+1,)'+cCn4lit) t= Cn, i)! 


Now 


n+1 


So cm4+1,p7! 
i=l 
n+1 n 
=5 (Sew +1, i)7! doe + 1,i4 ») 
S> cq, i! 


1 Cant 1.74 nt+2< 
= n+l, T 
2 n+l 


+ C+ int") 


= 


We use the Hint, which follows from the Binomial Theorem 
applied to (j + 1)’. By Exercise 40, p divides C(p, i) for all 
i, 1 < i < p—1. Therefore p divides (j + 1)? — j? — 1 for 
all j, 1 <j < a— 1. Thus p divides 


Yolg+ I -? 


1] = (2? — 1? -1]+ (3? —-2?-1) 
b+++ + [a? —(a— 1)? — 1] 


1)-l=a@—-a. 


Section 6.8 Review 


1. 


First Form: If n pigeons fly into k pigeonholes and k < n, 
some pigeonhole contains at least two pigeons. 

Second Form: If f is a function from a finite set X to 
a finite set Y and |X| > |Y|, then f(x1) = f(x2) for some 
X1,%2 € X, x, # x2. 

Third Form: Let f be a function from a finite set X 
to a finite set Y. Suppose that |X| = n and |Y| = m. Let 
k = [n/m]. Then there are at least k values aj,..., ax € X 
such that 


f(a) =f(@) = ++» =f(q). 


. First Form: If 20 persons (pigeons) go into six rooms 


(pigeonholes), then some room contains at least two persons. 

Second Form: In the previous example, let X be the 
set of persons, and let Y be the set of rooms. If p is a person, 
define a function f by letting f(p) be the room in which per- 
son p is located. Then for some distinct persons p; and po, 
S(p.) = f(p2); that is, the distinct persons p; and p2 are in 
the same room. 

Third Form: Let X, Y, and f be as in the last example. 
Then there are at least [20/6] = 4 persons p1, p2, p3, p4 
such that 


f (pi) =f(p2) =f (ps) =f (pa); 


that is, there are at least four persons in the same room. 


Section 6.8 


1. 


10. 


13. 


Let the five cards be the pigeons and the four suits be the 
pigeonholes. Assign each card (pigeon) to its suit (pigeon- 
hole). By the Pigeonhole Principle, some pigeonhole (suit) 
will contain at least two pigeons (cards), that is, at least two 
cards are of the same suit. 


. Let the 35 students be the pigeons and the 24 letters of the 


alphabet be the pigeonholes. Assign each student (pigeon) 
the first letter of the first name (pigeonhole). By the Pigeon- 
hole Principle, some pigeonhole (letter) will contain at least 
two pigeons (students); that is, at least two students have first 
names that start with the same letter. 


. Let the 13 persons be the pigeons and the 12 = 3 -4 possible 


names be the pigeonholes. Assign each person (pigeon) that 
person’s name (pigeonhole). By the Pigeonhole Principle, 
some pigeonhole (name) will contain at least two pigeons 
(persons); that is, at least two persons have the same first and 
last names. 


Yes. Connect processors | and 2, 2 and 3, 2 and 4, 3 and 
4. Processor 5 is not connected to any processors. Now only 
processors 3 and 4 are directly connected to the same number 
of processors. 


Let a; denote the position of the ith unavailable item. Con- 
sider 


Q,---,430; a +3,...,a30+3;  a1+6,...,a30 +6. 


These 90 numbers range in value from | to 86. By the sec- 
ond form of the Pigeonhole Principle, two of these num- 


18. 
19. 


20. 
21. 


31. 


35. 


38. 


41. 


44. 
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bers are the same. If a; = a; + 3, two are three apart. If 
aj = a; +6, two are six apart. If a; +3 = a; +6, two are three 
apart. 
n+1 
Suppose that k < m/2. Clearly, k > 1. Since m < 2n + 1, 
ks lg + : +1 
—<n+=<n : 
2 2 
Suppose that k > m/2. Then 
m—-k<m Be 
2 2 


Because m is the largest element in X,k < m. Thusk+1 <m 
and so | < m —k. Therefore, the range of a is contained in 
Gl Berea 


The second form of the Pigeonhole Principle applies. 


Suppose that a; = a;. Then either i < m/2 andj > m/2 or 
J < m/2 andi > m/2. We may assume that i < m/2 and 
j >m/2. Now 


itj=a+m—a =m. 


When we divide a by b, the possible remainders are 0, 1,..., 
b — 1. Consider what happens after b divisions. 


We suppose that the board has three rows and seven columns. 
We call two squares in one column that are the same color 
a colorful pair. By the Pigeonhole Principle, each column 
contains at least one colorful pair. Thus the board contains 
seven colorful pairs, one in each column. Again by the Pi- 
geonhole Principle, at least four of these seven colorful pairs 
are the same color, say red. Since there are three pairs of 
rows and four red colorful pairs, a third application of the Pi- 
geonhole Principle shows that at least two columns contain 
red colorful pairs in the same rows. These colorful pairs 
determine a rectangle whose four corner squares are red. 


Suppose that it is possible to mark k squares in the upper-left 
k x k subgrid and k squares in the lower-right k x k subgrid 
so that no two marked squares are in the same row, column, 
or diagonal of the 2k x 2k grid. Then the 2k marked squares 
are contained in 2k — 1 diagonals. One diagonal begins at 
the top left square and runs to the bottom right square; k — 1 
diagonals begin at the k — | squares immediately to the right 
of the top left square and run parallel to the first diagonal 
described; and k — | diagonals begin at the k — 1 squares 
immediately under the top left square and run parallel to 
the others described. By the first form of the Pigeonhole 
Principle, some diagonal contains two marked squares. This 
contradiction shows that it is impossible to mark k squares in 
the upper-left k x k subgrid and k squares in the lower-right 
k x k subgrid so that no two marked squares are in the same 
row, column, or diagonal of the 2k x 2k grid. 


Since there are n + | numbers in X, but only n distinct odd 
parts, the odd parts of two numbers must coincide. Thus one 
of these two numbers divides the other. 


We use the fact that the sum of the interior angles of a planar 
polygon with n sides is (n— 1)180 degrees. Now suppose that 
at most two interior angles are less than 180 degrees. Then at 
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least n — 2 interior angles are greater than or equal to 180 de- 
grees. Thus the sum of the interior angles and the other angles 
is greater than (n — 2)180 degrees, which is a contradiction. 


Chapter 6 Self-Test 
1. [Section 6.1] 24 
2. [Section 6.2] 6!/(3!3!) = 20 


3. [Section 6.2] We construct the strings by a three-step process. 
First, we choose positions for A, C, and E [C(6, 3) ways]. 
Next, we place A, C, and E in these positions. We can place 
C one way (last), and we can place A and E two ways (AE or 
EA). Finally, we place the remaining three letters (3! ways). 
Therefore, the total number of strings is C(6, 3) «2 +3!. 


4. [Section 6.3] 8!/(3!2!) 


5. [Section 6.3] We count the number of strings in which no 
I appears before any L and then subtract from the total num- 
ber of strings. 

We construct strings in which no / appears before any 
L by a two-step process. First, we choose positions for NV, O, 
and S; then we place the /’s and L’s. We can choose positions 
for N, O, and S in 8-7-6 ways. The /’s and L’s can then 
be placed in only one way because the L’s must come first. 
Thus there are 8-7 -6 strings in which no J appears before 
any L. 

Exercise 4 shows that there are 8!/(3!2!) strings 
formed by ordering the letters JLLINOIS. Therefore, there are 


! 
38! 3.7.6 
3!2! 


strings formed by ordering the letters ILLINOIS in which 
some / appears before some L. 
6. [Section 6.1]6-9+7+6-9-446-7-449-7-4 
7. [Section 6.1] 2” —2 
8. [Section 6.2] Two suits can be chosen in C(4, 2) ways. We 
can choose three cards of one suit in C(13, 3) ways and we 
can choose three cards of the other suit in C(13, 3) ways. 
Therefore, the total number of hands is C(4, 2)C(13, 3)*. 
9, [Section 6.7] (s — r)* = C(4, 0)s* + C(4, 1)s3(—r) 
+C(4, 2)s?(—r)? + C(4, 3)s(—r)? 
+C(4, 4)(—r)* 
= sf —453¢ + 6977? — 4573 +77 
10. [Section 6.8] Let the 15 individual socks be the pigeons and 
let the 14 types of pairs be the pigeonholes. Assign each 
sock (pigeon) to its type (pigeonhole). By the Pigeonhole 
Principle, some pigeonhole will contain at least two pigeons 
(the matched socks). 
11. [Section 6.1] 6-5+4-3+6-5-4-3-2 
12. [Section 6.2] We must select either three or four de- 
fective discs. Thus the total number of selections is 
C(5, 3)C(95, 1) + C(S, 4). 
13. [Section 6.3] 12!/(3!)* 
14. [Section 6.7] 23 -8!/(3! 1!4!) 


15. 


16. 
17. 


18. 
19. 


20. 


21. 
22. 
23. 
24. 
25. 
26. 
27. 


28. 
29. 


[Section 6.7] If we set a = 2 and b = —1 in the Binomial 
Theorem, we obtain 


n 
l=M@=24+(-b))"= ss C(n, ky2"-*(- 1). 

k=0 
[Section 6.3] C11 +4-1,4- 1) 
[Section 6.8] There are 3-2-3 = 18 possible names for the 
19 persons. We can consider the assignment of names to 
people to be that of assigning pigeonholes to the pigeons. By 
the Pigeonhole Principle, some name is assigned to at least 
two persons. 
[Section 6.7] C(n, 1) =n 


[Section 6.8] Let a; denote the position of the ith available 
item. The 220 numbers 


a1, +--+, 41105 a, +19,...,a119 +19 


range from | to 219. By the Pigeonhole Principle, two are 
the same. 


[Section 6.8] Each point has an x-coordinate that is either 
even or odd and a y-coordinate that is either even or odd. 
Since there are four possibilities and there are five points, by 
the Pigeonhole Principle at least two points, pj = (x;, yj) and 
Pj = (x, yj) have 

@ Both x; and x; even or both x; and x; odd. 
and 

= Both y; and y; even or both y; and y; odd. 
Therefore, x; + xj is even and y; + y; is even. In particular, 
(x; +.2;)/2 and (y; + y;)/2 are integers. Thus the midpoint of 
the pair p; and p; has integer coordinates. 
[Section 6.4] 12567 
[Section 6.4] 234567 


[Section 6.4] 6427153 
[Section 6.4] 631245 

[Section 6.5] 1/4 

[Section 6.6] P(H) = 5/6, P(T) = 1/6 

[Section 6.6] Let S denote the event “children of both sexes,” 
and let G denote the event “at most one girl.” Then 


P(S) = 6 3 
= 8 4 
P(G) = ee 
~ 8 2 
P(SNG) = = 
=< 
Therefore, 
P(S)P(G) = = : = - = P(SNG) 
“42° 8 : 
and S and G are independent. 
[Section 6.5] 5/36 
. C(7, 5)C(31 — 7, 2) 21-276 
Section 6. = = 0.002204158 
een eka 3629575 ~ °-00220 


30. [Section 6.6] Let J denote the event “Joe passes,” and let A 
denote the event “Alicia passes.” Then 
P(Joe fails) = PJ) = 1 — P(J) = 0.25 
P(both pass) = P(JM A) = P(/)P(A) 

= (0.75)(0.80) = 0.6 

P(both fail) = PNA) = PUA) 
= 1—P(JUA) 
= | — [P(V) + P(A) — PJ NA)] 
= 1 — [0.75 + 0.80 — 0.6] = 0.05 

P(at least one passes) = 1 — P(both fail) 

= 1-0.05 = 0.95. 


31. [Section 6.6] Let B denote the event “bug present,” and let 7, 
R, and J denote the events “Trisha (respectively, Roosevelt, 
José) wrote the program.” Then 

P(B\| JP) 
P(B| J)PU) + P(B|T)P(T) + P(B| R)P(R) 
(0.05) (0.25) 
~ (0.05) (0.25) + (0.03)(0.30) + (0.02)(0.45) 
= 0.409836065. 


P(UJ|B) = 


4-C(13, 6) -3+C(13, 5) -2+C(13, 2) 


32. [Section 6.5] C(52, 13) 


Section 7.1 Review 
1. A recurrence relation defines the nth term of a sequence in 
terms of certain of its predecessors. 
2. An initial condition for a sequence is an explicitly given value 


for a particular term in the sequence. 


3. Compound interest is interest on interest. If a person invests 
d dollars at p percent compounded annually and we let A, 
be the amount of money earned after n years, the recurrence 


relation 
An= (1+) 
= 100 n-1 
together with the initial condition Ag = d defines the se- 
quence {A,}. 


4. The Tower of Hanoi puzzle consists of three pegs mounted 
on a board and disks of various sizes with holes in their cen- 
ters. Only a disk of smaller diameter can be placed on a disk 
of larger diameter. Given all the disks stacked on one peg, the 
problem is to transfer the disks to another peg by moving one 
disk at a time. 


5. If there is one disk, move it and stop. If there are n > | disks, 
recursively move n — | disks to an empty peg. Move the 
largest disk to the remaining empty peg. Recursively move 
n — | disks on top of the largest disk. 


6. We assume that at time n, the quantity g, sold at price p, is 
given by the equation p, = a — bqn, where a and b are pos- 
itive parameters. We also assume that p, = kqn+1, where 
k is another positive parameter. If we graph the price and 
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quantity over time, the graph resembles a cobweb (see, e.g., 
Figure 7.1.5). 


7. Ackermann’s function A(m, n) is defined by the recurrence 
relations 


A(m, 0) = A(m — 1, 1) 
A(m,n) = A(m — 1, A(m,n — 1)) 


m>1 


m>1l1,n>1 


and initial conditions 


A(O,n) =n+1 n> 0. 
Section 7.1 
1. dy = Gn) +4; a = 3 
» An = CAAAn-1 5. Ao = 2000 


. Ay = 2280, Ar = 2599.20, Az = 2963.088 
» An = (1.14)"2000 


. We must have A, = 4000 or (1.14)"2000 = 4000 or 
(1.14)” = 2. Taking the logarithm of both sides, we must have 
n log 1.14= log 2. Thus 


ZTrrnn 


— log2 
= log 1.14 


5.29. 


18. We count the number of n-bit strings not containing the pat- 
tern 000. 


= Begin with 1. In this case, if the remaining (n — 1)- 
bit string does not contain 000, neither will the n-bit 
string. There are S,—; such (n — 1)-bit strings. 


®™ Begin with 0. There are two cases to consider. 


1. Begin with 01. In this case, if the remaining 
(n — 2)-bit string does not contain 000, neither 
will the n-bit string. There are S,—2 such (n — 2)- 
bit strings. 

2. Begin with 00. Then the third bit must be a | and 
if the remaining (n — 3)-bit string does not con- 
tain 000, neither will the n-bit string. There are 
Sn—3 such (n — 3)-bit strings. 

Since the cases are mutually exclusive and cover all 
n-bit strings (n > 3) not containing 000, we have S,, = S,—1+ 
Sn—2 + Sp—3 for n> 3. $; =2 (there are two 1|-bit strings), 
S2 =4 (there are four 2-bit strings), and $3=7 (there are 
eight 3-bit strings but one of them is 000). 


19. There are S,—; n-bit strings that begin | and do not contain 
the pattern 00 and there are S,—2 n-bit strings that begin 0 
(since the second bit must be 1) and do not contain the pat- 
tern 00. Thus S,, = S,—1+S,—2. Initial conditions are S; = 2, 
So =3. 

22. S; = 2, S2 = 4, $3 = 7, S4 = 12 

25. C3 =5, C4 = 14, Cs = 42 

28. We first prove that ifn > 5, then C,, is not prime. Suppose, 
by way of contradiction, that C,, is prime for some n > 5. By 
Exercise 27,n +2 < C,. Thus C,, does not divide n + 2. By 
Exercise 26, 


(n+ 2Cn41 => (4n +P 2)Cn- 


684 


31. 


36. 
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Thus C,, divides (n + 2)C,41. By Exercise 27, Section 5.3, 
C, divides either n + 2 or Cy41. Since C, does not divide 
n + 2, C, divides C,41. Therefore there exists an integer 
k > 1 satisfying C,+1 = kC,. Thus 


(n + 2)kC, = (4n + 2)Cy. 
Canceling C,, we obtain 
(n+ 2)k = (4n+ 2). 


If k = 1, the preceding equation becomes n + 2 = 4n + 2, 
and thus n = 0, which contradicts the fact that n > 5. Simi- 
larly, if A = 2, then n = 1, and if k = 3, then n = 4, both of 
which contradict the fact thatn > 5. Ifk > 4, 


4n+2=k(n+2) > 4(n+2)=4n+8. 


Therefore 0 > 6. Thus k does not exist. This contradiction 
shows that if n > 5, C,, is not prime. 

Directly checking n = 0, 1, 2,3,4 shows that only 
Cz = 2 and C3 = 5 are prime. 
Let P,, denote the number of ways to divide a convex (n + 2)- 
sided polygon, n > 1, into triangles by drawing n — | lines 
through the corners that do not intersect in the interior of the 
polygon. We note that P; = 1. 

Suppose that n > 1 and consider a convex (n + 2)- 
sided polygon (see the following figure). 


K 


n+2-(k+1)+1l=n-k+2sides 


We choose one edge ab and construct a partition of the 
polygon by a two-step procedure. First we select a triangle 
to which side ab belongs. This triangle divides the origi- 
nal polygon into two polygons: one having k + 1 sides, 
for some k satisfying 1 < k <_ ny; and the other having 
n—k + 2 sides (see the preceding figure). By definition, 
the (k + 1)-sided polygon can be partitioned in P,_| ways 
and the (n — k + 2)-sided polygon can be partitioned in P;,_; 
ways. (For the degenerate cases k = 1 and k = n, we set 
Po = 1.) Therefore, the total number of ways to partition the 
(n + 2)-sided polygon is 


n 
P,r= Pars 
k=1 


Since the sequence P), P2,... satisfies the same recurrence 
relation as the Catalan sequence Ci, C2,... and Pp = P; = 
1= Cp = C,, it follows that P, = C, for alln > 1. 


[For n = 3] 


Step 1—move disk 3 from peg | to peg 3. 
Step 2—move disk 2 from peg | to peg 2. 


38. 


40. 
46. 


47. 


49. 
52. 


Step 3—move disk 3 from peg 3 to peg 2. 
Step 4—move disk | from peg | to peg 3. 
Step 5—move disk 3 from peg 2 to peg 1. 
Step 6-—move disk 2 from peg 2 to peg 3. 
Step 7—move disk 3 from peg | to peg 3. 
Let @ and £ be the angles shown in Figure 7.1.6. The geom- 
etry of the situation shows that the price tends to stabilize if 
and only if w + 6B > 180°. This last condition holds if and 


only if —tan 6 < tana. Since b = —tan B and k = tana, we 
conclude that the price stabilizes if and only if b < k. 


A(2,2) =7,A(2,3)=9 43. AG,n) = 2743 —3 
If m= 0, 


A(m,n+ 1) = A(O,n+ 1) 
=n+2>n+1 
= A(0,n) = A(m, n). 


The last inequality follows from Exercise 44. 
We use induction on z. For the Basis Step (z = 0), we have 
AO(3, y,0) = 1 = y°. For the Inductive Step, assume the 
result for z. We then have AO(3, y,z + 1) = AO(2, y, y*) = 
joy ee yt! 
Ley = ys 
Use Exercises 41 and 42. 
We prove the statement by using induction on x. The induc- 
tive step will itself require induction on y. 

Exercise 49 shows that the equation is true for 
x = 0, 1, 2 and for all y. 
Basis Step (x=2) See Exercise 49. 


Inductive Step (Case x implies case x +1) Assume that 
x >2and 


A(x, y) = AO(x, 2, y+ 3) —3 for all y > 0. 
We must prove that 
A(xt+1,y) =AO@ + 1,2,y+3)—3 for all y > 0. 


We establish this last equation by induction on y. 


Basis Step (y=0) We must prove that 


A(w + 1,0) = AO(w + 1, 2, 3) — 3. 


Now 


AO(x + 1, 2,3) —3 
= AO(x, 2, AO(x + 1,2,2)) —3 _ by definition 


= AO(x, 2,4) —3 by Exercise 51 
= A(x, 1) by the inductive 

assumption on x 
= A(x+ 1,0) by (7.1.12). 


Inductive Step (Case y implies case y+1) Assume that 


A(x + 1, y) = AO(x + 1,2, y +3) —3. 


We must prove that 


A(x +l,y+1) =AO( + 1,2, y+4) —3. 


55. 


58. 
60. 


63. 


66. 


69. 


Now 


AO(x+ 1,2,y+4)-3 
= AO(x, 2, AO(x+ 1,2, y+3)) —3 _ by definition 


= AO(x, 2, A(x + 1, y) +3) -—3 by the inductive 


assumption on y 


= A(x, A(x + 1, y)) by the inductive 


assumption on x 


=Atx+ly+l by (7.1.13). 


Suppose that we have n dollars. If we buy orange juice the 
first day, we have n — | dollars left, which may be spent in 
Rn—1 ways. Similarly, if the first day we buy milk or beer, 
there are R,-2 ways to spend the remaining dollars. Since 
these cases are disjoint, R, = Ry—1 + 2Ry-2. 


53 = 1/2, S4 = 3/4 


A function f from X = {1,...,n} into X will be denoted 


(i, i2,..-,%n), which means that f(k) = iz. The problem 77. 
then is to count the number of ways to select ij, ..., i so 
that if i occurs, so do 1,2,...,i—1. 


We shall count the number of such functions having 
exactly j 1’s. Such functions can be constructed in two steps: 
Pick the positions for the j 1’s; then place the other numbers. 
There are C(n, j) ways to place the 1’s. The remaining num- 


bers must be selected so that if i appears, so do 1,...,i— 1. 
There are F,_; ways to select the remaining numbers, since 
the remaining numbers must be selected from {2,..., n}. 


Thus there are C(n, j)F,—; functions of the desired type hay- 
ing exactly j 1’s. Therefore, the total number of functions 
from X into X having the property that if i is in the range of 
f, then so are 1,...,i— 1, is 


dE CO. DFn-j = 


j=l J 


So Cn, n = j)Fnj 


1 


n 


{uy} is not a recurrence relation because, if m is odd and 
greater than 1, wu, is defined in terms of the successor u3n+1- 
uj, for 2 < i < 7, is equal to one. As examples, 


w=u=1 


72. 


74. 
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Let A, denote the amount at the end of n years and let i be 
the interest rate expressed as a decimal. The discussion fol- 
lowing Example 7.1.3 shows that A, = (1+/)"Ao. The value 
of n required to double the amount satisfies 


2Ap9 = (1+i)"Ap or 2=(14+ i)". 


If we take the natural logarithm (logarithm to the base e) of 
both sides of this equation, we obtain 


In2 =nIn(1 +1). 


Thus 
In2 
n= ———. 
Indi +a) 
Since In2 = 0.6931472... and In(1 + i) is approximately 


equal to i for small values of i, n is approximately equal to 
0.69... /i, which, in turn, is approximately equal to 70/r. 


1,3, 2: 2,3, 1:23 =2 


We count the number of rise/fall permutations of 1,...,” 
by considering how many have n in the second, fourth, ..., 
positions. 

Suppose that 1 is in the second position. Since any 
of the remaining numbers is less than n, any of them may 
be placed in the first position. Thus we may select the num- 
ber to be placed in the first position in C(n — 1, 1) ways 
and, after selecting it, we may arrange it in Ej; = 1 way. 
The last n — 2 positions can be filled in E,-2 ways since 
any rise/fall permutation of the remaining n — 2 numbers 
gives a rise/fall permutation of 1,...,. Thus the number 
of rise/fall permutations of 1,..., with n in the second 
position is C(n — 1, I)E, En—2. 

Suppose that 7 is in the fourth position. We may select 
numbers to be placed in the first three positions in C(n— 1, 3) 
ways. After selecting the three items, we may arrange them 
in E3 ways. The last n — 4 numbers can be arranged in E,,_4 
ways. Thus the number of rise/fall permutations of 1, ...,n 
with n in the fourth position is C(n — 1, 3)E3E,~4. 

In general, the number of rise/fall permutations of 
1,..., with n in the (2;)th position is 


C(n — 1, 2j — IEx-1En—y.- 


Summing over all j gives the desired recurrence relation. 


ug = U9 = U5 = Hig = Wg = U4 = Wy = I, Section 7.2 Review 
Use equation (6.7.4) to write 1. Use the recurrence relation to write the nth term in terms of 
h certain of its predecessors. Then successively use the recur- 
4 rence relation to replace each of the resulting terms by certain 
Sk, n) = S>S(k-1, i). P as en OIE 
aa of their predecessors. Continue until an explicit formula is 
i= ; 
: . . obtained. 
We use the terminology of Exercise 90, Section 6.2. Choose . ; . 
2. An nth-order, linear homogeneous recurrence relation with 
one of n + 1 people, say P. There are sy j;—1 ways for P to fici : lati f the f 
sit alone. (Seat the other n people at the other k — | tables.) EONS RE COC MCIenS 1:2 recurrence retauomor meu 
Next we count the number of arrangements in which P is not An = C{An—| + C2An—2 + +++ + CKAn—k- 
alone. Seat everyone but P at k tables. This can be done in 
: . 3. dn = 64n—1 — 8dn—2 
Sn,k Ways. Now P can be seated to the right of someone in 
4. To solve 


n ways. Thus there are ns, , arrangements in which P is not 
alone. The recurrence relation now follows. 


An = C1dn—1 + C24y-2, 
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first solve the equation 
P= cit+c2 


for t. Suppose that the roots are ft; and f and that t) 4 fo. 
Then the general solution is of the form 


dy = bt + dth, 


where b and d are constants. The values of the constants can 
be obtained from the initial conditions. 
If t; = t =f, the general solution is of the form 


an = bt" + dnt’, 


where again b and d are constants. The values of the con- 
stants can again be obtained from the initial conditions. 


Section 7.2 


1. Yes; order 1 

4. No 

7. No 
10. Yes; order 3 
11. a, = 2(—3)" 
15. a, = 27+! — 4" 


29. 


2 An = Qs + 3")/5 
+ An = 2(—4)" + 3n(—4)" 
«Ry = (HD + 2"*14/3 


. Let d, denote the deer population at time n. The initial con- 


dition is dj = O. The recurrence relation is 
dy = 100n + 1.2dy-1, 
dy = 100n + 1.2d,—1 = 100n + 1.2[100(n — 1) + 1.2d,—2] 
= 100n + 1.2-100(n — 1) + 1.27dy_2 
= 100n + 1.2-100(n — 1) 
+ 1.27[100(n — 2) + 1.2dy—3] 
= 100n + 1.2+100(n — 1) 
+ 1.27 -100(n — 2) + 1.23dy_3 


n> 0. 


n-1 


y 1.2). 100(n — i) + 1.2"do 
i=0 


n-1 


S 1.2). 100(n — i) 


i=0 


n-1 


n—-1 
= 100n }> 12° = 1.2-100 > +1277 
i=0 i=l 


100n(1.2" — 1) 
124 
(n— 1)1.2" —nl.2"-! +1 


120 
(1.2 — 1)2 


: n> 0. 


1/2 


32. 
35. 
38. 


41. 


43. 
46. 
49. 
52. 


55. 


(0) 
n/C 


Set by = ay/n! to obtain b, = —2by_1 + 3by—2. Solving 
gives d, = n! by, = (n!/4)[5 — (—3)"]. 


We establish the inequality by using induction on n. 

The base cases n = | and n = 2 are left to the reader. 
Now assume that the inequality is true for values less than 
n+ 1. Then 


Sn+2 = fntl +Jn 


14/5 
2 


and the Inductive Step is complete. 

an = 62" + da" +1 

ay = b/2" + d3” — (4/3)2” 

The argument is identical to that given in Theorem 7.2.11. 


Recursively invoking this algorithm to move the n — k, disks 
at the top of peg 1 to peg 2 takes T(n — k,) moves. Mov- 
ing the k, disks on peg 1 to peg 4 requires 2" — 1 moves 
(see Example 7.2.4). Recursively invoking this algorithm to 
move the n—k,, disks on peg 2 to peg 4 again takes T(n — ky) 
moves. The recurrence relation now follows. 


From the inequality 
Kn (kn + 1) <n 


5) ; 
we can deduce ky, < /2n. Since 
n—k, < knlkn + ae 


it follows that r,, < k,. Therefore, 


T(n) = (kn tin — 1)2" +1 
< kyr +1 
< W/2n2Y™ 41 
= 0(4v"), 


Section 7.3 Review 


1. 


Let b, denote the time required for input of size n. Simulate 
the execution of the algorithm and count the time required 
by the various steps. Then b, is equal to the sum of the times 
required by the various steps. 


. Selection sort selects the largest element, places it last, and 


then recursively sorts the remaining sequence. 


3. O(n?) 


4. Binary search examines the middle item in the sequence. If 


the middle item is the desired item, binary search terminates. 
Otherwise, binary search compares the middle item with 
the desired item. If the desired item is less than the middle 
item, binary search recursively searches in the left half of 
the sequence. If the desired item is greater than the middle 
item, binary search recursively searches in the right half of 
the sequence. The input must be sorted. 


5. Ifa, is the worst-case time for input of size n, dy = 1+ |n/2). 
6. O(lgn) 


. Merge maintains two pointers to elements in the two input 


sequences. Initially the pointers reference the first elements 
in the sequences. Merge copies the smaller element to the 
output and moves the pointer to the next element in the 
sequence that contains the element just copied. It then re- 
peats this process. When a pointer moves off the end of one 
of the sequences, merge concludes by copying the rest of the 
other sequence to the output. Both input sequences must be 
sorted. 


8. ©(n), where nis the sum of the lengths of the input sequences 


9. Merge sort first divides the input into two nearly equal parts. 


10. 
11. 


12. 


13. 


It then recursively sorts each half and merges the halves to 
produce sorted output. 

An = A\n/2| + Aint (2) +2 — 1 

If the input size is a power of two, the size is always divisible 
by 2 and the floors vanish. 

An arbitrary input size falls between two powers of two. 
Since we know the worst-case time when the input size is a 
power of two, we may bound the worst-case time for input of 
arbitrary size by the worst-case times for inputs whose sizes 
are the powers of two that bound it. 


O(nlgn) 


Section 7.3 


1. 


At line 2, since i > j (1 > 5) is false, we proceed to line 4, 
where we set k to 3. At line 5, since key (*G’) is not equal to 
53 (‘J’), we proceed to line 7. At line 7, key < 5% (6G < ‘J’) 
is true, so at line 8 we set j to 2. We then invoke this algorithm 
with i = 1, j = 2 to search for key in 


s="C, 5 = ‘GC’. 


At line 2, since i > j (1 > 2) is false, we proceed 
to line 4, where we set k to 1. At line 5, since key (‘G’) is 
not equal to s; (‘C’), we proceed to line 7. At line 7, key 
< sx (G’ < ‘C’) is false, so at line 10 we set i to 2. We then 
invoke this algorithm with i = j = 2 to search for key in 


52 = ‘GC. 


At line 2, since i > j (2 > 2) is false, we proceed to 
line 4, where we set k to 2. At line 5, since key (‘G’) is equal 
to s2 (‘G’), we return 2, the index of key in the sequence s. 


. At line 2, since i > j (1 > 5) is false, we proceed to line 4, 


where we set k to 3. At line 5, since key (‘Z’) is not equal to 
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53 (‘J’), we proceed to line 7. At line 7, key < sz ((Z’ < ‘J’) 
is false, so at line 10 we set i to 4. We then invoke this algo- 
rithm with i = 4, j = 5S to search for key in 


soa “MM, s5 = ‘X’, 


At line 2, since i > j (4 > 5) is false, we proceed 
to line 4, where we set k to 4. At line 5, since key (‘Z’) is 
not equal to s4 (‘M’) we proceed to line 7. At line 7, key 
< sg (Z’ < ‘M’) is false, so at line 10 we set i to 5. We then 
invoke this algorithm with i = j = 5 to search for key in 


se "X". 


At line 2, since i > j (5 > 5) is false, we proceed 
to line 4, where we set k to 5. At line 5, since key (‘Z’) is 
not equal to ss (‘X’), we proceed to line 7. At line 7, key 
< spt (Z’ < ‘X’) is false, so at line 10 we set i to 6. We then 
invoke this algorithm with i = 6,7 = 5. 

At line 2, since i > j (6 > 5) is true, we return 0 to 
indicate that we failed to find key. 


. Consider the input 10, 4, 2 and key = 10. 
10. 


The idea is to repeatedly divide the sequence as nearly as 
possible into two parts and retain the part that might contain 
the key. Only after obtaining a subsequence of length 1 or 
2, do we test whether the subsequence contains the key. The 
following algorithm implements this design. 


binary_search_nonrecurs(s, n, key) { 


i=1 

j=n 

// the body of the loop executes only if the subsequence 
HW Si cceys s; has length greater than or equal to 3 


while (i < j — 1) { 
k= +j/2! 
if (sg < key) 
i=k+1 
else 
j=k 
} 
fork = itoj 
if (sp == key) 
return k 
return 0 


} 


We first prove that if a sequence of length n is input to 
the while loop, where n is a power of 2, say n = 2", m > 2, 
the loop iterates m — | times. The proof is by induction on 
m. The Basis Step is m = 2. In this case n = 4. Assuming 
that i = 1 and n = 4, in the while loop, & is first set to 2. 
Then either i is set to 3 orj is set to 2. Thus the loop does not 
execute again. Therefore the loop iterates | = m — | time. 
The Basis Step is complete. 

Now suppose that if a sequence of length n = 2” is 
input to the while loop, the loop iterates m— 1 times. Suppose 
that n = 2”+!, Assuming that i = 1 and n = 2”*!, in the 
while loop, k is first set to 2”. Then either i is set to 2’””+1 orj 
is set to 2”. Thus at the next iteration of the loop, a sequence 
of length 2’”” is processed. By the inductive assumption, the 
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24. 
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loop iterates an additional m — 1 times. Therefore the loop 
iterates a total of m times. The Inductive Step is complete. 

Next we prove that if a sequence of length n, where 
n satisfies 2”-! < n < 2”, m > 2, is input to the while 
loop, the loop iterates at most m — 1 times. The proof is by 
induction on m. The Basis Step is m = 2. In this case we 
have 2 < n < 4. Thus n is either 3 or 4. In the preceding 
paragraphs, we proved that if n = 4 the loop iterates one 
time. If n = 3, it is easy to check that the loop iterates one 
time. The Basis Step is complete. 

Now assume that if a sequence of length n, where 
n satisfies 2"-! <n < 2", m > 2, is input to the while 
loop, the loop iterates at most m — | times. Suppose that 
n satisfies 2” <n < 2™+1_ When n is even, the sequence 
is divided evenly and the next sequence processed by the 
loop has length n/2. Since n/2 satisfies 2”-! < n/2 < 2”, 
by the inductive assumption the loop iterates at most m — 1 
more times. When n is odd, the sequence is divided into 
two parts—one part of length (n — 1)/2 and the other of 
length (n + 1)/2. Since n is odd, 2” <n < 2”*!, Therefore 
2" <n+1<2"*!, Thus 2”! <(n + 1)/2 < 2’. In this 
case, the inductive assumption tells us that the loop iterates 
at most m — 1 more times. We also have 2” <n—1<2™+! 
and 2”! < (n—1)/2 <2”. 1f2"—! < (n—1)/2, we may use 
the inductive assumption to conclude that the loop iterates at 
most m — | more times. If 2”"~! = (n — 1)/2, we may use 
the result proved just after the algorithm to conclude that the 
loop iterates m— 2 more times. In every case the loop iterates 
at most m — | more times. Together with the first iteration, 
we conclude that if n satisfies 2” <n < 2™+1 the while loop 
iterates at most m times. The Inductive Step is complete. 

Suppose that n satisfies 2”-! < n < 2”. Then the 
while loop iterates at most m — | times. This accounts for 
m—| tests of the form s; < key. At the for loop, either i = j or 
i =j+1. Thus there are at most two additional comparisons 
(of the form s; == key). Thus if n satisfies aml cn <2, 
the algorithm uses at most m+ 1 comparisons. Since 2”"—! < 
n <2",m—1 < Ign < m. Therefore [1gn] = m. Thus the 
algorithm uses at most 1 + m = 1 + [lgn] comparisons. 


The algorithm is not correct. If s is a sequence of length 1, 
5; = 9, and key = 8, the algorithm does not terminate. 


The algorithm is correct. The worst-case time is © (log n). 


Algorithm B is superior if 2 < n < 15. (Forn = | andn = 
16, the algorithms require equal numbers of comparisons.) 


Suppose that the sequences are aj,...,d, and by,..., Dn. 
(a) aj <bj <an<bo<:-: (b)a,<b, 

11 

4,1; 4,3; 4,2; 3,2 

4,3,2,1 

Using the notation in the hint, every pair p;, p; is an inversion 


in either p or p® but not both. Since there are n(n — 1)/2 
total pairs p;, p;, the total number of inversions in p and ps 
is n(n — 1)/2. Since there are n! permutations of n items, 
adding the inversions in p and p® for all permutations p gives 
n!n(n — 1)/2. But in this computation, each inversion was 


37. 


39. 


40. 
41. 
43. 


56. 


57. 
58. 


counted twice [once in the pair p, p® and once in the pair 
p®, (p&)® = pR, p]. Therefore, the total number of inver- 
sions in all of the permutations of {1, ..., m} is n!n(n — 1)/4. 


The idea is to use a divide-and-conquer approach. Sup- 
pose that we divide the input pj,..., pn, into two (approx- 
imately) equal parts: pj,...,px and pry1,..-,Pn, Where 
k = |(n+ 1)/2]. We can then recursively count the number 
of inversions in the first part and the number of inversions in 
the second part. The remaining inversions to count are those 
of the form p;, pj, where index / is in the first part and index 
j is in the second part. The total number of inversions is, 
then, the sum of the number of inversions in the first part, the 
number of inversions in the second part, and the number of 
inversions of the form p;, pj, where index i is in the first part 
and index j is in the second part. We can count the latter type 
of inversions if the first and second parts are sorted (after 
counting the inversions in each part prior to sorting). 

Leti = 1 andj = k +1. If p; < pj, since each half is 
sorted, p; < Pm for all indexes m in the second half. There- 
fore p; is not in any inversion where the other element is from 
the first second half. In this case, we need not consider p; fur- 
ther. We can increment i and repeat this process. If pj > pj, 
then p;, p; is an inversion. Furthermore, since the first half is 
sorted, Pm > pj for all indexes m in the first half. Therefore 
Pm, p; is an inversion for all indexes m in the first half. We 
can count k inversions. We need not consider p; further. We 
can increment j and repeat this process. 

If we copy each element we need not consider further, 
we simply have the merge algorithm together with counting 
inversions. To effect the sorting, we can use merge sort. So 
if we use merge sort, together with the enhanced merge, we 
have our O(n lgn) algorithm. 


Algorithm 7.3.11 computes a” by using the formula a” = 
a” a" —m : 

bn = Dinj2) + Oyaty2) + 1, b1 = 0 

by = 1,b3 = 2,b4 =3 42. b, =n-1 

We prove the formula by using mathematical induction. The 
Basis Step, n = 1, has already been established. 


Assume that b, = k — 1 for all k < n. We show that 
b, =n— 1. Now 


bn = Din/2) + bin+iy/2) +1 
=|*| i+|2e | 141 
2 2, 


by the inductive assumption 


1 
=(5| |" | Laat, 
2 2 
If n = 1, then i = j and we return before reaching line 6b, 
10, or 14. Therefore, b} = 0. If n = 2, then j = i+ 1. There 


is one comparison at line 6b and we return before reaching 
line 10 or 14. Therefore, b2 = 1. 


b3 = 3,b4 =4 


When n > 2, bi(n+1)/2) Comparisons are required for the 
first recursive call and b\,/2; comparisons are required for 


59. 


60. 


69. 
70. 


the second recursive call. Two additional comparisons are 
required at lines 10 and 14. The recurrence relation now 
follows. 


Suppose that n = 2". Then (7.3.12) becomes 
box = 2by-1 + 2. 
Now 
bo = 2bz-1 +2 
= 2[2bp-2 +2] +2 
= bur +24+2=-:- 


= 2 1p, 4214 9k? 4.24.2 
aa Oe OD) as eee) 
= 214 9k_9 
n 3n 
=n—-2+-=~-2. 
jae nee 


We use the following fact, which can be verified by consid- 
ering the cases x even and x odd: 


3x el a 3(x+ 1) 
2 2 

Let a, denote the number of comparisons required by 
the algorithm in the worst case. The cases n = | andn = 2 
may be directly verified. (The case n = 2 is the Basis Step.) 
Inductive Step Assume that a, < [(3k/2) — 2] for2 <k 
<n. We must show that the inequality holds for k = n. 

If n is odd, the algorithm partitions the array into 
subclasses of sizes (n — 1)/2 and (n + 1)/2. Now 


2 =3x—=2 fore = 1) 2,06<5 


Qn = A(n-1)/2 t Q(n+1)/2 + 2 


fee 1 | 


aa 2 
+ [SPC 2] 42 
2 
_ 3-1) 24.5 = 28 3 
2 2 2 


II 
—"T 

w 
wl] g 

| 

wo 
—_1 


The case n even is treated similarly. 

O(n) 

If n = 1, sort just returns; therefore, all of the zeros precede 
all of the ones. The Basis Step is proved. 

Assume that for input of size n — 1, after sort is in- 
voked all of the zeros precede all of the ones. Suppose that 
sort is invoked with input of size n. If the first element is a 
one, it is swapped with the last element. sort is then called re- 
cursively on the first n— 1 elements. By the inductive assump- 
tion, within the first n — 1 elements all of the zeros precede 
all of the ones. Since the last element is a one, all of the zeros 
precede all of the ones for all n elements. If the first element 
is a zero, sort is called recursively on the last n — 1 elements. 
By the inductive assumption, within the last n — 1 elements 
all of the zeros precede all of the ones. Since the first ele- 


75. 


77. 
80. 
83. 


85. 
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ment is a zero, all of the zeros precede all of the ones for all 
n elements. In either case, sort does produce as output a rear- 
ranged version of the input sequence in which all of the zeros 
precede all of the ones, and the Inductive Step is complete. 


Ifn = 2', 
yk = 3azi-1 + 2k 
so 
An = A = 3az-1 + 2k 
= 3[Bay2 + 241] + 2* 
= 37ap2 +3-2k-1 4 9k 


= Stay + 3% 1.214 3h 2.92 4... 
4.3 .2k-1 4 ok 


= 3* + 2(3* — 2k) Ga 
= 3-3*—2.2k 


= 3.3/8" _ 2n. 
Line («) results from the equation 
(a = b)(ak—!p° a aé?p! ee a'bk-2 4+ abk}) = qd p* 


with a= 3 andb=2. 
bn = byasny/2) + P{n/2) +3 
bn = 4n — 3 


We will show that b, < bn4i,n = 1,2,.... We have the 
recurrence relation 


bn = bia4ny/2) + B[n/2) + CLa4n)/2), [n/2I- 


Basis Step bz = 2b, +¢1,1 = 2b) = by 


Inductive Step Assume that the statement holds for k < n. 
In case n is even, we have by = 2bn/2 + Cn/2,n/2; $0 


baat = bnt2)/2 + bnj2 + C(n42)/2,n/2 
= bn/2 + bn/2 + Cn/2,n/2 = bn. 


The case n is odd is similar. 

ex85(s, i,j) { 

if@==)) 
return 

m=((i+))/2! 
ex85(s, i, m) 
ex85(s,m-+ 1, j) 
combine(s, i, m, j) 


} 


88. We prove the inequality by using mathematical induction. 


Basis Step aj =0<0=), 


Inductive Step Assume that a, < by fork <n. Then 


An SF An j2\ + ang 2) +2180 


S Dinj2\ + Ong yz) +2182 = bn. 
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91. Let c = a. If nis a power of m, say n = m*, then 


An = Ank = Ank-1 + d 
= [dyi2 + d] +d 
= Ay k-2 2d 


= a,0 +kd =c+kd. 
An arbitrary value of n falls between two powers of m, say 


m*! <n<m 


k 
This last inequality implies that 
k—1<log,,n<k. 
Since the sequence a is nondecreasing, 
Ak! S An SL Aye. 


Now 


Q(log,, 2) = c+ (—-1+log,,n)d <c+(k—l)d 


= Ayk-! San 


and 


= 
A 


S Ank = C+ kd 
c++ log,, njd = O(log, n). 


A 


Thus a, = ©(log,, n). By Example 4.3.6, a, = O(1gn). 


Section 7.4 Review 


L, 


Computational geometry is concerned with the design and 
analysis of algorithms to solve geometry problems. 


2. Given n points in the plane, find a closest pair. 


. Compute the distance between each pair of points and choose 


the minimum distance. 


. Find a vertical line / that divides the points into two nearly 


equal parts. Then recursively solve the problem for each of 
the parts. Let 5; be the distance between a closest pair in the 
left part, and let dp be the distance between a closest pair in 
the right part. Let 6 = min{5,, 5g}. Then examine the points 
that lie within a vertical strip of width 25 centered about 
1. Order the points in this strip in increasing order of the 
y-coordinates and examine the points in this order. Compute 
the distance between each point p and the following seven 
points. Anytime there is a pair whose distance is less than 
5, update 5. At the conclusion, 6 is the distance between a 
closest pair. 


. The worst-case time of the brute-force algorithm is © (n?). 


The worst-case time of the divide-and-conquer algorithm is 
O(nlgn). 


Section 7.4 


1. 


The 16 points sorted by x-coordinate are (1,2), (1,5), (1, 9), 
(3,7), (3, 11), (5,4), (5,9), (7, 6), (8, 4), (8,7), (8,9), (11, 3), 
(11,7), (12, 10), (14,7), (17, 10), so the dividing point is 


(7,6). We next find 6, = J/8, the minimum distance among 
the left-side points (1,2), (1,5), (1,9), (3,7), (3, 11), (5,4), 
(5,9), (7,6), and 6g = 2, the minimum distance among the 
right-side points (8, 4), (8, 7), (8,9), (11,3), (11,7), (12, 10), 
(14,7), (17, 10). Thus 6 = min{6z, 5p} = 2. The points, 
sorted by y-coordinate in the vertical strip, are (8, 4), (7, 6), 
(8,7), (8,9). In this case we compare each point in the strip 
to all the following points. The distances from (8, 4) to (7, 6), 
(8,7), (8,9) are not less than 2, so 6 is not updated at this 
point. The distance from (7, 6) to (8,7) is »/2, so 6 is updated 
to xf, The distances from (7,6) to (8,9) and from (8, 7) to 
(8,9) are greater than a/2, so 6 remains x/2., Therefore, the 
distance between the closest pair is J2. 


4. Consider the extreme case when all of the points are on the 


vertical line. 


10. Let B be either of the left or right 6 x 5 squares that make 


up the 6 x 26 rectangle (see Figure 7.4.2). We argue by con- 
tradiction and assume that B contains four or more points. 
We partition B into four 5/2 x 6/2 squares as shown in Fig- 
ure 7.4.3. Then each of the four squares contains at most one 
point, and therefore exactly one point. Subsequently we refer 
to these four squares as the subsquares of B. 


The figure 
Dae . 
\ 
\ 
be 1s ed 
\ 
\ 
i 
ew 
\ 
\d 


e (e 


shows the following construction. We reduce the size of the 
subsquares, if possible, so that 


13. 


™@ Each subsquare contains one point. 
| The subsquares are the same size. 
| The subsquares are as small as possible. 


Since at least one point is not in a corner of B, the subsquares 
do not collapse to points and so at least one point is on a side 
of a subsquare s interior to B. We choose such a point and call 
it p. We select a subsquare s’ nearest p. We label the two cor- 
ner points of s’ on the side farthest from p, e and c. We draw a 
circle of radius 5 with center at c and let a be the (noncorner) 
point where this circle meets the side of s. Note that this circle 
meets a side of s in anoncorner point. Choose a point b in s on 
the same side as a between a and e. Let d be the correspond- 
ing point on the opposite side of s. Now the length of the di- 
ameter of rectangle R = bdce is less than 5; hence, R contains 
at most one point. This is contradiction since R contains p and 
the point in s’. Therefore, B contains at most three points. 


In addition to p.x and p.y, we assume that each point p has 
another field p.side, which we use to indicate whether p is 
on the left side or the right side when the points are divided 
into two nearly equal parts. The extra argument, /abel, to 
rec_find_all_26_once sets p.side to label for all points p. 


find_all_25_once(p,n) { 
5 = closest_pair(p, n) // original procedure 


if (5 > 0) { 
sort p}, ..-, Pn by x-coordinate 
rec_find_all_25_once(p, 1, n, 5, A) // = empty string 
} 
} 
rec_find_all_26_once(p, i, j, 5, label) { 
if G—i < 3) { 
sort pj, ..., pj by y-coordinate 
directly find and output all distinct pairs less than 
26 apart 
for k = itoj 
px.side = label 
return 
} 
k=(G+))/2] 
1 = px.x 


rec_find_all_25_once(p, i, k, 5, L) 
rec_find_all_26_once(p, k + 1, j, 6, R) 


merge pj, ..., px and px41,..., pj by y-coordinate 
t=0 
fork =itoj 
if (pp.x >1—2*5 A pe.x <14+2 5) { 
t=t+1 
Vt = Pk 
} 


fork=1tot—1 
fors =k-+ 1 to min{t,k +31} 
if (dist(vg, Vs) < 2% 5 A vp.sidem = vs.side) 
printin(vy +“ + vs) 
fork =itoj 
px.side = label 


16. 


18. 


21. 
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We show that for each point p, there are at most 6 distinct 
points whose distance to p is 6. It will then follow that the 
number of pairs 6 apart is less than or equal to 6n. 

We argue by contradiction. Suppose that some point 
p has 7 distinct neighbors p;,...,p7 whose distance to p is 
5. Then we have the situation 


Let C be the circumference of this circle. Since each dj is at 
least 5, we have 


i 
Ins =C> Sod > 785. 
i=l 


Therefore, x > 7/2 = 3.5, which is a contradiction. (A 
more careful estimate shows that for each point p, there are 
at most 5 distinct points whose distance to p is 6.) 


The inequality is proved by induction. The Basis Steps are 
n = 2,3, which are immediate. For the Inductive Step, we 
have 


An SZ Ajn/2) + 4y~ngiy/2) FON S tn/2) +em+/2) +n = th. 


an < ty = O(nign) 


Chapter 7 Self-Test 


1. 


[Section 7.1] (a) 3,5,8,12  (b) a4j=3) (C) dy=an—1-+n 


2. [Section 7.2] Yes 
3: 
4 


. [Section 7.1] Let X be an n-element set and choose x € X. 


[Section 7.1] An = (1.17)An—1, Ao = 4000 


Let k be a fixed integer, 0 < k < n— 1. We can select a 
k-element subset Y of X — {x} in C(n — 1, k) ways. Having 
done this, we can partition Y in Px ways. This partition to- 
gether with X — Y partitions X. Since all partitions of X can 
be generated in this way, we obtain the desired recurrence 
relation. 


. [Section 7.2] a, = 2(—2)”" — 4n(—2)" 


6. [Section 7.2] a, = 3-5" + (—2)” 


. [Section 7.1] If the first domino is placed as shown, 


there are a,—; ways to cover the 2 x (n — 1) board that 
remains. 
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2|2X1 


n 


If the first two dominoes are placed as shown, there 
are d,—2 ways to cover the 2 x (nm — 2) board that remains. 


1x2 


1x2 


It follows that a, = dy—1 + dy_2. 

By inspection, a; = | and az = 2. Since {a,} satisfies 
the same recurrence relation as the Fibonacci sequence and 
a, =f and az = fz, it follows that a; = fi+; fori = 1,2,.... 


. [Section 7.2] Consider a string of length n that contains an 


even number of 1’s that begins with 0. The string that follows 
the 0 may be any string of length n — 1 that contains an even 
number of 1’s, and there are c,_; such strings. A string of 
length n that contains an even number of 1’s that begins with 
2 can be followed by any string of length n — 1 that contains 
an even number of 1’s, and there are c,—; such strings. A 
string of length n that contains an even number of 1’s that 
begins with 1 can be followed by any string of length n — 1 
that contains an odd number of 1’s. Since there are 3”~! 
strings altogether of length n — 1 and c,_; of these contain 
an even number of 1’s, there are 3”! —cy_| strings of length 
n — | that contain an odd number of 1’s. It follows that 


—1 —1 
Cn = 2Cn-1 + 3" Cn-1 = Cn-1 + ar 


An initial condition is c; = 2, since there are two strings (0 
and 2) that contain an even number (namely, zero) of 1’s. 
We may solve the recurrence relation by iteration: 


Cn = Cn-1 4 gro! = Cn-2 4 Sia t gro! 
=e, $31 $3? +---43"1 
3-3 3" 41 
~ 7" 3-1 2 


. [Section 7.3] by = by—1 + 1, bo = 0 

. [Section 7.3] by = 1, bo = 2,b3 =3 

. [Section 7.3] b, =n 

. [Section 7.3] n(n + 1)/2 = O(n*). The given algorithm is 


faster than the straightforward technique and is, therefore, 
preferred. 


[Section 7.4] The 18 points sorted by the x-coordinate are 
C1, 8), (2, 2), (3, 13), (4, 4), (4, 8), (6, 10), (7, 1), (7, 7), (7, 13), 
(10, 1), (10,5), (10,9), (10,13), (12,3), (13,8), (14,5), 
(16, 4), (16, 10), so the dividing point is (7, 13). We next find 
6, = V8, the minimum distance among the left-side points 
(1, 8), (2, 2), (3, 13), (4, 4), (4, 8), (6, 10), (7, 1), (7, 7), (7, 13), 
and dr = J5, the minimum distance among the right-side 
points (10, 1), (10, 5), (10, 9), (10, 13), (12, 3), (13, 8), 14, 5), 
(16,4), (16, 10). Thus 6 = min{5;, 5g} = V5. The points, 
sorted by y-coordinate in the vertical strip, are (7, 1), (7,7), 


14. 


15. 


16. 


(6, 10), (7, 13). In this case we compare each point in the 
strip to all the following points. Since no pair is closer than 
J5, the algorithm does not update 5. Therefore, the distance 
between the closest pair is J5. 


[Section 7.4] If we replace “three” by “two,” when there are 
three points, the algorithm would be called recursively with 
inputs of sizes 1 and 2. But a set consisting of one point has 
no pair—let alone a closest pair. 


[Section 7.4] Each 6/2 x 5/2 box contains at most one point, 
so there are at most four points in the lower half of the rect- 
angle. 


[Section 7.4] O(n(lg n)*) 


Section 8.1 Review 


1. 


15. 


16. 


An undirected graph consists of a set V of vertices and a set 
E of edges such that each edge e € E is associated with an 
unordered pair of vertices. 


. Friendship can be modeled by an undirected graph by letting 


the vertices denote the people and placing an edge between 
two people if they are friends. 


. A directed graph consists of a set V of vertices and a set E of 


edges such that each edge e € E is associated with an ordered 
pair of vertices. 


. Precedence can be modeled by a directed graph by letting the 


vertices denote the tasks and placing a directed edge from 
task 1; to task ; if f; must be completed before 4. 


. If edge e is associated with vertices v and w, e is said to be 


incident on v and w. 


. If edge e is associated with vertices v and w, v and w are said 


to be incident on e. 


. If edge e is associated with vertices v and w, v and w are said 


to be adjacent. 


. Parallel edges are edges that are incident on the same pair of 


vertices. 


. An edge incident on a single vertex is called a loop. 


. A vertex that is not incident on any edge is called an isolated 


vertex. 


. A simple graph is a graph with neither loops nor parallel 


edges. 


. A weighted graph is a graph with numbers assigned to the 


edges. 


. A map with distances can be modeled as a weighted graph. 


The vertices are the cities, the edges are the roads between 
the cities, and the numbers on the edges are the distances 
between the cities. 


. The length of a path in a weighted graph is the sum of the 


weights of its edges. 

A similarity graph has a dissimilarity function s where s(v, w) 
measures the dissimilarity of vertices v and w. 

The n-cube has 2” vertices labeled 0, 1,...,2” — 1. An edge 
connects two vertices if the binary representation of their 
labels differs in exactly one bit. 


17. 
18. 
19. 


20. 


21. 


22. 


23. 


A serial computer executes one instruction at a time. 
A serial algorithm executes one instruction at a time. 


A parallel computer can execute several instructions at a 
time. 


A parallel algorithm can execute several instructions at a 
time. 


The complete graph on n vertices has one edge between each 
distinct pair of vertices. It is denoted Ky. 


A graph G = (V, E) is bipartite if there exist subsets V; and 
V2 (either possibly empty) of V such that V} N V2 = ©, 
V, UV2 = V, and each edge in £ is incident on one vertex in 
V, and one vertex in V2. 


The complete bipartite graph on m and n vertices has dis- 
joint vertex sets V; with m vertices and V2 with n vertices in 
which the edge set consists of all edges of the form (v1, v2) 
with vy; € Vj and v2 € V2. 


Section 8.1 


1. 


4. 


5. 


The graph is an undirected, simple graph. 


Pheasants 


Skyscrapers 


The graph is a directed, nonsimple graph. 


Pheasants Skyscrapers 


Since an odd number of edges touch some vertices (c and d), 
there is no path from a to a that passes through each edge 
exactly one time. 


. (a,c, e,b,c,d,e,f, d, b, a) 
11. 


V = {v1, v2, v3, v4}. E = {e1, €2, €3, e4, €5, eo}. ey and eg are 
parallel edges. es is a loop. There are no isolated vertices. G 
is not a simple graph. e; is incident on v; and v2. 


14. 


27. 
32. 


37. 


40. 
43. 


46. 


49. 


. Bipartite. Vj) = 
. Not bipartite 
. Bipartite. Vj = {vi}, V2 = {v2, v3}. 
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A 


a v2, V5}, = v4}. 


K2,3 K3,3 


(b, c, a, d, e) 
Two classes 
1 2 
3 
° 5 
4 
00 01 
10 11 
n 
0 1 0 1 0 
: 1 0 
1 0 O1 00 
1 0 11 10 
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50. 


53 


z=xty + z=zt+l1 


. f is not one-to-one. Let G; be the graph with vertex set 
{1, 2, 3} and edge set {(1, 2)}, and let G2 be the graph with 
vertex set {1, 2,3, 4} and edge set {(1, 2)}. Then G; 4 Go, 
but f(G1) = 1 =f(G). 

f is onto. Let n be a nonnegative integer. If n = 0, let 
G be the graph with vertex set {1, 2, 3} and edge set @. Then 
f(G) = 0 =n. Ifn > 0, let G be the graph with vertex set 
{1,2,...,,n+ 1} and edge set 


{(, 2), (2,3),..., (2,2 + 1}. 
Then f(G) = n. Therefore f is onto. 


Section 8.2 Review 


1 


. A path is an alternating sequence of vertices and edges 


(Vo, €1, V1, €2, V2, +++) Vn—1s Cn» Vn)s 


in which edge e; is incident on vertices vj; and v; for i = 
1 ee 

. A simple path is a path with no repeated vertices. 

. C, 2,3, 1) 


. Acycle is a path of nonzero length from v to v with no re- 
peated edges. 


. Asimple cycle is a cycle from v to v in which, except for the 
beginning and ending vertices that are both equal to v, there 
are no repeated vertices. 


6. (1, 2,3, 1,4, 5, 1) 


7. A graph is connected, if, given any vertices v and w, there is 


a path from v to w. 


8. 
1 2 
a 
9. 
il 2 4 5 
3 
10. Let G = (V,E) be a graph. (V’, E’) is a subgraph of G if 


V’ CV, E' CE, and, for every edge e’ € F’, if e’ is incident 
on v’ and w’, then v’, w’ € V’. 


11. 


12. 


13. 


14. 
15. 
16. 


17. 


18. 
19. 


20. 


21. 
22. 


23. 


The graph of Exercise 8 is a subgraph of the graph of 
Exercise 9. 


Let G be a graph and let v be a vertex in G. The subgraph G’ of 
G consisting of all edges and vertices in G that are contained 
in some path beginning at v is called the component of G 
containing v. 


The graph of Exercise 8 is a component of the graph of 
Exercise 9. 


One 
The degree of vertex v is the number of edges incident on v. 


An Euler cycle in a graph G is a cycle that includes all of the 
edges and all of the vertices of G. 


A graph G has an Euler cycle if and only if G is connected 
and the degree of every vertex is even. 


The graph of Exercise 8 has the Euler cycle (1, 2, 3, 1). 


The graph of Exercise 9 does not have an Euler cycle because 
it is not connected. 


The sum of the degrees of the vertices in a graph equals twice 
the number of edges in the graph. 


Yes 


The graph is connected and v and w are the only vertices hav- 
ing odd degree. 


Yes 


Section 8.2 


1. 
4. 


7. 
10. 


16. 


19. 


22. 
24. 


27. 
31. 


Cycle, simple cycle 
Cycle, simple cycle 
Simple path 


Qt 0 [1 


Suppose that there is such a graph with vertices a, b, c, d, 
e, f. Suppose that the degrees of a and bare 5. Since the graph 
is simple, the degrees of c, d, e, and f are each at least 2; thus 
there is no such graph. 


(a, a), (b,c, g, b), (b,c, d, f, g, b), 
(b, c,d, e,f, g,b), (c, 8,f.d,0), 
(c, g,f, e, d, c), (d, f, e, d) 


Every vertex has degree 4. 
G, = ({v1}, ©) 

G2 = ({v2}, ©) 

G3 = ({v1, v2}, S) 

G4 = ({v1, v2}, fer}) 
There are 17 subgraphs. 


13. 


28. No Euler cycle 
No Euler cycle 


34. 


37. 
39. 
43. 


46. 


48. 


51. 


54. 


56. 


59. 
62. 
65. 


69. 


For 


an Euler cycle is (10, 9, 6, 5, 9, 8, 5, 4, 8, 7, 4, 2, 5, 3, 2, 1, 
3, 6, 10). The method generalizes. 

m=n=2orm=n=1 

d and e are the only vertices of odd degree. 

The argument is similar to that of the proof of Theorem 
8.2.23. 

True. In the path, for all repeated a, 


eliminate a,..., b. 


Suppose that e = (v, w) is ina cycle. Then there is a path P 
from v to w not including e. Let x and y be vertices in G— {e}. 
Since G is connected, there is a path P’ in G from v to w. Re- 
place any occurrence of e in P’ by P. The resulting path from 
v to w lies in G — {e}. Therefore, G — {e} is connected. 


The union of all connected subgraphs containing G’ is a 
component. 


Let G be a simple, disconnected graph with n vertices having 
the maximum number of edges. Show that G has two compo- 
nents. If one component has i vertices, show that the compo- 
nents are K; and K,_;. Use Exercise 11, Section 8.1, to find a 
formula for the number of edges in G as a function of i. Show 
that the maximum occurs when i = 1. 


«| 


Modify the proofs of Theorems 8.2.17 and 8.2.18. 
Use Exercises 59 and 61. 


We first count the number of paths (vo, v1, ..., vx) of length 
k > 1. The first vertex vp may be chosen inn ways. Each sub- 
sequent vertex may be chosen in n — | ways (since it must be 
different from its predecessor). Thus the number of paths of 
length k is n(n — 1)*. 

The number of paths of length k, 1 < k <n, is 


it k (n— 1k -1 
Sona I* =n(n ae | a 


k=1 


n(n — 1)[(n— 1)* = 1] 
n—2 : 


If v is a vertex in V, the path consisting of v and no edges is a 
path from v to v; thus vRv for every vertex v in V. Therefore, 
Ris reflexive. 


71. 
74. 


76. 
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Suppose that vRw. Then there is a path (vo, ..., Vn), 
where vo = v and v, = w. Now (i, ..., vo) is a path from 
w to v, and thus wRv. Therefore, R is symmetric. 

Suppose that vRw and wRx. Then there is a path P; 
from v to w and a path Pz from w to x. Now P; followed 
by P2 is a path from v to x, and thus vRx. Therefore, R is 
transitive. 

Since R is reflexive, symmetric, and transitive on V, R 
is an equivalence relation on V. 


2 


Let s,, denote the number of paths of length n from v; to v1. 
We show that the sequences 51, 52,... and fj, fo,... satisfy 
the same recurrence relation, s; = fo, and s. = f%, from 
which it follows that s, = fr41 forn > 1. 

If n = 1, there is one path of length | from v, to vy, 
namely the loop on v;; thus, 5s; = fo. 

If n = 2, there are two paths of length 2 from v; to 
vy: (11, V1, V1) and (V1, v2, v1); thus, s2 = fp. 

Assume that n > 2. Consider a path of length n from 
v, to vy. The path must begin with the loop (1, v1) or the 
edge (V1, v2). 

If the path begins with the loop, the remainder of the 
path must be a path of length n — | from vj to v;. Since there 
are s,—1 such paths, there are s,—; paths of length n from v1 
to v; that begin (v1, v1,...). 

If the path begins with the edge (1, v2), the next edge 
in the path must be (v2, v;). The remainder of the path must 
be a path of length n — 2 from v; to v;. Since there are s,_2 
such paths, there are s,—2 paths of length n from v; to v, that 
begin (11, v2, ¥1,...). 

Since any path of length n > 2 from v; to vy be- 
gins with the loop (v1, v;) or the edge (v1, v2), it follows 
that 


Sn = Sp—1 + Sp-2- 


Because the sequences s;,52,... and fj, fo,... sat- 
isfy the same recurrence relation, s; = f2, and sz = fs, it 
follows that s, = fr41 forn > 1. 


Suppose that every vertex has an out edge. Choose a vertex 
vo. Follow an edge out of vo to a vertex v;. (By assumption, 
such an edge exists.) Continue to follow an edge out of v; 
to a vertex vj;1. Since there are a finite number of vertices, 
we will eventually return to a previously visited vertex. At 
this point, we will have discovered a cycle, which is a con- 
tradiction. Therefore, a dag has at least one vertex with no 
out edges. 


Section 8.3 Review 
1. 


A Hamiltonian cycle in a graph Gis a cycle that contains each 
vertex in G exactly once, except for the starting and ending 
vertex that appears twice. 


. The graph of Figure 8.3.9 has a Hamiltonian cycle and an 


Euler cycle. The Hamiltonian and Euler cycles are the graph 
itself. 


. The graph of Figure 8.3.2 has a Hamiltonian cycle, but not an 


Euler cycle. The Hamiltonian cycle is shown in Figure 8.3.3. 
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9. 


10. 
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The graph does not have an Euler cycle, because all of the 
vertices have odd degree. 


&. 
7 


5 


has an Euler cycle because it is connected and every vertex 
has even degree. It does not have a Hamiltonian cycle. To 
prove that it does not have a Hamiltonian cycle, we argue by 
contradiction. Suppose that the graph has a Hamiltonian cy- 
cle. Then, because vertices 2, 3, 4, and 5 all have degree 2, all 
the edges in the graph would have to be included in a Hamil- 
tonian cycle. Since the graph itself is not a cycle, we have a 
contradiction. 


. The graph 


. The graph consisting of two vertices and no edges has nei- 
ther a Hamiltonian cycle nor an Euler cycle because it is not 
connected. 

. The traveling salesperson problem is: Given a weighted 
graph G, find a minimum-length Hamiltonian cycle in G. The 
Hamiltonian cycle problem simply asks for a Hamiltonian 
cycle—any Hamiltonian cycle will do. The traveling sales- 
person problem asks not just for a Hamiltonian cycle, but for 
one of minimum length. 

. Asimple cycle 


. A Gray code is a sequence 51, 52, ..., 52», where each 5; is a 
string of n bits, satisfying the following: 


@ Every n-bit string appears somewhere in the sequence. 


M@ s; and s;4; differ in exactly one bit, i = 1,..., 
2” — 1. 


BH son and s; differ in exactly one bit. 
See Theorem 8.3.6. 
. See the discussion preceding Algorithm 8.3.10. 


Section 8.3 


1. (d,a,e,b,c,h, g,f, j,i, d) 
3. We would have to eliminate two edges each at b, d, i, and k, 
leaving 19— 8 = 11 edges. A Hamiltonian cycle would have 
12 edges. 
6. (a,b,c, j, i,m,k, d, e, f, l, g,h, a) 
9. 
a b 
c 
d e 
12. If n is even and m> 1 or if m is even and n> 1, there is a 


Hamiltonian cycle. The sketch shows the solution in case n 
is even. 


15. 
18. 


21. 


25. 


28. 


32. 


35. 


oS 
ie 2: 


If n = 1 or if m = 1, there is no cycle and, in particu- 
lar, there is no Hamiltonian cycle. Suppose that n and m are 
both odd and that the graph has a Hamiltonian cycle. Since 
there are nm vertices, this cycle has nm edges; therefore, the 
Hamiltonian cycle contains an odd number of edges. How- 
ever, we note that in a Hamiltonian cycle, there must be as 
many “up” edges as “down” edges and as many “left” edges 
as “right” edges. Thus a Hamiltonian cycle must have an even 
number of edges. This contradiction shows that if n and m are 
both odd, the graph does not have a Hamiltonian cycle. 


When m =nandn > | 


Any cycle C in the n-cube has even length since the vertices 
in C alternate between an even and an odd number of 1’s. 

Suppose that the n-cube has a simple cycle of length 
m. We just observed that m is even. Now m > 0, by defini- 
tion. Since the n-cube is a simple graph, m # 2. Therefore, 
m> 4. 

Now suppose that m > 4 and m is even. Let G be the 
first m/2 members of the Gray code G,—1. Then 0G, 1GF 
describes a simple cycle of length m in the n-cube. 


Suppose that the algorithm begins (a, b, c, e, f), which will 
then not extend. Since f is adjacent to a, the path will be mod- 
ified to (a, f, e, c, b). This path cannot be extended. Since b 
is adjacent to a, the path will be modified to (a, b, c, e, f). We 
are now in an infinite loop. 


If the first choices are (b, c,d, e), the algorithm will not 
terminate. 


For alln > 3, the algorithm always finds a Hamiltonian cycle 
in Ky. 


No. Let (1,...,Vn, v1) be a Hamiltonian cycle. If the 
choices are vj,...,V,, the algorithm finds a Hamiltonian 
cycle. 


37. 


40. 
43. 


46. Yes, (a,c, d,f, g,e, b) 
. : 10. 
Section 8.4 Review 

1. Label the start vertex 0 and all other vertices oo. Let T be the 
set of all vertices. Choose v € T with minimum label and 
remove v from 7. For each x € T adjacent to v, relabel x with 
the minimum of its current label and the label of v + w(v, x), 
where w(v, x) is the weight of edge (v, x). Repeat if z ¢ T. 

2. See Example 8.4.2. 

3. See the proof of Theorem 8.4.3. 13. 

Section 8.4 

1. 7; (a, b,c, f) 

4. 7; (b,c, f,j) 

6. An algorithm can be modeled after Example 8.4.2. 16. 

9. Modify Algorithm 8.4.1 so that it begins by assigning the 
weight co to each nonexistent edge. The algorithm then 
continues as written. At termination, L(z) will be equal to 
oo if there is no path from a to z. 

Section 8.5 Review 

1. Order the vertices and label the rows and columns of a matrix 19 
with the ordered vertices. The entry in row i, column j, i 4 j, : 
is the number of edges incident on i andj. If i = /, the entry is 
twice the number of loops incident on i. The resulting matrix 
is the adjacency matrix of the graph. 

2. The ijth entry in A” is equal to the number of paths of length 
n from vertex i to vertex j. 

3. Order the vertices and edges and label the rows of a matrix 22. 
with the vertices and the columns with the edges. The entry in 24, 
row v and column e is | if e is incident on v and 0 otherwise. 

The resulting matrix is the incidence matrix of the graph. 
Section 8.5 

i abcde 
afo 1111 Pas 
b{1 0100 28. 
e}1 101 1 
dj1010 1 
e\l 0 1 1 0 

4. vi v2 V3 V4 V5 V6 31. 
vy f/O 1 1 0 0 +0 
wftl o 41 0 0 0 
V3 1 t 0 0 0 0 
vwu}]O 0 0 0 0 O 
vy |O0 0 0 0 0 1 
vw \O 0 0 0 1 O 


Yes. If (vj, .--,Vn—1, Vn), VI = Vn, is a Hamiltonian cycle, he 
(V1, .--, Vn—1) is a Hamiltonian path. 
Yes, (a, b, d, g,m,1,h, i,j, e,f, kc) 


Yes, (i,j, g, h, e, d,c, b, a, f) 
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eaoerea 
ooorr 


wWwNne 


ADUWNAF 


[For Ks] 


43333 
34333 
33433 
33343 
33334 


The graph is not connected. 


a 
b 
é a 
d 


Gis not connected. 

Because of the symmetry of the graph, if v and w are vertices 
in Ks, there is the same number of paths of length n from v 
to v as there is from w to w. Thus all the diagonal elements 
of A” are equal. Similarly, all the off-diagonal elements of A” 
are equal. 

Ifn > 2, 


dn = 4an—1 by Exercise 29 


1 
=4 (5) [47-! + (-1)"] by Exercise 30. 


The formula can be directly verified for n = 1. 
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Section 8.6 Review 


1. 


Graphs G; and G2 are isomorphic if there is a one-to-one, 
onto function f from the vertices of G; to the vertices of G2 
and a one-to-one, onto function g from the edges of G to the 
edges of Gz, so that an edge e is incident on v and w in G; if 
and only if the edge g(e) is incident on f(v) and f(w) in G2. 


. The following graphs 


are isomorphic. An isomorphism is given by f(a) = 1, 
f(b) = 2, flo.) = 4, f(d) = 3, and g(a,b) = (1,2), 
g(b, c) = (2, 4), g(c, d) = (4, 3), gd, a) = (G3, 1). 


aS 


are not isomorphic; the first graph has two vertices, but the 
second graph has three vertices. 


. The following graphs 


. A property P is an invariant if, whenever G and G2 are iso- 


morphic graphs, if G; has property P, then G2 also has prop- 
erty P. 


. To show that two graphs are not isomorphic, find an invariant 


that one graph has and the other does not have. 


. Two graphs are isomorphic if and only if for some orderings 


of their vertices, their adjacency matrices are equal. 


. A rectangular array of vertices 


Section 8.6 


1. 


10. 


17. 
20. 


Relative to the vertex orderings a, b,c, d, e, f, g for G;, and 
1,3,5,7, 2, 4, 6 for G2, the adjacency matrices of G; and G2 
are equal. 


. Relative to the vertex orderings a, b, c,d, e,f,g,h,i,j for 


G,, and 5, 6, 1, 2,7, 4, 10, 8, 3, 9 for G2, the adjacency ma- 
trices of G; and G2 are equal. 


. The graphs are not isomorphic since they do not have the 


same number of vertices. 


The graphs are not isomorphic since Gj has a simple cycle 
of length 3 and G2 does not. 


. The graphs are not isomorphic. The edge (1, 4) in Gz has 


5(1) = 3 and 5(4) = 3 but there is no such edge in G, (see 
also Exercise 26). 


All are isomorphic to K2. 


If r = 2, all are isomorphic to 


There are no connected, simple, 3-regular, 5-vertex graphs 
since the number of vertices having odd degree is even. 
If r = 4, all are isomorphic to Ks. 


In Exercises 22-28, we use the notation of Definition 8.6.1. 


22. 


25. 


If (vo, v1, ---, Vx) is a simple cycle of length k in Gj, then 
(f(vo). f(v1), -.-,f(vx)) is a simple cycle of length k in G2. 
[The vertices f(vj), i= 1,...,k — 1, are distinct, since f is 
one-to-one. ] 

In the hint to Exercise 22, we showed that if C= 
(vo, V1,--++, Vk) is a simple cycle of length k in G;, then 
(f(vo), (v1), ---,f(ve)), which here we denote f(C), is a 
simple cycle of length k in Gy. Let Cj, C2,...,C, de- 
note the n simple cycles of length k in G;. Then f(C;), 
f(C2),...,f(Cn) are n simple cycles of length k in Go. 
Moreover, since f is one-to-one, f(C1), f(C2), ....f(Cn) are 
distinct. 


28. The property is an invariant. If (vo, v1,..., Vn) is an Euler 
cycle in Gj, then, since g is onto, (f(vo), f(v1), .--,f(n)) is 
an Euler cycle in Gp. 

31. IN 

34, ve 

36. 

; 

d e b 
e 
c 

39. 

(a) (b) 

42. Define g((v, w)) = (f(v), f(w)). 

43. f@) = Lf) =2,f) =3,f(@) =2 

46. f(a) = 1, f(b) =2,f(c) =3,f(d) = 1 


Section 8.7 Review 


1. 


. A contiguous region 


A graph that can be drawn in the plane without its edges 
crossing 


3. f=e-—v+2 


. Edges of the form (v, vj) and (v, v2), where v has degree 2 


and vy; 4 v2 


. Given edges of the form (v, v;) and (v, v2), where v has de- 


gree 2 and vj # v2, a series reduction deletes vertex v and 
replaces (v, vj) and (v, v2) by (11, v2). 


. Two graphs are homeomorphic if they can be reduced 


to isomorphic graphs by performing a sequence of series 
reductions. 


. A graph is planar if and only if it does not contain a subgraph 


homeomorphic to Ks or K3.3. 


Section 8.7 


1. 


6. 


12. 


15. 


Planar 


. 22=242424343434+4+44+5, 


soe=14.f=e-—v+2=14-942=7 
A graph with five or fewer vertices and a vertex of degree 2 is 
homeomorphic to a graph with four or fewer vertices. Such 


a graph cannot contain a homeomorphic copy of K3,3 or Ks. 


If Ks is planar, e < 3v — 6 becomes 10 < 3-5-6=9. 


19. 


23. 


26. 


29. 


32. 
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Tt contains 


Assume that G does not have a vertex of degree 5. Show that 
2e > 6v. Now use Exercise 13 to deduce a contradiction. 


Section 8.8 Review 


1. 


Instant Insanity consists of four cubes each of whose faces is 
painted one of the four colors, red, white, blue, or green. The 
problem is to stack the cubes, one on top of the other, so that 
whether the stack is viewed from front, back, left, or right, 
one sees all four colors. 


. Draw a graph G, where the vertices represent the four colors 


and an edge labeled i connects two vertices if the opposing 
faces of cube i have those colors. Find two graphs where 

™ Each vertex has degree 2. 

™ Each cube represents an edge exactly once in each 

graph. 

®@ The graphs have no edges in common. 
One graph represents the front/back stacking, and the other 
represents the left/right stacking. 
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— 8.8 


) G8 


G, Gy 
4. 
()? 
R BR ss B 
3 
A | | 
w G WwW G 
1 2 
G, G 
7. (a) 
2, ’ 

Re—4 CoB Re4 OB 
Wad3 G Wad3 G 
Gy G 

2 

R 3 <>B Re 4 1B 
Wao G Ww G 

G Gy 

‘i 
R 4 1B R r B 

3 _) 
1 3 

w G wh? G 

Gs G6 
Rag! B 
T<} 
wr2 G 

G; 


(b) Solutions are Gj, Gs; G1, G7; G2, Ga; G2, Go; 
G3, Go; and G3, G7. 
13. One edge can be chosen in C(2 + 4 — 1, 2) = 10 ways. The 
three edges labeled 1 can be chosen in C(3 + 10 — 1,3) = 
220 ways. Thus the total number of graphs is 2204. 


15. 
3 4 
2 ii 
19. According to Exercise 14, not counting loops, every vertex 


must have degree at least 4. In Figure 8.8.5, not counting 
loops, vertex W has degree 3 and, therefore, Figure 8.8.5 does 


not have a solution to the modified version of Instant Insan- 
ity. Figure 8.8.3 gives a solution to regular Instant Insanity 
for Figure 8.8.5. 


Chapter 8 Self-Test 


1. [Section 8.1] V = {v1, v2, v3, va}. E = {e1, e2, e3}. ey and e2 
are parallel edges. There are no loops. v; is an isolated ver- 
tex. Gis not a simple graph. e3 is incident on v2 and vq. v2 is 
incident on e1, é2, and e3. 

2. [Section 8.2] It is a cycle. 

3. [Section 8.3] (v1, v2, V3, V4, V5, V7, Vo, V1) 

4. [Section 8.5] 

vy V2 V3 V4 V5 V6 V7 
V1 0 
v2 
V3 
V4 
V5 
V6 
V7 


® 
® 
N 
® 
w 
i 
£& 
® 
wn 
ins 
a 
® 
x 
iss) 
oo 
iss 
\o 
® 
i) 
® 


VI 1 
v2 1 
v3 | 0 
v4 | 0 
v5 | O 
ve | O 
wz \O 0 0 O 


6. [Section 8.5] The number of paths of length 3 from v2 to v3 


rFPoOorcoco 
i) 
rPreococo 


0 0 


7. [Section 8.6] The graphs are isomorphic. The orderings v1, 
V2, V3, V4, Vs and w3, W1, W4, W2, Ws produce equal adjacency 
matrices. 

8. [Section 8.6] The graphs are isomorphic. The orderings v1, 
V2, V3, V4, V5, V6 and w3, We, W2, Ws, W1, W4 produce equal 
adjacency matrices. 

9. [Section 8.1] There are vertices (a and e) of odd degree. 


10. [Section 8.1] 


11. [Section 8.2] 


at U1 al 
“fe oe afy 
U2 V3 v2 U4 U3 U4 
Uy U1 Ui) 
a/| (2) ey e3 ane 
e e e 
U2 V3 «U4 U2 U3 «U4 U2 U3 «U4 


12. 


13. 
14. 


15. 


16. 


17. 
18. 


19. 


20. 


21. 
23. 
24. 


[Section 8.2] 


Uy ey U2 &2 U3 


U7 €3 
V6 Us €10 U4 


[Section 8.3] (000, 001, 011, O10, 110, 111, 101, 100, 000) 


[Section 8.3] A Hamiltonian cycle would have seven edges. 
Suppose that the graph has a Hamiltonian cycle. We would 
have to eliminate three edges at vertex b and one edge at 
vertex f. This leaves 10 — 4 = 6 edges, not enough for a 
Hamiltonian cycle. Therefore, the graph does not have a 
Hamiltonian cycle. 

[Section 8.5] No. Each edge is incident on at least one 
vertex. 

[Section 8.1] If we let V; denote the set of vertices contain- 
ing an even number of |’s and V2 the set of vertices con- 
taining an odd number of 1’s, each edge is incident on one 
vertex in V; and one vertex in V2. Therefore, the n-cube is 
bipartite. 


[Section 8.2] No. There are vertices of odd degree. 


[Section 8.3] In a minimum-weight Hamiltonian cycle, ev- 
ery vertex must have degree 2. Therefore, edges (a, b), (a,j), 
G,i), Gi, h), (g,f), (f,e), and (e, d) must be included. We 
cannot include edge (b, h) or we will complete a cycle. This 
implies that we must include edges (h, g) and (b, c). Since 
vertex g now has degree 2, we cannot include edge (c, g) or 
(g, d). Thus we must include (c, d). This is a Hamiltonian 
cycle and the argument shows that it is unique. Therefore, it 
is minimal. 

[Section 8.6] 


[Section 8.6] 
[Section 8.4] 9 


[Section 8.4] (a, e, f, i, g, z) 
[Section 8.4] 12 


22. [Section 8.4] 11 
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25. [Section 8.7] The graph is planar: 
a d 


f b 


26. [Section 8.7] The graph is not planar; the following subgraph 
is aia to Ks: 


PR 
Ly 


€ 


27. [Section 8.7] A simple, planar, connected graph with e edges 
and v vertices satisfies e < 3v — 6 (see Exercise 13, Section 
8.7). If e = 31 and v = 12, the inequality is not satisfied, so 
such a graph cannot be planar. 


28. [Section 8.7] For n = 1, 2,3, itis possible to draw the n-cube 
in the plane without having any of its edges cross: 


n= 1 mel n=3 

We argue by contradiction to show that the 4-cube is 
not planar. Suppose that the 4-cube is planar. Since every cy- 
cle has at least four edges, each face is bounded by at least 
four edges. Thus the number of edges that bound faces is at 
least 4f. In a planar graph, each edge belongs to at most two 
bounding cycles. Therefore, 2e > 4f. Using Euler’s formula 


for graphs, we find that 
2e > 4(e-—v+2). 
For the 4-cube, we have e = 32 and v = 16, so Euler’s for- 
mula becomes 
64 = 2-32 > 4(32 — 16+ 2) = 72, 


which is a contradiction. Therefore, the 4-cube is not planar. 
The n-cube, for n > 4, is not planar since it contains the 
4-cube. 


29. [Section 8.8] 
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30. 
31. 


32. 
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[Section 8.8] See the hints for Exercises 31 and 32. 
[Section 8.8] 


2 
()>(): ()>(): |} 
3 
Gy Gz G3 
2 2 4 
3 | 3 | : 
Gi Gs G 


We denote the two edges incident on B and G labeled | in the 
graph of Exercise 29 as 1 and 1’ here. 


[Section 8.8] The puzzle of Exercise 29 has four solutions. 
Using the notation of Exercise 31, the solutions are G;, Gs; 
Go, Gs; G3, Go; and G4, Ge. 


Section 9.1 Review 


1. 


A free tree T is a simple graph satisfying the following: If v and 
w are vertices in 7, there is a unique simple path from v to w. 


. Arooted tree is a tree in which a particular vertex is designated 


the root. 


. The level of a vertex v is the length of the simple path from 


the root to v. 


. The height of a rooted tree is the maximum level number that 


occurs. 


5. See Figure 9.1.9. 


6. In the rooted tree structure, each vertex represents a file or a 


. Suppose that there are n frequencies. If n = 


folder. Directly under a folder f are the folders and files con- 
tained in f. 


. A Huffman code can be defined by a rooted tree. The code for 


a particular character is obtained by following the simple path 
from the root to that character. Each edge is labeled with 0 or 
1, and the sequence of bits encountered on the simple path is 
the code for that character. 


2, build the 
tree shown in Figure 9.1.11 and stop. Otherwise, let f; and jj 
denote the smallest frequencies, and replace them in the list 
by f; + fj. Recursively construct an optimal Huffman coding 
tree using the modified list. In the tree that results, add two 
edges to a vertex labeled fj +f, and label the added vertices jj 
and fj. 


Section 9.1 


1. 


The graph is a tree. For any vertices v and w, there is a unique 
simple path from v to w. 


. The graph is a tree. For any vertices v and w, there is a unique 


simple path from v to w. 


7n=1 
8. a-1; b-1; c-1; d-1; e-2; f-3; g-3; h-4; i-2; j-3; k-0 


11. 


27 


32. 


35. 
38. 


Height = 4 
. PEN 17. SALAD 
0111100010 21. 0110000100100001111 


Let T be a tree. Root T at some arbitrary vertex. Let V be the 
set of vertices on even levels and let W be the set of vertices 
on odd levels. Since each edge is incident on a vertex in V and 
a vertex in W, T is a bipartite graph. 


é,8 


The radius is the eccentricity of a center. It is not necessarily 
true that 2r = d (see Figure 9.1.5). 


Section 9.2 Review 


Nn mn b&b WwW 


« Let (vo, .--, Vn—1, Vn) be a path from the root vo to v,. We call 


Vn—1 the parent of vy. 


. Let (vo,..., Vn) be a path from the root vo to v,. We call 


(Vj, ..-, Vn) descendants of vj;_1. 


. vand w are siblings if they have the same parent. 

. A terminal vertex is one with no children. 

. If vis not a terminal vertex, it is an internal vertex. 
. An acyclic graph is a graph with no cycles. 

. See Theorem 9.2.3. 


Section 9.2 


1. 
4. 
7. 


Kronos 
Apollo, Athena, Hermes, Heracles 
bd 10. e,f, g.isj 


13. 
22. 


27. 
30. 


33. 


36. 


a,b,c, d,e 17. They are siblings. 
25. 
A single vertex is a “cycle” of length 0. 


Each component of a forest is connected and acyclic and, 
therefore, a tree. 


Suppose that G is connected. Add parallel edges until the re- 
sulting graph G* has n — | edges. Since G* is connected and 
has n — | edges, by Theorem 9.2.3, G* is acyclic. But adding 
an edge in parallel introduces a cycle. Contradiction. 


Section 9.3 Review 


1. 


5. 


A tree T is a spanning tree of a graph G if T is a subgraph of 
G that contains all of the vertices of G. 


. A graph G has a spanning tree if and only if G is connected. 


. Select an ordering of the vertices. Select the first vertex and la- 


bel it the root. Let T consist of this single vertex and no edges. 
Add to the tree all edges incident on this single vertex that do 
not produce a cycle when added to the tree. Also add the ver- 
tices incident on these edges. Repeat this procedure with the 
vertices on level 1, then those on level 2, and so on. 


. Select an ordering of the vertices. Select the first vertex and la- 


bel it the root. Add an edge incident on this vertex to the tree, 
and add the additional vertex v incident on this edge. Next add 
an edge incident on v that does not produce a cycle when added 
to the tree, and add the additional vertex incident on this edge. 
Repeat this process. If, at any point, we cannot add an edge in- 
cident on a vertex w, we backtrack to the parent p of w and try 
to add an edge incident on p. When we finally backtrack to the 
root and cannot add more edges, depth-first search concludes. 


Depth-first search 


Section 9.3 


1. 


4. 
7. 


The path (h, f, e, g, b, d, c, a) 


10. 


13. 


17. 
20. 


23. 


26. 


27. 


30. 


33. 
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The two-queens problem clearly has no solution. For the three- 
queens problem, by symmetry, the only possible first column 
positions are upper left and second from top. If the first move 
is first column, upper left, the second move must be to the 
bottom of the second column. Now no move is possible for 
the third column. If the first move is first column, second from 
top, there is no move possible in column two. Therefore, there 
is no solution to the three-queens problem. 


False. Consider K4. 


First, show that the graph T constructed is a tree. Now use 
induction on the level of T to show that T contains all the 
vertices of G. 


Suppose that x is incident on vertices a and b. Removing x 
from T produces a disconnected graph with two components, 
U and V. Vertices a and b belong to different components— 
say,a € Uandb € V. There is a path P from a to b in T’. 
As we move along P, at some point we encounter an edge 
y = (v,w) with v € U,w é€ V. Since adding y to T — {x} 
produces a connected graph, (7 — {x}) U {y} is a spanning tree. 
Clearly, (T’ — {y}) U {x} is a spanning tree. 

Suppose that T has n vertices. If an edge is added to T, the re- 
sulting graph T’ is connected. If T’ were acyclic, T’ would be 
a tree with n edges and n vertices. Thus 7’ contains a cycle. If 
T' contains two or more cycles, we would be able to produce 
a connected graph 7” by deleting two or more edges from T’. 
But now 7” would be a tree with n vertices and fewer than 
n — | edges—an impossibility. 


ei} €2 €& €5 &€3 €4 C7 2&8 
(abca) io 0 0 1 1 8 0 
(acda) 0 1 0 0 1:0 0 1 
(acdb) 0 0 1 0 0 1 0 41 
(bcedebl) \O0 O 0 1 0 1 1 1 


Input: A graph G = (V, E) with n vertices 
Output: true if G is connected 
false if G is not connected 
is_connected(V, E) { 
T = Dfs(V, E) 
// T = (V’, E’) is the spanning tree returned by bfs 
if (\V'| == n) 
return true 
else 
return false 
} 
bfs_track_parent(V, E, parent) { 
S= (v1) 


// set v1’s parent to 0 to indicate that v; has no parent 
parent(v,) = 0 
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V’ = {vi} 
E=8 
while (true) { 
for each x € S, in order, 
for each y € V — V’, in order 
if ((x, y) is an edge) { 
add edge (x, y) to E’ and y to V’ 
parent(y) =x 
} 
if (no edges were added) 
return T 
S = children of S ordered consistently with the original 
vertex ordering 
} 
} 


34. print_parents(V, parent) { 
for each v € V 
printin(v, parent(v)) 


} 


37. An algorithm can be obtained by modifying the four-queens 
algorithm. The array row is replaced by the array p, which 
is the permutation. A conflict for p(k) now means that for 
some i < k, p(i) = p(k), that is, the value p(k) has already 
been assigned. To obtain all of the permutations, when we 
find a permutation, we print it and continue (whereas in the 
four-queens algorithm, being content with one solution, we 
terminated the algorithm). 


perm(n) { 
‘all 
pil) =0 
while (k > 0) { 
p(k) = p(k) + 1 
while (p(k) < n A p(k) conflicts) 
p(k) = p(k) + 1 
if (p(k) <n) 
if (k == n) 
printin(p) 
else { 
k=k+1 
p(k) =0 


40. The idea of the backtracking algorithm is to scan the grid (we 
chose to scan top to bottom, left to right), skipping positions 
where numbers were preassigned, and, at the next available 
position, we try 1, then 2, then 3, and so on, until we find a le- 
gal value (i.e., a value that does not conflict within its 3 x 3 sub- 
square, within its row, or within its column). If such a value is 
found, we continue with the next available position. If no such 
value can be found, we backtrack to the last position where we 
assigned a value; if that value was i, we try i+ 1, i+2 and so on. 


In the following algorithm, the value s(i, /) is the value 
in row i, column J, or 0 if no value is stored there. We assume 
that initially all values in s are set to 0, except for those values 
that are specified in the puzzle. Finally, show_values prints the 
array S. 


sudoku(s) { 
i=0 
ia 
// advance advances i and j to the next position in which 
// a value is not specified. It proceeds down a column first. 
advance(i, j) 
while (§ > 1Aj> 1) { 
// search for a legal value 
sj) =sGj+1 
// not.valid(i, j) returns true if the value s(i, /) 
// conflicts with the previously chosen and specified 
// values, and false otherwise. 
while (s(i, j) < 10 A not_valid(i, j)) 
sj) =sGj+1 
// if no value found, backtrack 
if (s(i, 7) == 10) { 
si, j) =0 
// retreat moves i and j to the previous position in 
// which a value is not specified. It proceeds up a 
// column first. 
retreat(i, j) 


} 
else 

advance(i, j) // sets j to 10 if advanced off board 
if == 10) { 


// Solution! 
show_values() 
return 
} 
} 
} 


Section 9.4 Review 


1. A minimal spanning tree is a spanning tree with minimum 
weight. 

2. Prim’s Algorithm builds a minimal spanning tree by iteratively 
adding edges. The algorithm begins with a single vertex. Then 
at each iteration, it adds to the current tree a minimum-weight 
edge that does not complete a cycle. 


3. A greedy algorithm optimizes the choice at each iteration. 


Section 9.4 
1. 4. 


10. If v is the first vertex examined by Prim’s Algorithm, the 
edge will be in the minimal spanning tree constructed by the 
algorithm. 


13. 


14. 


16. 
20. 


23. 


27. 


30. 


Suppose that G has two minimal spanning trees T; and 7. 
Then, there exists an edge x in 7 that is not in T7. By Exer- 
cise 23, Section 9.3, there exists an edge y in T> that is not in 
T, such that 73 = (T; — {x}) U {y} and Ty = (Tz — {y}) U {x} 
are spanning trees. Since x and y have different weights, either 
T3 or T4 has weight less than T;. This is a contradiction. 


False 


a a b 
2 3 
d c 
6 
False. Consider Ks with the weight of every edge equal to 1. 
Input: The edges E of an n-vertex, connected, weighted 
graph. If e is an edge, w(e) is equal to the weight 
of e; if e is not an edge, w(e) is equal to oo (a value 
greater than any actual weight). 
Output: A minimal spanning tree. 
kruskal(E, w, n) { 
V=m 
EF=D 
T= (V’, E) 
while (|E’| < n— 1) { 
among all edges that if added to 7’ would not 
complete a cycle, choose e = (vj, vj) of 
minimum weight 
E'=E' U{e} 
Vi=V'U {vi, vi} 
re (V’, E) 
} 
return 7” 
} 
Terminate Kruskal’s Algorithm after k iterations. This groups 


the data into n — k classes. 


We show that aj = 7 and a2 = 3 provide a solution. We 
use induction on n to show that the greedy solution gives an 
optimal solution for n > 1. The casesn = 1, 2,..., 8 may be 
verified directly. 

We first show that if n > 9, there is an optimal solution 
containing at least one 7. Let S’ be an optimal solution. Sup- 
pose that S’ contains no 7’s. Since S’ contains at most two 1’s 
(since S’ is optimal), S’ contains at least three 3’s. We replace 
three 3’s by one 7 and two 1’s to obtain a solution S. Since 
|S| = |S’|, S is optimal. 

If we remove a 7 from S, we obtain a solution S* to the 
(n — 7)-problem. If S* were not optimal, S could not be op- 
timal. Thus S* is optimal. By the inductive assumption, the 
greedy solution GS* to the (n — 7)-problem is optimal, so 
|S*| = |GS*|. Notice that 7 together with GS* is the greedy 
solution GS to the n-problem. Since |GS| = |S|, GS is optimal. 


Suppose that the greedy algorithm is optimal for all denomi- 
nations less than a,,—1 + dm. We use induction on n to show 
that the greedy algorithm is optimal for all n. We may assume 
that n > adm—| + dm. 
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Consider an optimal solution S for n. First suppose that 
S uses at least one a, coin. The solution, S with one a, coin 
removed, is optimal for n — ay. (If there was a solution for 
nN — A» using fewer coins, we could add one a,, coin to it to 
obtain a solution for n using fewer coins than S, which is im- 
possible.) By the inductive assumption, the greedy solution for 
N— dy is optimal. If we add one a,, coin to the greedy solution 
for n—d», we obtain a solution G for n that uses the same num- 
ber of coins as S. Therefore, G is optimal. But G is also greedy 
because the greedy solution begins by removing one a,, coin. 

Now suppose that S does not use an a, coin. Let i be the 
largest index such that S uses an q; coin. The solution, S with 
one a; coin removed, is optimal for n — a;. By the inductive 
assumption, the greedy solution for n — a; is optimal. Now 


N > An—1 + 4m = ai + Gn, 


sO N — dj > dm. Therefore, the greedy solution uses at least 
one a), coin. Thus there is an optimal solution for n — a; that 
uses an d,, coin. If we add one a; coin to this optimal solution, 
we obtain an optimal solution for n that uses an a,, coin. The 
argument in the preceding paragraph can now be repeated to 
show that the greedy solution is optimal. 


Section 9.5 Review 


i 


A binary tree is a rooted tree in which each vertex has either 
no children, one child, or two children. 


. A left child of vertex v is a child designated as “left.” 


3. A right child of vertex v is a child designated as “right.” 


4. A full binary tree is a binary tree in which each vertex has 


5. i+1 


either two children or zero children. 
6. 2i+1 


7. Ifabinary tree of height / has ¢ terminal vertices, thenlgt < h. 


. A binary search tree is a binary tree T in which data are as- 


sociated with the vertices. The data are arranged so that, for 
each vertex v in 7, each data item in the left subtree of v is less 
than the data item in v, and each data item in the right subtree 
of v is greater than the data item in v. 


. See Figures 9.5.4 and 9.5.5. 
10. 


Insert the first data item in a vertex and label it the root. In- 
sert the next data items in the tree according to the following 
steps. Begin at the root. If the data item to be added is less 
than the data item at the current vertex, move to the left child 
and repeat; otherwise, move to the right child and repeat. If 
there is no child, create one, put an edge incident on it and the 
last vertex visited, and store the data item in the added vertex. 


Section 9.5 


1. 


Example 9.5.5 showed that n— 1 games are played. Since there 
are two choices for the winner of each game, the tournament 
can unfold in 2”~! ways. 


. No. Based on past performance, it is likely that certain teams 


will defeat other teams. Someone knowledgeable about bas- 
ketball will take this into account. For example, through 2016 
a number 16 seed has never defeated a number | seed. 
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5. 
FOUR 
AND SCORE 
AGO ] FOREFATHERS ] OUR | SEVEN | 
BROUGHT ] FORTH | 
8. False. Consider 9. 


12. 


15. 
18. 
21. 
22. 


25. 


mi+1,(m—1)i+1 

t-1 

Balanced 

Balanced 

A tree of height 0 has one vertex, so No = 1. Ina balanced bi- 


nary tree of height 1, the root must have at least one child. If the 
root has exactly one child, the number of vertices will be min- 
imized. Therefore, Nj = 2. Ina balanced binary tree of height 
2, there must be a path from the root to a terminal vertex of 
length 2. This accounts for three vertices. But for the tree to be 
balanced, the root must have two children. Therefore, Nx = 4. 
Suppose that there are n vertices in a balanced binary tree of 
height 4. Then 


+2 
n> Nn=fn+3—1> (5) 1, 


for h > 3. The equality comes from Exercise 24 and the last 
inequality comes from Exercise 27, Section 4.4. Therefore, 


3 h+2 
1 = ; 
n+1> (5) 


Taking the logarithm to the base 3/2 of each side, we obtain 
log39(n + 1) > h+2. 
Therefore, 


h< [logs 2(n +1)]-—2= O(gn). 


Section 9.6 Review 


1. Preorder traversal processes the vertices of a binary tree by be- 


2. 


ginning at the root and recursively processing the current ver- 


tex, the vertex’s left subtree, and then the vertex’s right subtree. 
Input: PT, the root of a binary tree 


Output: Dependent on how “process” is interpreted 


YEARS ] 


3. 


4. 


5. 


preorder (PT) { 

if (PT == null) 
return 

process PT 
1 = left child of PT 
preorder(l) 
r = right child of PT 
preorder (r) 


} 


Inorder traversal processes the vertices of a binary tree by be- 
ginning at the root and recursively processing the vertex’s left 
subtree, the current vertex, and then the vertex’s right subtree. 

Input: 
Output: 


PT, the root of a binary tree 


Dependent on how “process” is interpreted 


inorder (PT) { 

if (PT ==null) 
return 

1 = left child of PT 
inorder(1) 
process PT 
r = right child of PT 
inorder (r) 


} 


Postorder traversal processes the vertices of a binary tree by 
beginning at the root and recursively processing the vertex’s 
left subtree, the vertex’s right subtree, and then the current 
vertex. 


Input: 
Output: 


PT, the root of a binary tree 


Dependent on how “process” is interpreted 


postorder (PT ) { 

if (PT == null) 
return 

1 = left child of PT 
postorder (1) 
r = right child of PT 
postorder (r) 
process PT 

} 


. In the prefix form of an expression, an operator precedes its 


operands. 


. Polish notation 


. In the infix form of an expression, an operator is between its 


operands. 


. In the postfix form of an expression, an operator follows its 


operands. 


. Reverse Polish notation 
12. 
13. 


No parentheses are needed. 

In a tree representation of an expression, the internal ver- 
tices represent operators, and the operators operate on the 
subtrees. 


Section 9.6 


1. preorder inorder 


ABDCE  BDAEC 
4. preorder inorder 
ABCDE  EDCBA 


6. 


postorder 
DBECA 
postorder 
EDCBA 


A BC D 


prefix: *+AB—CD 
+CD—x 


postfix: AB 


prefix: — *+*+ABCDE + *«+ABCD 
postfix: AB+C*xD+Ex*xAB+CxD4 


11. 


prefix 
usual infix 
parened infix 


14. 


prefix 


parened infix 
16. —4 
22. 


A B 


+ =f ABC 
: A+B-C 


: ((A+B)—C) 


: —*Ax*BC/C+DE 
usual infix: A*« Bx C—C/(D+E) 


> ((A* (B* C)) — (C/(D + E))) 


19. 0 


B) 


25. 


28. 
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Input: PT, the root of a binary tree 
Output: PT, the root of the modified binary tree 


swap_children (PT) { 
if (PT == null) 
return 
swap the left and right children of PT 
1 = left child of PT 
swap_children(1) 
r = right child of PT 
swap-children(r) 
} 


31. If T is a binary tree, we let post(T) denote the order in which 


32. 


35. 


the vertices of T are visited under postorder traversal. We let 
revpost(T) denote the reverse of post(T). We prove by induc- 
tion on the number of nodes in a tree T that the order in which 
funnyorder visits the nodes of T is revpost(T). 

The assertion is evident if T has no nodes. Thus the 
basis step is proved. 

Now assume that the order in which funnyorder visits 
the nodes of a tree T’ having fewer than n nodes is revpost(T"). 
Let T be an n-node tree. We must prove that the order in which 
funnyorder visits the nodes of T is revpost(T). 

Let 7; be the left subtree of T, let T> be the right sub- 
tree of T, and let r be the root of T. By the inductive assump- 
tion, the order in which funnyorder visits the nodes of T, 
is revpost(T\), and the order in which funnyorder visits the 
nodes of 7 is revpost(T2). The pseudocode shows that the 
order in which funnyorder visits the nodes of T is 


r, revpost(T2), revpost(T)). 
The reverse of this list is 
post(T,), post(T2), r, 


which is the order in which postorder visits the nodes of T. 
The inductive step is complete. 


Define an initial segment of a string to be the first i > 1 char- 
acters for some i. Define r(x) = 1, forx = A, B,...,Z; and 
r(x) = —1, forx = +,—-,*, /. If x; ---x, is a string over 
{A,...,Z, +, —, *, /}, define 


P(X +++ Xp) =r) +++ + rn). 


Then a string s is a postfix string if and only if r(s) = 1 and 
r(s’) = 1, for all initial segments s’ of s. 


Let G be the graph with vertex set {1, 2,..., } and edge set 
{0,0 |i=2,...n}. 


The {1} is a vertex cover of G of size 1. 
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38. Input: 


Output: 


PT, the root of a nonempty tree 


Each vertex of the tree has a field in_cover that is 
set to true if that vertex is in the vertex cover or to 
false if that vertex is not in the vertex cover. 


tree_cover(PT ) { 
flag = false 
ptr = first child of PT 
while (ptr ! = null) { 
tree_cover(ptr) 


if (in_cover of ptr == false) 
flag = true 
ptr = next sibling of ptr 


} 
in_cover of PT = flag 


Section 9.7 Review 


1. A decision tree is a binary tree in which the internal ver- 
tices contain questions with two possible answers, the edges 
are labeled with answers to the questions, and the terminal 
vertices represent decisions. If we begin at the root, an- 
swer each question, and follow the appropriate edges, we 
will eventually arrive at a terminal vertex that represents a 
decision. 


2. The worst-case time of an algorithm is proportional 


to the height of the decision tree that represents the 
algorithm. 


. A decision tree that represents a sorting algorithm has n! 
terminal vertices corresponding to the n! possible permuta- 
tions of input of size n. If h is the height of the tree, then h 
comparisons are required in the worst case. Since Ign! < h 
and Ign! = ©(nlgn), worst-case sorting requires at least 
Q(nlgn) comparisons. 


Section 9.7 
1. 


4. In this graph only, if the left pan is heavier, go right. 


C1CaC3C4: CsCgCzCg } 


C.L Co,L Cs.H 


CyH C3,L 


ala Gib 


Cs,H 


Cs, L 


C2 


C1C2C3: CoCoCr] 


CsCoC1 2 C7CgCr 


wel 


7. 


12. 


There are 28 possible outcomes to the fourteen-coins puzzle. 
A tree of height 3 has at most 27 terminal vertices; thus at least 
four weighings are required in the worst case. In fact, there is 
an algorithm that uses four weighings in the worst case: We 
begin by weighing four coins against four coins. If the coins do 
not balance, we proceed as in the solution given for Exercise 4 
(for the twelve-coins puzzle). In this case, at most three weigh- 
ings are required. If the coins do balance, we disregard these 
coins; our problem then is to find the bad coin from among 
the remaining six coins. The six-coins puzzle can be solved 
in at most three weighings in the worst case, which, together 
with the initial weighing, requires four weighings in the worst 
case. 


Let f(n) denote the number of weighings needed to solve the 
n-coin problem in the worst case. Let T be the decision tree 
that represents this algorithm for input of size n, and let h de- 
note the height of T. Then the algorithm requires h weighings 
in the worst case, so h = f(n). Since there are n — | possible 
outcomes, T has at least n — | terminal vertices. By the analog 
of Theorem 9.5.6 for “trinary” trees, log3(n — 1) < h=f(n). 


The decision tree analysis shows that at least [lg5!] = 7 
comparisons are required to sort five items in the worst case. 
The following algorithm sorts five items using at most seven 
comparisons in the worst case. 

Given the sequence a), ..., a5, we first sort a}, a2 (one 
comparison) and then a3, a4 (one comparison). (We assume 
now that a, < az and a3 < a4.) We then compare az and a4. 
Let us assume that az < a4. (The case az > aq is symmetric, 
and for this reason that part of the algorithm is omitted.) At 
this point we know that 


a, <ar<a4 and a3 < a4. 


Next we determine where as belongs among a), a2, and a4 
by first comparing a5 with az. If as < a2, we next compare 
as with a,; but if as > a2, we next compare a5 with a4. In 
either case, two additional comparisons are required. At this 
point, a1, a2, a4, ds is sorted. Finally, we insert a3 in its proper 
place. If we first compare a3 with the second-smallest item 
among a}, a2, a4, a5, only one additional comparison will 
be required, for a total of seven comparisons. To justify this 
last statement, we note that the following arrangements are 
possible after we insert as in its correct position: 


a5 <a, <a <a 
a, <da5 <a) < a4 
a, < a2 <d5 < a4 


a, < a2 < a4 < as. 


If a3 is less than the second item, only one additional com- 
parison is needed (with the first item) to locate the correct 
position for a3. If a3 is greater than the second item, at most 
one additional comparison is needed to locate the correct po- 
sition for a3. In the first three cases, we need only compare a3 
with either az or as to find the correct position for a3 since we 
already know that a3 < ag. In the fourth case, if a3 is greater 
than az, we know that it goes between az and a4. 


14. 


17. 


20. 
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We can consider the numbers as contestants and the internal 
vertices as winners where the larger value wins. 


Suppose we have an algorithm that finds the largest value 
among X1,...,Xn,. Let x1,...,%, be the vertices of a graph. 
An edge exists between x; and x; if the algorithm compares 
x; and x;. The graph must be connected. The least number of 
edges necessary to connect n vertices isn — 1. 


By Exercise 16, tournament sort requires 2 — 1 comparisons 
to find the largest element. By Exercise 18, tournament sort 
requires k comparisons to find the second-largest element. 
Similarly, tournament sort requires at most k comparisons to 
find the third-largest, at most k comparisons to find the fourth- 
largest, and so on. Thus the total number of comparisons is at 
most 


Dk < 2* + Kk 
< kk + Kk 
= 2-2*k = Inlgn. 


[2* — 1) + 2 


Section 9.8 Review 


1. 


Free trees T; and T2 are isomorphic if there is a one-to-one, 
onto function f from the vertex set of T; to the vertex set of 
T satisfying the following: Vertices v; and v; are adjacent in 
T| if and only if the vertices f(v;) and f(v;) are adjacent in T. 
Let 7; be a rooted tree with root r; and let 7 be a rooted 
tree with root r2. Then 7; and T> are isomorphic if there is 
a one-to-one, onto function f from the vertex set of T; to the 
vertex set of T> satisfying the following: 

(a) v; and v; are adjacent in 7 if and only if f(v;) and f(v;) 

are adjacent in 7). 


(b) f(r) =f). 

Let 7; be a binary tree with root r; and let 72 be a binary 

tree with root r2. Then 7; and Ty are isomorphic if there is 

a one-to-one, onto function f from the vertex set of T; to the 

vertex set of T> satisfying the following: 

(a) v; and v; are adjacent in 7) if and only if f(v;) and f(v;) 
are adjacent in 7>. 


(b) f(r) =f). 

(c) visa left child of w in 7 if and only if f(v) is a left child 
of f(w) in T>. 

(d) v is aright child of w in 7; if and only if f(v) is a right 
child of f(w) in 79. 


C(2n, n)/(n+ 1) 


Given binary trees 7; and 72, we first check whether either is 
empty (in which case it is immediate whether they are isomor- 
phic). If both are nonempty, we first check whether the left 
subtrees are isomorphic and then whether the right subtrees 
are isomorphic. 7; and T2 are isomorphic if and only if their 
left and right subtrees are isomorphic. 


Section 9.8 


1. 


Isomorphic. f(vj}) = wi, f(v2) = ws, f3) = 


f(v4) = wa, f(vs) = wo, f (v6) = we. 


W3, 
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4. 


10. 


13. 
16. 


19. 


22. 


25. 


27. 


29. 


. Isomorphic as 
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Not isomorphic. 7> has a simple path of length 2 from a vertex 
of degree 1 to a vertex of degree 1, but 7; does not. 

rooted trees. f(vj)=wi, f(v2)=wa, 
f(v3) =w3, f(v4) =w, f(vs) =Wo, f(v6) =ws, f(v7) = 7, 
JS (vg) = wg. Also isomorphic as free trees. 

Not isomorphic as binary trees. The root of 7; has a left child 


but the root of Tz does not. Isomorphic as rooted trees and as 
free trees. 


/\ 


ASINK SE) 
PS<<?P dN 


Let b, denote the number of nonisomorphic, n-vertex full bi- 
nary trees. Since every full binary tree has an odd number of 
vertices, b, = 0 if n is even. We show that if n = 21+ 1 is 
odd, b, = C;, where C; denotes the ith Catalan number. 

The last equation follows from the fact that there is a 
one-to-one, onto function from the set of i-vertex binary trees 
to the set of (2i + 1)-vertex full binary trees. Such a function 
may be constructed as follows. Given an i-vertex binary tree, 
at every terminal vertex we add two children. At every vertex 
with one child, we add an additional child. Since the tree that 
is obtained has i internal vertices, there are 2i+ 1 vertices total 
(Theorem 9.5.4). The tree constructed is a full binary tree. No- 
tice that this function is one-to-one. Given a (2i + 1)-vertex 
full binary tree 7’, if we eliminate all the terminal vertices, 
we obtain an i-vertex binary tree T. The image of T is T’. 
Therefore, the function is onto. 


There are four comparisons at lines 1 and 3. By Exercise 24, 
the call bin_tree_isom(Ic_r,, lc_r2) requires 6(k — 1) +2 com- 
parisons. The call bin_tree_isom(rc_r,, rc_r2) requires four 
comparisons. Thus the total number of comparisons is 


4+ 6(k—-1)+2+4=6k+4. 

Let 7* denote the tree constructed. Then 7* is a full binary 
tree. Each vertex in T becomes an internal vertex in T*. Since 
we added only terminal vertices, the original n — | vertices 
in T are the only internal vertices in 7*. By Theorem 9.5.4, 
T* has n terminal vertices. Therefore T* € X,. We leave it 
to the reader to check that this mapping is a bijection. By 
Theorem 9.8.12, there are C,_; (n — 1)-vertex binary trees. 
Therefore |X,| = Cy-1. 

By Theorem 9.5.4, a tree in X; has n — | internal vertices 
and 2n — | total vertices. Thus we may choose the vertex v in 


2n — 1 ways and the vertex to mark (left or right) in 2 ways. 
Therefore |X7| = 2(2n — 1). 


33. Using iteration, we have 


2(2n — 1) 
Cr = —————Ch-1 
n+1 


2(2n — 1) 2(2n — 3) 


= Cy 
are | hn n—2 


2?(2n — 1)(2n — 3) 
oo (n+1)n a 
23(2n — 1)(2n — 3)(2n — 5) , 
= n+ Dan—D mS 


2"-1(2n — 1)(2n Sad 
~ at Dnn—-). 3 


1 - — 1)(2n—3)--- ‘ 


n+1 n(n—1)---3-2 
1 —— od 
n+1 n'n! 
1 [(2n)(2n — 2) -+-2][(2n — 1)(2n — 3)---3] 
~ n+l { nin! \ 
= pe: = : C(2n, n). 


n+1 nin! n+1 


Section 9.9 Review 


1. In a game tree, each vertex shows a particular position in the 


game. In particular, the root shows the initial configuration of 
the game. The children of a vertex show all possible responses 
by a player to the position shown in the vertex. 


. In the minimax procedure, values are first assigned to the ter- 


minal vertices in a game tree. Then, working from the bottom 
up, the value of a circle is set to the minimum of the values of 
its children, and the value of a box is set to the maximum of 
the values of its children. 


3. A search that terminates n levels below the given vertex. 


4. An evaluation function assigns to each possible game position 


the value of the position to the first player. 


. Alpha-beta pruning deletes (prunes) parts of the game tree and 


thus omits evaluating parts of it when the minimax procedure 
is applied. Alpha-beta pruning works as follows. Suppose that 
a box vertex v is known to have a value of at least x. When a 
grandchild w of v has a value of at most x, the subtree whose 
root is the parent of w is deleted. Similarly, suppose that a 
circle vertex v is known to have a value of at most x. When a 
grandchild w of v has a value of at least x, the subtree whose 
root is the parent of w is deleted. 


6. An alpha value is a lower bound for a box vertex. 


7. An alpha cutoff occurs at a box vertex when a grandchild w of 


v has a value less than or equal to the alpha value of v. 


8. A beta value is an upper bound for a circle vertex. 


. A beta cutoff occurs at a circle vertex when a grandchild w of 


v has a value greater than or equal to the beta value of v. 
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Section 9.9 


1. 
1]? [2 0)! 
EOLOLOIOTOLO) 
1 1 
0 0) 
The first player always wins. The winning strategy is to first 12. The value of the root is 3. 
take one token; then, whatever the second player does, leave 14. (For Exercise 11) 
one token. 
4. The second player always wins. If two piles remain, leave piles 10 
with equal numbers of tokens. If one pile remains, take it. 
7. Suppose that the first player can win in nim. The first player 5 (10) @: 
can always win in nim’ by adopting the following strategy: 
Play nim’ exactly like nim unless the move would leave an J 
: ‘ ; : 12 9 BS) 10 3 
odd number of singleton piles and no other pile. In this case, 


leave an even number of piles. 
Suppose that the first player can always win in nim’. (12) (8)(4) (9) (s) we Sty Od 


The first player can always win in nim by adopting the fol- 


lowing strategy: Play nim exactly like nim’ unless the move 15. 3-2=1 
would leave an even number of singleton piles and no other 18. 4-1=3 
pile. In this case, leave an odd number of piles. 19. 
9. 
20 


@) (8) 20) 


O will move to a corner. 
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22. Input: The root PT of a game tree; the type PT_ type 
of PT (box or circle); the level PT_ level of PT; 
the maximum level n to which the search is to 
be conducted; an evaluation function E; and 

a number ab_ val (which is either the alpha- or 
beta-value of the parent of PT). (The initial call 
sets ab_ val to oo if PT is a box vertex or to 
—oo if PT is a circle vertex.) 


Output: The game tree with PT evaluated 


alpha_beta_prune(PT, PT_type, PT_level, n, E, ab_val) { 
if (PT_level == n) { 
contents(PT) = E(PT) 


return 

} 

if (PT_type == box) { 
contents(PT ) = —co 


for each child C of PT { 
alpha_beta_prune(C, circle, PT_level + 1,n, 
E, content(PT )) 
c_val = contents(C) 
if (c_val > ab_val) { 
contents(PT ) = ab_val 
return 


if (c_val > contents(PT )) 
contents(PT ) = c_val 
} 
} 
else { 
contents(PT ) = co 
for each child C of PT { 
alpha_beta_prune(C, box, PT_level + 1, n, 
E, content(PT )) 
c_val = contents(C) 
if (c_val < ab_val) { 
contents(PT ) = ab_val 
return 
} 
if (c_val < contents(PT)) 
contents(PT ) = c_val 
} 
} 
} 


23. We first obtain the values 6, 6, 7 for the children of the root. 
Then we order the children of the root with the rightmost child 
first and use the alpha-beta procedure to obtain 


27. m:nmeans player m plays player n. The left child of an m:n 
vertex means that m won that match, and the right child of an 
m : n vertex means that n won that match. The terminal ver- 
tices show who was the game winner (i.e., the first player to 
score at least 2 points). 


4 
clefe e 
4 
4 4 
1:2 


1 3:2 2 3:1 


ww 
N 
we 
— 


Chapter 9 Self-Test 
1. [Section 9.1] 


2. [Section 9.1] a-2, b-1, c-0, d-3, e-2, f-3, g-4, h-5, i-4, j-5, 


k-5, 1-5 
3. [Section 9.1] 5 
4. [Section 9.2] 
(a) b 
(b) a,c 


(c) d,a,c,h,j,k,l 
(d) 


5. [Section 9.5] 


7. [Section 9.6] ABFGCDE 
9. [Section 9.6] GFBEDCA 
10. [Section 9.4] 


6. [Section 9.5] 16 


8. [Section 9.6] BGFAEDC 


2 14 3 
6 12 
6 
5 
4 
8 9 


11. 


12. 


13. 


14. 


15. 
16. 


17. 


18. 


19. 


20. 
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[Section 9.4] (1, 4), C1, 2), (2, 5), (2, 3), (3, 6), (6, 9), (4, 7), 
(7, 8) 

[Section 9.4] (6, 9), (3, 6), (2, 3), (2, 5), (1, 2), (1, 4), (4, 7), 
(7, 8) 

[Section 9.4] Consider a “shortest-path algorithm” in which at 
each step we select an available edge having minimum weight 


incident on the most recently added vertex (see the discussion 
preceding Theorem 9.4.5). 


[Section 9.1] 


[Section 9.2] True. See Theorem 9.2.3. 


[Section 9.2] True. A tree of height 6 or more must have seven 
or more vertices. 


[Section 9.2] False. 
[Section 9.6] 


postfix: EBD/ *CA-— — 
parened infix: ((E£ * (B/D)) — (C —A)) 


[Section 9.8] True. If f is an isomorphism of 7; and 72 as 
rooted trees, f is also an isomorphism of 7; and T> as free 
trees. 


[Section 9.8] False. 


a 


T; T2 
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21. [Section 9.5] 25. [Section 9.3] 


PROCESSING 


CLEAN] PRODUCES] d sf 


BUT | MANUSCRIPTS] PROSE] 


h i 


oq 


CLEAR] NOT | 


NECESSARILY ] 


22. [Section 9.5] We first compare MORE with the word WORD 
in the root. Since MORE is less than WORD, we go to the left 
child. Next, we compare MORE with PROCESSING. Since d f 
MORE is less than PROCESSING, we go to the left child. € 
Since MORE is greater than CLEAN, we go to the right child. 
Since MORE is greater than MANUSCRIPTS, we go to the g h 
right child. Since MORE is less than NOT, we go to the left 
child. Since MORE is less than NECESSARILY, we attempt 
to go to the left child. Since there is no left child, we conclude 27. [Section 9.8] Isomorphic. f(v1) = we, f(v2) = w2, f(v3) = 
that MORE is not in the tree. ws, f(va) = w7, f(vs) = wa, f(ve) = wi, f(v7) = ws, 
23. [Section 9.3] Fa) = we. 
28. [Section 9.8] Not isomorphic. 7; has a vertex (v3) on level 1 
of degree 3, but Tz does not. 


26. [Section 9.3] 


a 
im 
ist 


29. [Section 9.7] An algorithm that requires at most two weigh- 
d f ings can be represented by a decision tree of height at most 2. 
However, such a tree has at most nine terminal vertices. Since 
there are 12 possible outcomes, there is no such algorithm. 
Therefore, at least three weighings are required in the worst 


case to identify the bad coin and determine whether it is heavy 
24. [Section 9.3] or light. 


oq 
~ 


Q 
> 
al 


30. [Section 9.7] 


oq 
> 


31. [Section 9.7] According to Theorem 9.7.3, any sorting algo- 


32. 


33. 
34. 


rithm requires at least Cnlgn comparisons in the worst case. 
Since Professor Sabic’s algorithm uses at most 100” compar- 
isons, we must have Cnlgn < 100n for alln > 1. If we cancel 
n, we obtain Clgn < 100 for all n > 1, which is false. There- 
fore, the professor does not have a sorting algorithm that uses 
at most 100n comparisons in the worst case for all n > 1. 


[Section 9.7] In the worst case, three comparisons are required 
to sort three items using an optimal sort (see Example 9.7.2). 

If n = 4, binary insertion sort sorts three items (three 
comparisons—worst case) and then inserts the fourth item in 
the sorted three-item list (two comparisons—worst case) for a 
total of five comparisons in the worst case. 

If n = 5, binary insertion sort sorts four items (five 
comparisons—worst case) and then inserts the fifth item in the 
sorted four-item list (three comparisons—worst case) for a to- 
tal of eight comparisons in the worst case. 

If n = 6, binary insertion sort sorts five items (eight 
comparisons—worst case) and then inserts the sixth item in 
the sorted five-item list (three comparisons—worst case) for a 
total of eleven comparisons in the worst case. 

The decision tree analysis shows that any algorithm re- 
quires at least five comparisons in the worst case to sort four 
items. Thus binary insertion sort is optimal if n = 4. 

The decision tree analysis shows that any algorithm re- 
quires at least seven comparisons in the worst case to sort five 
items. It is possible, in fact, to sort five items using seven com- 
parisons in the worst case. Thus binary insertion sort is not 
optimal ifn = 5. 

The decision tree analysis shows that any algorithm re- 
quires at least ten comparisons in the worst case to sort six 
items. It is possible, in fact, to sort six items using ten com- 
parisons in the worst case. Thus binary insertion sort is not 
optimal if n = 6. 

[Section 9.9] 3 —1=2 


[Section 9.9] Let each row, column, or diagonal that contains 
one X and two blanks count 1. Let each row, column, or di- 
agonal that contains two X’s and one blank count 5. Let each 
row, column, or diagonal that contains three X’s count 100. 
Let each row, column, or diagonal that contains one O and 
two blanks count —1. Let each row, column, or diagonal that 
contains two O’s and one blank count —5. Let each row, col- 
umn, or diagonal that contains three O’s count — 100. Sum the 
values obtained. 


35. [Section 9.9] 


36. 
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[Section 9.9] 


Section 10.1 Review 


i. 


= 


A network is a simple, weighted, directed graph with a des- 
ignated vertex having no incoming edges, a designated vertex 
having no outgoing edges, and nonnegative weights. 


A source is a vertex with no incoming edges. 
A sink is a vertex with no outgoing edges. 
The weight of an edge is called its capacity. 


A flow assigns each edge a nonnegative number that does not 
exceed the capacity of the edge such that for each vertex v, 
which is neither the source nor the sink, the flow into v equals 
the flow out of v. 


The flow in an edge is the nonnegative number assigned to it 
as in Exercise 5. 


If Fj is the flow in edge (i, j), the flow into vertex j is }>; Fi. 
If Fj; is the flow in edge (i, j), the flow out of vertex i is yi Fj. 


Conservation of flow refers to the equality of the flow into and 
out of a vertex. 


They are equal. 


If a network has multiple sources, they can be tied together 
into a single vertex called the supersource. 


If a network has multiple sinks, they can be tied together into 
a single vertex called the supersink. 


Section 10.1 


iE 


4. 


7. 


(b, c) is 6, 3; (a, d) is 4, 2; (c, e) is 6, 1; (c, z) is 5, 2. The value 
of the flow is 5. 

Add edges (a, w1), (a, w2), (a, W3), (A, z), (B, 2), and (C, z) 
each having capacity oo. 
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Section 10.2 Review 


1. 
2. 


. Ignoring the direction of edges, let P = (vo, .. 


A maximal flow is a flow with maximum value. 


Ignoring the direction of edges, let P = (vo, ..., Vn) be a path 
from the source to the sink. If an edge in P is directed from 
v;-1 to vj, we say that it is properly oriented with respect to P. 
-, Vn) be a path 
from the source to the sink. If an edge in P is directed from 
v; to vj-1, we Say that it is improperly oriented with respect 
to P. 


. We can increase the flow in a path when every properly ori- 


ented edge is under capacity and every improperly oriented 
edge has positive flow. 


. Let A be the minimum of the numbers Cj — Fjj, for properly 


oriented edges (i,j) in the path, and Fj, for improperly ori- 
ented edges (i, /) in the path. Then the flow can be increased 
by A by adding A to the flow in each properly oriented edge 
and by subtracting A from the flow in each improperly ori- 
ented edge. 


. Start with a flow (e.g., assign each edge flow zero). Search for 


a path as described in Exercise 4. Increase the flow in such a 
path as described in Exercise 5. 


Section 10.2 


1. 
4. 


10. 


1 


(a, w1) — 6, (a, w2) — 0, (a, w3) — 3, (wi, b) — 6, (wa, b) — 0, 
(w3, d)—3, (d, c)—3, (b, c)—2, (b, A)—4, (c, A)—2, (c, B)—3, 


(A, z) — 6, (B, z) — 3 


(a, A—7:00) — 3000, (a, A—7:15) —3000, (a, A—7:30) — 2000, 


(A — 7:00, B — 7:30) — 1000, (A — 7:00, C — 7:15) — 2000, 
(A — 7:15, B — 7:45) — 1000, (A — 7:15, C — 7:30) — 2000, 
(A — 7:30, C — 7:45) — 2000, (B — 7:30, D — 7:45) — 1000, 
(C — 7:15, D — 7:30) — 2000, (B — 7:45, D — 8:00) — 1000, 
(C — 7:30, D — 7:45) — 2000, (C — 7:45, D — 8:00) — 2000, 
(D—7:45, z)—3000, (D—7:30, z)—2000, (D—8:00, z)—3000. 


All other edges have flow equal to 0. 


13. 


16. 
19. 


d 2,2 e 


The maximum flow is 9. 


Suppose that the sum of the capacities of the edges incident on 
ais U. Each iteration of Algorithm 10.2.5 increases the flow 
by 1. Since the flow cannot exceed U, eventually the algorithm 
must terminate. 


Section 10.3 Review 


1. 


A cut in a network consists of a set P of vertices and the com- 
plement P of P, where the source is in P and the sink is in P. 


. The capacity of a cut (P, P) is the number 


dou: 


ieP jeP 


. The capacity of any cut is greater than or equal to the value of 


any flow. 


. A minimal cut is a cut having minimum capacity. 


. If the value of a flow equals the capacity of a cut, then the 


flow is maximal and the cut is minimal. The value of a flow F 
equals the capacity of a cut (P, P) if and only if F, ij = Cy for 
allie P,j € P, and Fj = 0 for alli € P,j € P. 


. Let P be the set of labeled vertices, and let P be the set of un- 


labeled vertices at the termination of Algorithm 10.2.4. It can 
be shown that the conditions 


B Fy =CjforallieP,jeP 
§ Fj =Oforallie P,jeP 


of Exercise 5 hold. Thus the flow is maximal. 


Section 10.3 


1. 


20. 


8; minimal 
. P = {a, b, d} 
. P= {a,d} 
. P= {a, w1, wo, w3, b, d, e} 


. P={a,w),w2,w3,b, c,d, d', e,f, A, B, C} 
. P= {a,b,c,f, g,h,j,k,1,m} 


Ll 251 
a 5b & 
with Cap = 1, Coz = 2, Map = 1, Mpz = 2. 
Alter Algorithm 10.2.4. 


23. False. Consider the flow 
Ld 2,1 


cs = = ° 


a b £ 
and the cut P = {a, b}. 


Section 10.4 Review 


In the solutions to Exercises 1-5, G is a directed, bipartite graph 
with disjoint vertex sets V and W in which the edges are directed 
from V to W. 


1. A matching for Gis a set of edges with no vertices in common. 


2. A maximal matching for G is a matching containing the max- 
imum number of edges. 

3. A complete matching for G is a matching E having the prop- 
erty that if v € V, then (v, w) € E for some w € W. 


4. Adda supersource a and edges from a to each vertex in V. Add 
a supersink z and edges from each vertex in W to z. Assign all 
edges capacity 1. We call the resulting network a matching 
network. Then, a flow in the matching network gives a match- 
ing in G [v is matched with w if and only if the flow in edge 
(v, w) is 1]; a maximal flow corresponds to a maximal match- 
ing; and a flow whose value is |V| corresponds to a complete 
matching. 


5. If SC V, let 
R(S) = {w € W|v € S and (v, w) is an edge in G}. 


Hall’s Marriage Theorem states that there exists a complete 
matching in G if and only if |S| < |R(S)| for all § C V. 


Section 10.4 
1. P= {a, A, B, D, Jo, Js} 
3. Finding qualified persons for jobs 
6. Finding qualified persons for all jobs 


9. All unlabeled edges are 1,0. There is no complete matching. 


13. Each row and column has at most one label. 


17. If 6(G) = 0, then |S] — |R(S)| < 0, for all S C V. By Theo- 
rem 10.4.7, G has a complete matching. 
If G has a complete matching, then |S| — |R(S)| < 0, 
for all S C V, so 6(G) < 0. If S = ©, |S| — |R(S)| = 0, so 
5(G) = 0. 
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Chapter 10 Self-Test 


1. [Section 10.1] In each edge, the flow is less than or equal to 
the capacity and, except for the source and sink, the flow into 
each vertex v is equal to the flow out of v. 


2. [Section 10.1] 3 

3. [Section 10.1] 3 

4. [Section 10.1] 3 

5. [Section 10.2] (a, b, e, f, g, z) 

6. [Section 10.2] Change the flows to Fy, = 2, Fe, = 1, 
Fe =1,F pg = 1, Fe = 1. 

7. [Section 10.2] Fao 3, Foe 3, Fed 4, Faz 4, 
Fae 2, Fog 2, Ffjc 2, Fg 1, Fy. 1, and all 
other edge flows zero. 

8. [Section 10.2] Fup 0, Foc 5; Fed 5, Faz 8, 
Fey = 3, Fea = 3, Fae = 8, Fer = 3, Fee = 3, Fan = 4, 
Fei = 2, Fj.z = 6, Fhi = 4, Fi = 6, and all other edge flows 
Zero. 


9. [Section 10.3] a 
10. [Section 10.3] 6 


11. [Section 10.3] No. The capacity of (P, P) is 6, but the capacity 
of (P’, P’), P’ = {a, b,c, e, f}, is 5. 


12. [Section 10.3] P = {a, b,c, e, f, g, h, i} 
13. [Section 10.4] 


True, b—False, c—False, d—True 


14. [Section 10.4] See the solution to Exercise 13. 


15. [Section 10.4] A — Jo, B— J), C — J3, D — Js is a complete 
matching. 


16. [Section 10.4] P = {a} 


Section 11.1 Review 
1. A combinatorial circuit is a circuit in which the output is 
uniquely defined for every combination of inputs. 
2. A sequential circuit is a circuit in which the output is a function 


of the input and state of the system. 


3. An AND gate receives input x; and x2, where x; and x2 are 
bits, and produces output | if x; and x2 are both 1, and 0 oth- 
erwise. 


4. An OR gate receives input x; and x2, where x; and x2 are bits, 
and produces output 0 if x; and x2 are both 0, and | otherwise. 
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. A NOT gate receives input x, where x is a bit, and produces 


output | if x is O, and 0 if x is 1. 


. An inverter is a NOT gate. 


. A logic table of a combinatorial circuit lists all possible inputs 


together with the resulting outputs. 


..,X, are defined 
recursively as follows. 0, 1,2x1,...,%, are Boolean expres- 
sions. If X; and Xz are Boolean expressions, then (X;), X1, 
X, V X2, and X; A X2 are Boolean expressions. 


. A literal is the symbol x or ¥ that appears in a Boolean 


expression. 


Section 11.1 


1. xX, A x2 
x FLA %2 Xi AX2 
Xp x2 x1 AX2 | | >o— 
14 0 a 
1 0 1 
Oo 1 1 
0 O 1 
4. 
xX) X23 (G1 A x2) V 1 A x3)) AX3 
1 1 1 0 
1 1 0 1 
1 0 1 0 
1 0 0 1 
0 1 1 0 
0 1 0 1 
0 0 1 0 
0 0 0 1 
7. If x = 1, the output y is undetermined: Suppose that x = 1 


10. 
13. 
16. 


19. 
22. 


and y = 0. Then the input to the AND gate is 1, 0. Thus the 
output of the AND gate is 0. Since this is then NOTed, y = 1. 
Contradiction. Similarly, if x = 1 and y = 1, we obtain a 
contradiction. 

0 

1 


Is a Boolean expression. x1, x2, and x3 are Boolean expres- 
sions by (11.1.2). x2 V x3 is a Boolean expression by (11.1.3c). 
(x2 V x3) is a Boolean expression by (11.1.3a). x1 A (x2 V x3) 
is a Boolean expression by (11.1.3d). 


Not a Boolean expression 


ee Gee ae 


25. (AA B)V (CAA) 


> 
& 
io) 


(A A B)v (CAA) 


corr OOF Ke 
oOorororcr 
Oororoorr 


coco CO Re Ree 


27. (AA(CV (DAC))) V (BA (DV (CAA) V C)) 


29. 


32. 


AB (AVB)AA 

1 1 1 

1 0 1 

0 I 0 

0 O 0 
A 

o—_] ie eee 
B 

B C 
A 
: B C |g 

A B Cc 


Section 11.2 Review 


An kwWN = 


»(aVb)Vc=av (bVc), (aNb)Ac=anr (bAc) 


aVb=bvVa,a\b=baa 


» AaA(bVc) = (aNb)V (adc), av (bAc) = (av b) A (av c) 


aVO=a,aAl=a 


aVa=1,aAa=0 


. Boolean expressions are equal if they have the same values for 


all possible assignments of bits to the literals. 


. Combinatorial circuits are equivalent if, whenever the circuits 


receive the same inputs, they produce the same outputs. 


. Let C; and C2 be combinatorial circuits represented, respec- 


tively, by the Boolean expressions X; and X27. Then C; and Cz 
are equivalent if and only if X; = X2. 


Section 11.2 18. 
1. 
xX] x2 Xp A X2 XV X2 
1 1 0 0 21. 
1 0 1 1 
0 1 1 1 
0 0 1 1 
4, 
X, X2 %3 X1 V (X2 V x3) (x1 A x2) V x3 
1 1 1 1 1 
1 1 O 0 0 
101 1 1 1. 
1 0 O 1 1 
0 1 1 1 1 
0 1 O 1 1 
0 0 1 1 1 2; 
0 0 O 1 1 4. 
~ 6. 
Xx] Xt VX] 7. 
8 
1 1 
0 0) 9 
9. 
Xp X23 xX A (%2 A x3) (x1 A X2) V (x1 A X3) 
1 1 1 0 0 2. 
1 1 O 1 1 
1 0 1 1 1 
1 0 O 1 1 
0 1 1 0 0 
0 1 0 0 0 
0 0 1 0 0 
0 0 0 0 0 
11. 
x x 
1 1 a 
0 0 
14. False. Take x) = 1, x2 = 1,x3 =0 
16. 
a@ ob « av (bac) (av b) A (aVc) 
1 1 1 1 1 
1 1 O 1 1 
L oO 7 1 1 
1 0 O 1 1 
Oo 1 1 1 1 
0 1 +0 0 0 
0 oO 1 0 0 
0 0 0 0 0 
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The Boolean expressions that represent the circuits are 
(AA B)V (AAC) andA A (BV C). The expressions are equal 
by Theorem 11.2.1(c). Therefore, the switching circuits are 
equivalent. 


Cc D E 


Section 11.3 Review 


A Boolean algebra consists of a set S containing distinct ele- 
ments 0 and 1, binary operators + and -, and a unary opera- 
tor’ on S satisfying the associative, commutative, distributive, 
identity, and complement laws. 


3. x+1=1,x0=0 
x+xy=x,x(x+y)=x 5. (x) =x 
0’=1,1'=0 

(ty) =xy’, @yy =x +y’ 


X+xX=xX,xx =x 


. The dual of a Boolean expression is obtained by replacing 0 


by 1, 1 by 0, + by -, and - by +. 


. The dual of a theorem about Boolean algebras is also a theo- 


rem. 


Section 11.3 


One can show that the associative and distributive laws hold 
for lcm and ged directly. The commutative law clearly holds. 
To see that the identity laws hold, note that 


Icm(x, 1) =x and ged(x, 6) =x. 
Since 
Iem(x, 6/x) =6 and ged(x,6/x) = 1, 


the complement laws hold. Therefore, (S,+, -,’, 1,6) is a 
Boolean algebra. 


. We show only 


x*(x+z)=(x-y)+(x-z)_ forall x,y,z € Sy. 
Now 


x+(y +z) = min{x, max{y, z}} 
(x+y) + (v+z) = max{min{x, y}, min{x, z}}. 
We assume that y < z. (The argument is similar if y > z.) 


There are three cases to consider: x < y; y < x < z;andz <x. 
If x < y, we obtain 


x+(y +z) = min{x, max{y, z}} 


= min{x, z} = x = max{x, x} 


max{min{x, y}, min{x, z}} 


(x+y) + (Xz). 
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If y < x < z, we obtain 


x+(y +z) = min{x, max{y, z}} 


min{x, z} = x = max{y, x} 


= max{min{x, y}, min{x, z}} 


(x+y) + (xz). 


If z < x, we obtain 


x+(y-+2) = min{x, max{y, 2}) 
= min{x, z} = z = max{y, z} 
= max{min{x, y}, min{x, z}} 


= (x-y) + (+2). 


7. IfXUY =UandXNY =@, then Y =X. 
8. xy +20 = x(x+ y)y 


11. 
14. 
15. 


18. 


21. 


23. 


x+y =1ifand only ifx+y=x. 
x(x + yO) =x 
(For Exercise 13) 


O=x+y= (+x) +y 
(x+y) =x+0=x 


=x- 


Similarly, y = 0. 
[For part (c)] 


x(x + y) = (x + 0)(x + y) 
x+0y=x+y0 


x+0=x 


First, show that if ba = ca and ba’ = ca’, then b = c. Now 
takea =x,b=x+(y+z), andc = (x+y) +z and use this 
result. 


If the prime p divides n, p? does not divide n. 


Section 11.4 Review 


1. 
2. 


» LetAy,.. 


The exclusive-OR of x; and x2 is 0 if x; = x2, and 1 otherwise. 


A Boolean function is a function of the form 


f Oye 


where X is a Boolean expression. 


Xn) = X(x1,...,%n), 


. A minterm is a Boolean expression of the form 


yiAya2A-s'Adn, 


where each y; is either x; or Xj. 


. The disjunctive normal form of a not identically zero Boolean 


function f is 


f(x1,-- 


where each m; is a minterm. 


., Ax denote the elements A; of Z} for which f(A;) = 
1. For each Aj = (aj, ..., dn), Set mj) = y1 A+++ A Yn, Where 
yj = x; if aj = 1, and yj = 57 if a; = 0. Then 


f(r, -- 


-,Xn) =m Vm2V---V mr, 


-,Xn) =m V m2 V +--+ V mp. 


. A maxterm is a Boolean expression of the form 


Yi Vy2Ve+*V ns 


where each y; is either x; or Xj. 


. The conjunctive normal form of a not identically one Boolean 


function f is 


f(x1,-- 


where each m; is a maxterm. 


-,Xn) =m Amz A---Amg, 


Section 11.4 


In these hints, a A b is written ab. 


1. 
4. 
7. 
10. 


11. 
17. 
22. 
25. 


28. 


xy VXyV XY 

XyZ V XYZ V XVZV Xyz V XyZ 

XYZ V XVZV XYZ 

WxYZ V WXYZ V WXYZ V WXYZ V WXYZ 
V WxyZ V WxYZ V WXYZ V WXYZ V WXYZ 


xy V xy 14. xyz 
xyz V Xyz V xyZ V XyZ 20. 0 
o” 


(For Exercise 3) 
AVYVZAAXVIVZADAVYV z) 
(For Exercise 3) 


AVIYVAIAVIVIXVYVIAVYVABXVYV2Z) 


Section 11.5 Review 


1. 
2. 


eC nrnrntnawn 


A gate is a function from Z3 into Zp. 


A set of gates G is functionally complete if, given any posi- 
tive integer n and a function f from Z into Zp, it is possible 
to construct a combinatorial circuit that computes f using only 
the gates in G. 


. {AND, OR, NOT} 


. A NAND gate receives input x; and x2, where x; and x2 are 


bits, and produces output 0 if x; and x2 are both 1, and | oth- 
erwise. 


Yes 


. The problem of finding the best circuit 

. Small components that are themselves entire circuits 
. See Figure 11.5.8. 

. See Figure 11.5.9. 


Section 11.5 


1. 
2. 


AND can be expressed in terms of OR and NOT: xy = Xv y. 


A combinatorial circuit consisting only of AND gates would 
always output 0 when all inputs are 0. 


. We use induction on n to show that there is no n-gate com- 


binatorial circuit consisting of only AND and OR gates that 
computes f(x) = X. 


15. 


If n = O, the input x equals the output x, and so it is 
impossible for a 0-gate circuit to compute f. The Basis Step is 
proved. 

Suppose that there is no n-gate combinatorial circuit 
consisting of only AND and OR gates that computes f. Con- 
sider an (n + 1)-gate combinatorial circuit consisting of only 
AND and OR gates. The input x first arrives at either an AND 
or an OR gate. Suppose that x first arrives at an AND gate. 
(The argument is similar if x first arrives at an OR gate and 
is omitted.) Because the circuit is a combinatorial circuit, the 
other input to the AND gate is either x itself, the constant 1, 
or the constant 0. If both inputs to the AND gate are x itself, 
then the output of the AND gate is equal to the input. In this 
case, the behavior of the circuit is unchanged if we remove 
the AND gate and connect x to what was the output line of 
the AND gate. But we now have an equivalent n-gate circuit, 
which, by the inductive hypothesis, cannot compute f. Thus 
the (n + 1)-gate circuit cannot compute /. 

If the other input to the AND gate is the constant 1, the 
output of the AND gate is again equal to the input and we can 
argue as in the previous case that the (n+ 1)-gate circuit cannot 
compute f. 

If the other input to the AND gate is the constant 0, the 
AND gate always outputs 0 and, so, changing the value of x 
does not affect the output of the circuit. In this case, the cir- 
cuit cannot compute f. The Inductive Step is complete. There- 
fore, no n-gate combinatorial circuit consisting of only AND 
and OR gates can compute f(x) = x. Thus {AND, OR} is not 
functionally complete. 


eS 


| }>— 


+ Yi = X1x2 V (x2 V X3)5 y2 = XQ V3 
12. 


(For Exercise 3) The dnf may be simplified to xy V xz V x y and 


then rewritten as x(y V Z) V¥¥ = (x¥z) V XY = xyzxXy, which 


gives the circuit 


Ba 


OTD 
f p= a 
—s 


oe 


as 
= 


17. 


20. 


23. 


25. 


27. 


28. 
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z= alu) oly) 

XV¥=AlLYWLalyxaxlx 
xty=[@LnLoliIlI@lolo ly 

Since 


X=x 1.x; 


xVy=QAly)L@ ly), 


and {NOT, OR} is functionally complete, {NOR} is function- 
ally complete. 


\ Oe: 


The logic table is 


x y Zz Output 
1 1 1 1 
1 1 0 1 
1 0 1 1 
1 0 0 0 
0 1 1 1 
0 1 0 0 
0 0 1 0 
0 0 0 0 
The logic table is 
b ~~ FLAGIN c | FLAGOUT 
1 1 0 1 
1 0 1 1 
0 1 1 1 
0 0 0 0 


Thus c = b @ FLAGIN and FLAGOUT = b v FLAGIN. We 
obtain the circuit 


b 
FLAGOUT 


[pL 


FLAGIN 


010100 


722 


31. 


34. 
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FLAGOUT/FLAGIN 
FLAGIN FLAGOUT/FLAGIN 
toe 
0 2's y1 
b é 
t1 module aie y 
b € 
zm module Y 2's y3 
x3 b_| module 
Writing the truth tables shows that 


x=x—>0, xVy=(x>0)-> y. 

Therefore a NOT gate can be replaced by one — gate, and an 
OR gate can be replaced by two — gates. Since the set {NOT, 
OR} is functionally complete, it follows that the set {>} is 


functionally complete. 


Chapter 11 Self-Test 


1. 


[Section 11.1] 


x oy 2 AY) VzZ 
111 1 
1 1 0 1 
1 0 1 1 
1 0 0 0 
0 1 1 1 
0 1 0 1 
0 0 1 1 
0 0 0 1 


2. [Section 11.2] The circuits are equivalent. The logic table for 


either circuit is 


x y Output 
1 1 0 
1 0 1 
0 1 0 
0 0 0 


. [Section 11.2] The circuits are not equivalent. If x = 0, y = 1, 


and z = 0, the output of circuit (a) is 1, but the output of circuit 
(b) is 0. 


. [Section 11.3] Bound laws: 


XUU=U, XNG=6 foralXeS. 
Absorption laws: 


XU(XNY)=X, XN(XUY)=X forallX,YeS. 


. [Section 11.1] 1 


6. 


7. 


10. 


11. 


12. 


[Section 11.1] 
xy 


x2 


| 


[Section 11.2] The equation is true. The logic table for either 
expression is 


x y Zz Value 
1 1 1 1 
1 1 0 1 
1 0 1 0 
1 (0) 0 0 
0 1 1 1 
0 1 0 1 
0 0 1 1 
0 0 0 0 


. [Section 11.2] The equation is false. If x = 1, y = 0, and 


z= 1, then 
(xXAYAZDV (Vz =O), 
but 


GADV GAD= 1, 


. [Section 11.1] Suppose that x is 1. Then the upper input to the 


OR gate is 0. If y is 1, then the lower input to the OR gate is 0. 
Since both inputs to the OR gate are 0, the output y of the OR 
gate is 0, which is impossible. If y is 0, then the lower input to 
the OR gate is 1. Since an input to the OR gate is 1, the output 
y of the OR gate is 1, which is impossible. Therefore, if the 
input to the circuit is 1, the output is not uniquely determined. 
Thus the circuit is not a combinatorial circuit. 
[Section 11.3] (x(x + y-0))’ = (x +0))’ (Bound law) 
= (x+x)! (Identity law) 
= (Idempotent law) 


[Section 11.3] Dual: (x + «(y+ D)’ =x’ 


(xtxQvy+ Dy = @tx-1’ 
= (x+x)' 


ed (Idempotent law) 


(Bound law) 
(Identity law) 


[Section 11.3] ~ is not a unary operator on S. For example, 
{1,2} €S. 


In Exercises 13-16, a A b is written ab. 


13. 


[Section 11.4] x1%2x3 
x1 


ams a 
oS 


[4 


> 


14. [Section 11.4] x1.x2%3 V x1 X2x3 


al Il, 
| PFD" 


15. [Section 11.4] xjx2x3 V x1 X2X3 V X1X2X3 


aD: 
‘> 
oD | = 


16. [Section 11.4] x1.x2%3 V x1 X2X3 V X1x2x3 V X1X2xX3 
17. [Section 11.5] 


x y Zz Output 
1 1 1 1 
1 1 0 0 
1 0 1 1 
1 0 0 0 
0 1 1 0 
0 1 0 0 
0 0 1 1 
0 0 0 0 


18. [Section 11.5] Disjunctive normal form: xyz V xy ZV XyZV XYZ 
(xyz V XYZ) V XVZV XYZ = xy V (XyZV XYZ) 


=xXyVXZ 


fr -F 


| 
YU VVVY 


19. [Section 11.5] 


s. 


yy —-—_—_————_ 
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20. [Section 11.5] 


’ 


c 
Half Ee a )>- é 
adder Half 7 

adder 


Section 12.1 Review 


1. 


A unit time delay accepts as input a bit x; at time ¢ and outputs 
X;—1, the bit received as input at time t — 1. 


. A serial adder inputs two binary numbers and outputs their 


sum. 


. A finite-state machine consists of a finite set Z of input sym- 


bols, a finite set O of output symbols, a finite set S of states, 
a next-state function f from S x Z into S, an output function 
g from S x TZ into O, and an initial state o € S. 


. Let M = (Z,0,8,f,g,0) be a finite-state machine. The 


transition diagram of M is a digraph G whose vertices 
are the states. An arrow designates the initial state. A di- 
rected edge (0), 02) exists in G if there exists an input i with 
f(o1, i) = 02. In this case, if g(o1, 1) = o, the edge (01, 02) is 
labeled i/o. 


. The SR flip-flop is defined by the table 


Ss R | Q 
1 1 Not allowed 

1 0 1 

0 1 0 


0 { 1 ifSwaslastequalto1 


0 ifRwaslastequalto1 


Section 12.1 


1. 


6. 


al0 


00 aT (o> b/1 


b/1 


al0 


() 


all b/1 


c/0 


a/l 
c/1 


c/0 
T = {a,b}; O = {0, 1}; S = {oo, o7}; initial state = a9 


tT a b a b 
Ss 
00 onl 00 0 1 
oO! O| O| 1 1 
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9. T={a, b}; O={0, 1}; S={o0, o1, 02, 03}; initial state = 09 


ZT a b a b 
S 
90 O| o2 0 0 
oO! 00 02 1 0 
02 03 00 0 1 
03 foal o3 0 0 
11. 1110 14. 001110 
17. 20210 20. 020022201020 
21. 
1/0 
EOD a0 
() 1/1 
0/1 
24, 
0/0 
0/0 1/0 
27. When y is input, the machine outputs x,,x%,-1,... until 
x; = 1. Thereafter, it outputs x;. However, according to 


Algorithm 11.5.16, this is the 2’s complement of a. 


Section 12.2 Review 


1. A finite-state automaton consists of a finite set Z of input sym- 
bols, a finite set S of states, a next-state function f from S x Z 
into S, a subset A of S of accepting states, and an initial state 
aeéS. 


2. A string is accepted by a finite-state automaton A if, when the 
string is input to A, the last state reached is an accepting state. 


3. Finite-state automata are equivalent if they accept precisely 
the same strings. 


24. 


Section 12.2 


1. 


All incoming edges to og output | and all incoming edges to 
o output 0; hence the finite-state machine is a finite-state au- 
tomaton. 


10. 


13. 
16. 
18. 


21. 


(For Exercise 1) ZT = {a,b}; S = {o9, 01}; A = {oo}; initial 
state = 09 


Li a b 
Ss 

er) 90 oO! 

o1 ol 9% 
Accepted 
Accepted 


No matter which state we are in, after an a we move to an 
accepting state; however, after a b we move to a nonaccepting 


state. 
©) 
(F * 


———| 


b b b 


27. 


30. 


32. (For Exercise 1) This algorithm determines whether a string 
over {a,b} is accepted by the finite-state automaton whose 


35. 


38. 
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transition diagram is given in Exercise 1. 


Input: x, the length of the string (n = 0 designates the 
null string); s; ---s,, the string 
Output: “Accept” if the string is accepted 
“Reject” if the string is not accepted 


ex32(s, n) { 


state = ‘oo’ 
fori=l1ton{ 
if (state == ‘oo’ A s; == ‘b’) 
state = ‘o,’ 
if (state == ‘o,’ A 5; == ‘b’) 
state = ‘oo’ 
} 
if (state == ‘oo’) 
return “Accept” 
else 


return “Reject” 


} 


Make each accepting state nonaccepting and each nonaccept- 
ing state accepting. 


Using the construction given in Exercises 36 and 37, we ob- 
tain the following finite-state automaton that accepts Ly M Lo. 
(We designate the states in Exercise 6 with primes.) 

The finite-state automaton that accepts L; U Lz is the 
same as the finite-state automaton that accepts L; Lz except 
that the set of accepting states is 


{(01,09), (01,01), (01,03), (00, 03)}. 


41. Use the construction of Exercises 36 and 37. 


Section 12.3 Review 


1. A “natural language” refers to ordinary written and spoken 
words and combinations of words. A “formal language” is 
an artificial language consisting of a specified set of strings. 
Formal languages are used to model natural languages and to 
communicate with computers. 


2. A phrase-structure grammar consists of a finite set N of 
nonterminal symbols, a finite set T of terminal symbols where 
NNT = ©, a finite subset of [(N U T)* — T*] x (NU T)* 
called the set of productions, and a starting symbol in NV. 

3. Ifa~ — isa production and xay € (NUT)*, we say that xBy 
is directly derivable from xay. 

4. Ifa; € (NUT)* fori = 1,...,n,andaj+ is directly derivable 
from a; fori = 1,...,— 1, we say that a, is derivable from 
a, and write a] > dp. 
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5. 
6. 


15. 
16. 


17. 


18. 


19. 
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We calla; > a2 > ++: > a, a derivation of a, from a. 


The language generated by a grammar consists of all strings 
in terminals derivable from the start symbol. 


. Backus normal form (BNF) is a way to write the productions 


of a grammar. In BNF the nonterminal symbols typically begin 
with “(” and end with ‘)”. Also the arrow — is replaced with 
i=. Productions with the same left-hand side are combined 
using the bar “|”. An example is 


(signed integer) ::= 


+ (unsigned integer) | — (unsigned integer) 


. Inacontext-sensitive grammar, every production is of the form 


aAB — adp, where a,B € (NUT)*,A € N,andé € 
(NU T)* — {a}. 


. In a context-free grammar, every production is of the form 


A — 6, where A € N and 6 € (NUT)*. 


. Inaregular grammar, every production is of the form A > a, 


A — aB,orA — i, where A,B e NandaeT. 


. Acontext-sensitive grammar 
. A context-free grammar 
. A regular grammar 


. A language is context-sensitive if there is a context-sensitive 


grammar that generates it. 


A language is context-free if there is a context-free grammar 
that generates it. 


A language is regular if there is a regular grammar that gener- 
ates it. 


A context-free, interactive Lindenmayer grammar consists of 
a finite set N of nonterminal symbols; a finite set T of terminal 
symbols where NM T = ©; a finite set of productions A > B, 
where A € NUT and B € (N UT)*; and a starting symbol 
inN. 


The von Koch snowflake is generated by the context-free, in- 
teractive Lindenmayer grammar 
N = {D} 
T = {d,+, —} 
P={D>D—D++D—-D,D>d,+—>+4, 
Se} 


d means “draw a straight line of a fixed length in the current 
direction,” + means “turn right by 60°,” and — means “turn 
left by 60°.” 


Fractal curves are characterized by having a part of the whole 
curve resemble the whole. 


Section 12.3 


1. 
4. 
7. 


Regular, context-free, context-sensitive 
Context-free, context-sensitive 
o => bo = bbo = bbadA bbabA => bbabbA => 


bbabbao = bbabbab 


10. 


12. 


15. 


18. 


21. 


24. 
25. 


28. 


31. 


32. 


o = ABA => ABBA => ABBAA 
= ABBaAA => abBBaAA = abbBaAA 
=> abbbaAA => abbbaabA => abbbaabab 


(For Exercise 1) 


<a> 
<A> 


n= b<a>|a<A>|b 
n= a<o>|b<A>|a 


S— aA,A—aA,A 
Av>b,S->a 


S — aA, S > bS,S > »,A > aA, 
A— bB,A— i, B = aA, B > bS 


<exp number > 


> bA,A 


> da, 


n= <integer> E <integer> | 

< float number > | 

< float number > E <integer > 
S > aSa, § ~ bSb,S > a,S > b,S>i 


Ifa derivation begins S = aSb, the resulting string begins with 
a and ends with b. Similarly, if a derivation begins S > bSa, 
the resulting string begins with b and ends with a. Therefore, 
the grammar does not generate the string abba. 


If a derivation begins S => abS, the resulting string begins 
ab. If a derivation begins S = baS, the resulting string begins 
ba. If a derivation begins S = aSb, the resulting string starts 
with a and ends with b. If a derivation begins S => bSa, the 
resulting string begins with b and ends with a. Therefore, the 
grammar does not generate the string aabbabba. 


The grammar does generate L, the set of all strings over {a, b} 
with equal numbers of a’s and b’s. 

Any string generated by the grammar has equal num- 
bers of a’s and b’s since whenever any of the productions are 
used in a derivation, equal numbers of a’s and b’s are added to 
the string. 

To prove the converse, we consider an arbitrary string a 
in L, and we use induction on the length |@| of @ to show that 
a is generated by the grammar. The Basis Step is |a| = 0. In 
this case, @ is the null string, and S => 2 is a derivation of a. 

Let a be a nonnull string, and suppose that any string in 
L whose length is less than |a| is generated by the grammar. 
We first consider the case that @ starts with a. Then a can be 
written a = aa,ba2, where a; and a2 have equal numbers of 
a’s and b’s. By the inductive hypothesis, there are derivations 
S => q, and S > a2 of a; and a2. But now 


S => aSbS => aa,ba 


is a derivation of aw. Similarly, if a starts with b, there is a 
derivation of w. The Inductive Step is finished, and the proof 
is complete. 


Replace each production 
A> X,+++XyB, 
where n > 1, x; € T, and B € N, with the productions 
A> XA] 
A, — XxX2A2 


An-1 > XnB, 


where A,,..., Ay— are additional nonterminal symbols. 


D+D>d+d+d+d 


S>D+D+D+D 
D+D—-D—DD+D+D—D 


D+D-—-D-DD+D+4+D-D 
D+D—-D—DD+D+D—D 
D+D—D—DD+D+D—D 
=>d+d-—d—dd+d+d-d 
d—dd+d+d-d 
d+d-—d-—dd+d+d-d 
td—d—dd+d+d-d 
START 


Section 12.4 Review 


1. 


Leto be the start state, let T be the set of input symbols, and let 
N be the set of states. Let P be the set of productions § > xS’, 
if there is an edge labeled x from S to S’, and S + A if S is 
an accepting state. Let G be the regular grammar (N, T, P, o). 
Then the set of strings accepted by A is equal to L(G). 


. A nondeterministic finite-state automaton consists of a finite 


set Z of input symbols, a finite set S of states, a next-state 
function f from S x Z into P(S), a subset A of S of accepting 
states, and an initial stateo € S. 


. A string @ is accepted by a nondeterministic finite-state 


automaton A if there is some path representing @ in the tran- 
sition diagram of A beginning at the initial state and ending in 
an accepting state. 


. Nondeterministic finite-state automata are equivalent if they 


accept precisely the same strings. 


. Let G = (N,T,P,o) be a regular grammar. The finite-state 


automaton A is constructed as follows. The set of input sym- 
bols is T. The set of states is N together with an additional 
state F ¢ N UT. The next-state function f is defined as 


f(S,x) = {S'|S > xS' € PJ}U{F|S> xe P}. 
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The set of accepting states is F together with all S for which 
S — i is a production. Then A accepts precisely the strings 
L(G). 


Section 12.4 


1. 
b 
a 
4. 


6. T= {a,b}; S = {o0, 01, 02}; A = {01, 09}; 
initial state = 09 


Lt a b 
S 
00 {o1, 02} @ 
coal {oi} {o0, 02} 
02 @ i) 


9. T = {a,b}; S = {00, 01, 02, 03}; A = {03}; 
initial state = oo 


L a b 
S 
00 {oo} {o0, o1} 
coal {o2} © 
02 @ {o3} 
03 {o3} {03} 


11. (For Exercise 6) N = {00, 01, 02}, T = {a, b}, 


09 > do1, 09 > boo, 


01 — boo, 


0| > doo, 


02 > boi, 02 >a09, O2>A 


14. No. For the first three characters, bba, the moves are deter- 
mined and we end at C. From C, no edge contains an a; there- 
fore, bbabab is not accepted. 


17. Yes. The path (o, 0, 0,0, C, C), which represents the string 
aaaab, ends at C, which is an accepting state. 
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21. 


24. 


27. 


30. (For Exercise 21) 09 — ao,, 09 — bog, 05 — boo, 
02 — bo3, 03 > ao3, 03 — bo3, 04 > ao3,03 > Ad 


Section 12.5 Review 


1. LetA = (Z,S,f,.A,o) be a nondeterministic finite-state au- 
tomaton. An equivalent deterministic finite-state automaton 
can be constructed as follows. The set of states is the power 
set of S. The set of input symbols is Z (unchanged). The start 


symbol is {o} (essentially unchanged). The set of accepting 
states consists of all subsets of S that contain at least one ac- 
cepting state of A. The next state function is defined by the 
rule 


ef 2) ifX =@ 
f&xn= : 
toe (Sx) 9X #2. 


2. A language L is regular if and only if there exists a finite-state 
automaton that accepts precisely the strings in L. 
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Section 12.5 
1. (For Exercise 1) 
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7. 


10. 


11. 


14. 


17. 


(For Exercise 21) 


Figure 12.5.7 accepts the string ba”, n > 1, and strings that 
end b? or aba", n > 1. Using Example 12.5.8, we see that Fig- 
ure 12.5.9 accepts the string a"b, n > 1, and strings that start 
b? or aba, n >= 1. 


20. 


22. 09 — ao}, 09 > bor, 09 > a, 0, — aoo, 
O| > don, 01 —> bo\, 0; > b, 02 > boo 


25. Suppose that L is regular. Then there exists a finite-state au- 
tomaton A with L = Ac(A). Suppose that A has k states. Con- 
sider the string a*bba* and argue as in Example 12.5.6. 

28. The statement is false. Consider the regular language L = 
{a"b | n > _ O}, which is accepted by the finite-state 
automaton 
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a 10. [Section 12.1] Z = {a,b}; O = {0, 1}; S = {S, A, B}; initial 
b 2 state = § 
s GC) 
b 
a b : 


f g 

The language z a b a b 

Ss 

LU ={u" |weL,ne {1,2,...}} 5 A A 0 0 

A S B 1 1 
is not regular. Suppose that L’ is regular. Then there is a finite- B A B 1 
state automaton A that accepts L’. In particular, A accepts a”b 
for every n. It follows that for sufficiently large n, the path rep- 
resenting a"b contains a cycle of length k < n. Since A accepts 11. [Section 12.1] 
a"ba"b, A also accepts a"*kba"b, which is a contradiction. 

1/1 O/1 0/0 
Chapter 12 Self-Test O/1 
1. [Section 12.1] oa 7 0/1 Se 1/0 z 
b/1 b/0 


A 
() al0 () 1/0 


12. [Section 12.2] 


2. [Section 12.1] 1101 
3. [Section 12.2] 


4. [Section 12.2] Yes 
5. [Section 12.3] Context-free 


13. [Section 12.2] Every 0 is followed by a 1. 


14, [Section 12.3] S — ASB, S — AB, AB — BA, BA — AB, 
6. [Section 12.3] S$ = aSb = aaSbb = aaaSbbb > A>a,Bob 


OAR OPE ey AEN UEN see IAD 15. [Section 12:4] Z = {a, b}; S = (op, 01, 09}; A = {oo}; initial 
7. [Section 12.3] a'b),j<2+ij>1,i>0 state = 00 


8. [Section 12.4] 


TZ a b 
S 
00 {o0, 1} @ 
oO! i) {o2} 
02 {00,02} — {oa} 
9. [Section 12.5] 16. [Section 12.4] Yes, since the path 
a 
a a 


b 
b (00, 00, 01, 02, 02, 02, 02, 00) 
b 


represents aabaaba and oo is an accepting state. 
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17. [Section 12.4] 


@) 2 eC ae 
a) ~{ O41 | o2 | 3 


1 


18. [Section 12.5] 


19. [Section 12.5] Combine the nondeterministic finite-state 
automata that accept L; and Ly in the following way. Let $ 
be the start state of Ly. For each edge of the form (Sj, 52) 
labeled a in L; where S2 is an accepting state, add an edge 
(S,, S) labeled a. The start state of the nondeterministic finite- 
state automaton is the start state of L;. The accepting states of 
the nondeterministic finite-state automaton are the accepting 
states of Lo. 


20. [Section 12.5] Let A’ be a nondeterministic finite-state 
automaton that accepts a regular language that does not con- 
tain the null string. Add a state F. For each edge, (0, 0’) 
labeled a in A’ where o’ is accepting, add the edge (o, F) 
labeled a. Make F the only accepting state. The resulting non- 
deterministic finite-state automaton A has one accepting state. 
We claim that Ac(A) = Ac(A’). 

We show that Ac(A) C Ac(A’). [The argument that 
Ac(A’)C Ac(A) is similar and omitted.] Suppose that 
a € Ac(A). There is a path 


(00, 01, s+ On—-1; On) 


that represents a in A, with o,, an accepting state. Sincea 4 A, 
there is a last symbol a in a. Thus the edge (0,1, 0) is 
labeled a. Now the path 


(00, 01, +--+, On—-1, F) 


represents a in A’ and terminates in an accepting state. There- 
fore, a € Ac(A’). 

To see that the statement is false for an arbitrary regular 
language, consider the regular language 


L= {A} U (0! | dis odd } 
and a nondeterministic finite-state automaton A with start 
state S that accepts L. Since A € L, S is an accepting state. If 


S has a loop labeled 0, then A accepts all strings of 0’s; there- 
fore, there is no loop at S labeled 0. Since 0 € Land there is no 


4 


{90,01,02} 


loop at S, there is an edge from S to an accepting state S’ 4 S, 
which is a contradiction. Therefore, A has at least two accept- 
ing states. 


Appendix A 


lL. /24a 44+b1+4+¢c 
6+d 9+e34+f 
l+g—-1+h6+i 


2/57 7 
=7 10 =1 
5. /3 18 27 
0 12 -6 
8. /_2 -35 —56 
7-18 13 
9% /18 10 
14 -6 
23 1 
12. (—4) 
14. (a) 2x 3,3x3,3x2 
33 18 47 
(b) ap= (75 i 
16 56 
= (ae) 
4 18 38 
cCA=10 00 
21775 
> _ (177215 531 
ap = (1% 93 323 
18 65 
BC = | 3425 
12 54 


17. 


20. 


LetA = (bij), Ih = (qx), Aln = (cj). Then 
n 
Ck = > digi = bixdkx = Dix. 
j=l 
Therefore, AJ, = A. Similarly, [,A = A. 
The solution is ¥ = A7!C. 


Appendix B 


1. —4x 
plies ae ey 
3 
7. 1 1 _n +1-n _ 1 
n n+l n(n + 1) n(n + 1) 
We may use this equation to compute )77"_, WE as follows: 
“~d 
» i@it+ 1) 
“1 1 
“5 i#] 
a ea) el a) 
Z 2 3 n-l on 
1 1 
, “nt :) 
1 1 _ati-l_ n 
n+1 n+1 n+1 
8. 81 
11. 1/81 
14. (a), (c), and (g) are equal. (b) and (f) are equal. (d) and (e) are 
equal. 
16. x7 + 8x +15 
19. x7 + 8x + 16 
22. x -—4 
25. (x+5)~+ 1) 
28. (x — 4)? 
31. (2x+ D(@+5) 
34. (2x + 3)(2x — 3) 
37. 2+ )I)!4+mM4+Dm4+)! = @t+dD!1+@4+))] = 
(n+ 1)!m+2) =(n+2)! 
40. 
73-2") — 4.51) — 103-2"? — 4.5") 
= 2"(7 3-2 — 10-3) +5"? (-7+4-5 + 10-4) 
= 2"? .12 4.5"-7(—100) 
= 2-7 (2? 3) — 5"-7(57 4) 
= 3-2"—4.5" 
42. Factoring gives (x — 4)(x — 2) = 0, which has solutions 
x= 4,2. 
45. 2x <6,x <3 
48. i<nfori=1,...,n.Summing these inequalities, we obtain 


n 
3 i<nen=n’. 
i=l 


51. 


54. 
59. 
64. 
68. 
71. 
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Multiply by (n + 2)n?(n + 1)? to get 
(2n+1)(n+ 1)? > 2in+2)n? 
or 


Qn? + 5n* +.4n + 1 > 2n? + 4n? 


or 


nm +4n+1>0, 


which is true ifn > 1. 


6 57. 10 
2.584962501 62. —0.736965594 
2.39223 1208 67. 0.480415248 
1.489896102 


Let u = log, y and v = log, x. By definition, b‘ = y and 
bY = x. Now 


x08, y xl (bY ye pve (bY! ) v y les Be 


Appendix C 


1. 


13. 


First large is set to 2 and i is set to 2. Since i < n is true, the 
body of the while loop executes. Since s; > large is true, large 
is set to 3. 7 is set to 3 and the while loop executes again. 

Since i < nis true, the body of the while loop executes. 
Since s; > large is true, large is set to 8. i is set to 4 and the 
while loop executes again. 

Since i < nis true, the body of the while loop executes. 
Since s; > large is false, the value of large does not change. i 
is set to 5 and the while loop executes again. 

Since i < n is false, the while loop terminates. The 
value of large is 8, the largest element in the sequence. 


. First x is set to 4. Since b > x is false, x = b is not executed. 


Since c > x is true, x = c executes, and x is set to 5. Thus x is 
the largest of the numbers a, b, and c. 
min(a, b) { 
if (a < b) 
return a 
else 
return b 
} 
odds(n) { 
i=1 
while (i < n) { 
printin(i) 
i=i+2 
} 
} 


product(s, n) { 
partial_product = 1 
fori=1ton 
partial_product = partial_product * 5; 
return partial_product 


} 
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406, 454 

Directed acyclic graph, 395 
Directed edge, 142 
Directed Euler cycle, 394 
Directed graph, 375 

directed Euler cycle in, 394 
Directly derivable string, 580, 585 
Direct proof, 64 
Dirichlet drawer principle, 319 
Disjoint sets, 6 

pairwise, 6 
Disjunction, 14 
Disjunctive normal form, 553 
Disjunctive syllogism, 32 
Disproving a universally quantified 

statement, 68 

Dissimilarity function, 378 
Distance between vertices, 395 
Distributive laws, 609 

for Boolean algebras, 544 

for sets, 8 
Divide, 14, 214 
Divide-and-conquer, 204 
Divisor, 214 
Domain 

of discourse, 36 

of a function, 113 
Domino, 390 
Dual graph, 428 
Dual statement, 547 


Eccentricity of a vertex, 445 
Edelsbrunner, H., 371 
Edgar, W. J., 58 
Edge, 373, 375 
capacity of, 507 
directed, 142 
disjoint set, 477 


flow in, 507 
improperly oriented, 512 
incident, 375 
parallel, 375 
properly oriented, 512 
in series, 423 
weight, 376 
Egyptian form of a fraction, 108 
Empty set, 3 
Encrypt a message, 250 
English, E., 252 
Equal Boolean expressions, 540 
Equal matrices, 605 
Equal sets, 3 
Equals operator (==), 621 
Equivalence class, 153 
Equivalence relation, 151, 152, 277, 
285, 387, 540, 541, 577, 584 
equivalence class in, 153 
Equivalent combinatorial circuits, 541 
Equivalent finite-state automata, 577 
Equivalent grammars, 584 
Equivalent nondeterministic 
finite-state automata, 592 
Equivalent switching circuits, 543 
Erdos, P., 383 
Erd6s number, 383 
Euclidean algorithm, 238, 239, 
242, 246 
analysis of, 240 
recursive, 245 
Euler, L., 388, 423 
Euler cycle, 388, 394, 397 
directed, 394 
Euler numbers, 338 
Euler’s formula for graphs, 423, 426 
Evaluation function, 127, 495 
Even, S., 324, 401, 424, 425, 433, 502 
Even integer, 64 
Event, 297 
independent, 307 
mutually exclusive, 305 
probability of, 302 
sample space, 297 
Exclusive-or, 16, 551 
Exhaustive proof, 75 
Existence proof, 79 
constructive, 79 
nonconstructive, 79 
Existential generalization, 46 
Existential instantiation, 46 
Existentially quantified statement, 40 
Existential quantifier, 40 


Experiment, 297 
event in, 297 
Exponent, 610 
laws of, 611 
Exponentiation mod z by repeated 
squaring algorithm, 236 
Expression 
fully parenthesized form of, 475 
infix form of, 475 
postfix form of, 475 
prefix form of, 475 
External path length, 471 
Ezekiel, M., 371 


Face in a planar graph, 423 
Factor, 214, 252 
Factorial, 90 
algorithm to compute, 205 
Factoring, 612 
Fallacy, 31 
of affirming the conclusion, 33 
Family of sets, 6 
Feasible problem, 196 
Fermat’s little theorem, 319 
Fibonacci, L., 208 
Fibonacci sequence, 207, 241, 
328, 344 
File systems, 440 
Finding a maximal flow in a network 
algorithm, 514 
Finding the distance between a 
closest pair of points 
algorithm, 368 
Finding the largest and smallest 
elements in a sequence 
algorithm, 363 
Finding the maximum of three 
numbers algorithm, 174 
Finding the maximum sum of 
consecutive values 
algorithm, 202 
Finding the maximum value in a 
sequence algorithm, 175 
Finding the 2’s complement 
algorithm, 564 
Finite-state automaton, 574, 575 
accepting state, 575 
equivalent, 577 
initial state, 575 
input symbol, 575 
next-state function, 575 


nondeterministic, 589, 591 
state, 575 
string accepted by, 576 
Finite-state machine, 568, 569 
initial state, 569 
input string, 570 
input symbol, 569 
next-state function, 569 
output function, 569 
output string, 570 
output symbol, 569 
serial-adder, 571 
state, 569 
Finiteness of an algorithm, 173 
Finite sequence, 130 
Five-coins puzzle, 477 
Flip-flop circuit, 571 
Floor, 103, 117 
Flow 
algorithm for finding 
maximal, 514 
conservation of, 507 
in an edge, 507 
maximal, 511 
in a network, 507 
value of, 508 
into a vertex, 507 
out of a vertex, 507 
Ford, L. R., Jr., 515, 529 
Forest, 450 
For loop, 623 
Formal language, 579, 580 
Four-coins puzzle, 479 
Four-color problem, 429 
Four-queens problem, 455 
Fowler, P. A., 388 
Fractal, 586 
Fraction, 609 
combining, 609 
Free tree, 438 
Free variable, 39 
Frey, P., 502 
Fukunaga, K., 324 
Full-adder circuit, 561 
Full binary tree, 465 
Full m-ary tree, 471 
Fully parenthesized form of an 
expression, 475 
Function, 111-113, 141 
Ackermann’s, 334, 336 
arrow diagram of, 113 
bijection, 121 
binary operator, 123 
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Boolean, 551, 564 

characteristic, 127 

codomain of, 113 

composition of, 122 

dissimilarity, 378 

domain of, 113 

evaluation, 127, 495 

graph of, 114 

hash, 116 

injective, 118 

inverse, 121 

inverse image, 127 

next-state, 569, 575, 591 

one-to-one, 118 

onto, 120 

order of, 186 

output, 569 

parameter, 623 

print, 624 

printin, 624 

probability, 301 

propositional, 36 

pseudocode, 623 

pseudocode output, 624 

range of, 113 

recursive pseudocode, 204 

sequence, 130 

special type of relation, 141 

surjective, 120 

unary operator, 123 
Functionally complete set of 

gates, 556 

Fundamental cycle, 458 

matrix, 458 
Fundamental theorem of 

arithmetic, 217 


Gallier, J. H., 88 
Game tree, 493 
alpha-beta pruning in, 496 
alpha cutoff in, 497 
alpha value in, 496 
beta cutoff in, 497 
beta value in, 497 
minimax procedure to 
evaluate, 495 
n-level search in, 495 
Gardner, M., 211, 252, 433 
Gate, 532, 556 
AND, 533 
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functionally complete set, 556 
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inverter, 533 
NAND, 557 
NOR, 563 
NOT, 533 
OR, 533 
Generality of an algorithm, 173 
Generalized combinations, 283 
Generalized De Morgan’s laws for 
logic, 43, 52 
Generalized permutations, 283 
Generating combinations 
algorithm, 329 
Generating permutations 
algorithm, 295 
Genesereth, M. R., 88 
Geometric sum, 91 
Gessel, L., 493 
Ghahramani, S., 324 
Gibbons, A., 433, 502 
Gilchrist, M., 223 
GIMPS (Great Internet Mersenne 
Prime Search), 79 
Goldbach’s conjecture, 176 
Goldberg, S., 371 
Golomb, S. W., 94, 502 
Google, 18 
Gose, E., 324, 464 
Graf, S., 438 
Graff, M., 252 
Graham, R. L., 58, 99 
Grammar, 580 
context-free, 582 
context-free interactive 
Lindenmayer, 584 
context-sensitive, 582 
derivable string in, 580 
derivation in, 580 
directly derivable string in, 580 
equivalent, 584 
for integers, 581 
language generated by, 581 
nonterminal symbol in, 580 
phrase-structure, 580 
production in, 580 
regular, 582 
starting symbol in, 580 
terminal symbol in, 580 
type 1, 582 
type 2, 582 
type 3, 582 
Graph, 373, 375 
acyclic, 447 
adjacency matrix of, 410 


arc in, 375 

articulation point in, 394 
bipartite, 380 

circuit in, 387 

closed path in, 395 
coloring of, 428 
complement of, 424 
complete, 380 

complete bipartite, 381 
component of, 387 
connected, 385 

cycle in, 387 

dag, 395 

deficiency of, 527 
diameter of, 395 
digraph, 375 

directed, 375 

directed acyclic, 395 
dual, 428 

edge disjoint set for, 477 
edge in, 373, 375 

Euler cycle in, 388, 394, 397 
Euler’s formula for, 423, 426 
forest, 450 

of a function, 114 
fundamental cycle in, 458 
fundamental cycle matrix of, 458 
Hamiltonian cycle in, 396 
Hamiltonian path in, 405 
homeomorphic, 424 
homomorphism, 422 
incidence matrix of, 413 
independent set in, 395 
invariant, 418 

isolated vertex in, 376 
isomorphic, 415 
isomorphism, 415 

loop in, 376 

node in, 375 

parallel edges in, 375 
path in, 374, 384 
Petersen, 420 

planar, 423 

precedence, 384 

regular, 421 
representations, 410 
r-regular, 421 
self-complementary, 422 
series reduction in, 424 
similarity, 378 

simple, 376 

simple cycle in, 387 
simple path in, 387 


subgraph of, 386 
undirected, 375 
vertex cover of, 477 
vertex in, 373, 375 
weighted, 376 
Gray code, 398, 416 
Greatest common divisor, 218, 238 
Greatest lower bound, 85 
Great Internet Mersenne Prime 
Search (GIMPS), 79 
Greedy algorithm, 461 
Gries, D., 58 


Hailperin, T., 532, 564 
Haken, W., 429, 434 
Half-adder circuit, 560 
Hall, P., 525 
Hall’s marriage theorem, 525 
Halmos, P. R., 58, 170 
Halting problem, 197 
Hamilton, W. R., 396 
Hamiltonian cycle, 396 
randomized algorithm for, 402 
Hamiltonian cycle 
problem, 197, 397 
Hamiltonian path, 405 
Harary, F., 433, 502 
Hardy, G. H., 214 
Harkleroad, L., 588 
Harmonic numbers, 100 
Harmonic series, 102 
Hash function, 116 
Height of a tree, 439 
Hell, P., 422 
Helly’s theorem, 99 
Hexadecimal addition, 232 
Hexadecimal number system, 
224, 227 
Hexadecimal to decimal 
conversion, 228 
Hierarchical definition tree, 441 
Hilbert curve, 588 
Hillier, F. S., 529 
Hinz, A. M., 371 
HIV virus, detecting, 310 
Hohn, F., 564 
Holton, D. A., 402 
Homeomorphic graphs, 424 
Homomorphism of a graph, 422 
Hopcroft, J. E., 583, 601 
Hu, T. C., 324 


Huffman code, 442 

constructing an optimal, 442 
Hypercube, 379; see also n-cube 
Hypothesis, 21, 31 
Hypothetical syllogism, 32 


Idempotent laws 
for Boolean algebras, 546 
for sets, 8 
Identifier in a programming 
language, 159 
Identity laws 
for Boolean algebras, 544 
for sets, 8 
Identity matrix, 608 
If else statement, 621 
If statement, 620 
Improperly oriented edge, 512 
Incidence matrix, 413 
Incident, 375 
Inclusion-exclusion principle 
general, 267 
for three sets, 267 
for two sets, 262 
Inclusive-or, 16 
Incomparable, 146 
Increasing sequence, 132 
Indegree of a vertex, 394 
Independent events, 307 
Independent set, 395 
Index of a sequence, 130, 134 
Indirect proof, 72 
Induction, 88, 102; see also 
Mathematical induction 
Inductive step, 90, 102, 106, 107 
Inequality, 614 
laws of, 615 
Infinite sequence, 130 
Infix form of an expression, 475 
Inhomogeneous recurrence relation, 
341, 348 
Initial condition, 328 
Initial state, 569, 575, 591 
Injective function, 118 
Inorder traversal algorithm, 473 
Input, 173 
Input string, 570 
Input symbol, 569, 575, 591 
Insertion sort algorithm, 179, 181 
recursive version, 364 
Instant insanity, 429 


Integer, 2 
computer representation of, 225 
even, 64 
odd, 64 
representations of, 224 
Integrated circuit, 560 
Internal path length, 471 
Internal vertex, 446 
International standard book number 
(ISBN), 126 
Internet addresses, 259, 266 
Intersection of sets, 6, 9 
Intersection operator, 169 
Intractable problem, 197 
Invalid argument, 31 
Invariant for a graph, 418 
Inverse function, 121 
Inverse image, 127 
Inverse of n mod ¢, 246 
using the Euclidean algorithm to 
compute, 246 
Inverse relation, 147 
Inverter, 533 
Invertible matrix, 608 
Involution law 
for Boolean algebras, 546 
for sets, 8 
Irrational number, 6 
ISBN (international standard book 
number), 126 
Isolated vertex, 376 
Isomorphic 
binary trees, 486 
graphs, 415 
rooted trees, 485 
trees, 483 
Isomorphism 
of binary trees, 486 
of graphs, 415 
of rooted trees, 485 
of trees, 483 
Iteration method of solving 
recurrence relations, 338 


J 


Jacobs, H. R., 58 

Jai Alai, 501 

Java, 16, 17, 26 

Johnsonbaugh, R., 195, 197, 211, 
369, 370, 433, 442, 456, 
502, 603 

Join operator, 166 
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Jones, R. H., 510 
Josephus, F., 99 
Josephus problem, 99 


Kasparov, G., 498 
Kayal, N., 217 
Kelley, D., 601 
Kelly, D. G., 324 
Key, 166 

private, 250 

public, 251 
Kleinrock, L., 510 
Kline, M., 58 
Knight’s tour, 400 
Knuth, D. E., 58, 179, 211, 252, 

371, 502 

Kobler, J., 434 
Kocher, P., 252 
Kohavi, Z., 564 
Konig, D., 373, 433 
KGnigsberg bridge problem, 388 
Kroenke, D., 166, 170 
Kruse, R. L., 371 
Kruskal’s algorithm, 464 
Kuratowski’s theorem, 424 
Kurosaka, R. T., 482 


L 


Labeling procedure, 515 

Language, 579, 580 
context-free, 583 
context-sensitive, 583 
formal, 579, 580 
generated by a grammar, 581, 585 
natural, 579 
non-regular, 598 
regular, 583 

Law of detachment, 32 

Laws of exponents, 611 

Laws of inequalities, 615 

Laws of logarithms, 616 

Leaf, 446 

Least common multiple, 220 

Least upper bound, 83 

Left child, 465 

Leighton, F. T., 433, 502 

Lemma, 63 

Length of a path, 376, 384 

Length of a string, 135 

Lenstra, A., 252 
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Lerner, D., 369 
Lester, B. P., 433, 502 
Level of a vertex, 439 
Lewis, T. G., 433, 502 
Lexicographic order, 290 
Leyland, P., 252 
Lial, M. L., 609 
Lindenmayer, A., 586 
Linear congruential method for 
generating pseudorandom 
numbers, 116 
Linear homogeneous recurrence 
relation, 341, 348 
Linear programming problem, 529 
Lipschutz, S., 58, 170 
Literal, 535 
Liu, C. L., 58, 324, 370, 502, 529 
Logarithm, 616 
change-of-base formula 
for, 617 
laws of, 616 
Logic, 1 
De Morgan’s laws for, 25, 43, 52 
generalized De Morgan’s laws for, 
43, 52 
Logically equivalent 
propositions, 25 
Logic game, 53 
Logic table, 533 
Loop, 142, 376 
for, 623 
while, 623 
Loop invariant, 95 
Lower bound, 85 
greatest, 85 
Lower limit, 134 
Lucas, E., 332, 338 
Lucas sequence, 338 
Lugosi, B., 377 
Luhn, H. P., 112 
Luhn algorithm, 112 


Making postage, 249 
Manber, U., 211 
Mandelbrot, B. B., 601 
Martin, G. E., 94, 109 
m-ary tree, full, 471 
Matching, 523 
complete, 523 
maximal, 523 
network, 524 


Mathematical induction, 88, 102, 
206, 328 
basis step, 90, 107 
inductive step, 90, 102, 106, 107 
principle of, 89, 90 
strong form, 102 
Mathematical system, 63 
Matrix, 605 
adjacency, 410 
equal, 605 
fundamental cycle, 458 
identity, 608 
incidence, 413 
invertible, 608 
multiplication, 195, 606 
multiplication algorithm, 195 
power of, 607 
product, 606 
of a relation, 160 
scalar product, 606 
size of, 605 
square, 608 
sum, 606 
transpose, 608 
Max flow, min cut theorem, 519, 521 
Maximal flow, 511 
algorithm for finding, 514 
Maximal matching, 523 
Maxterm, 554 
McCalla, T. R., 560, 564 
McCarthy, J., 349 
McNaughton, R., 211, 601 
Median, 140 
Melissa virus, 257 
Mendelson, E., 560, 564, 565 
Merge sort algorithm, 358 
Merging two sequences 
algorithm, 357 
Mersenne, M., 79 
Mersenne prime, 79 
Mesh model for parallel 
computation, 416 
Method of nearest neighbors, 464 
Mihailescu, P., 276 
Miller, R., 211, 433, 502 
Minimal cut, 521 
Minimal spanning tree, 459 
algorithm for, 459, 463 
Minimax procedure, 495 
Minimization problem, 558 
Minimum, 67 
Minimum-queens problem, 459 
Minterm, 552 


Mitchison, G. J., 208 

Modulus operator, 115 

Modus ponens, 32 

Modus tollens, 32 

universal, 47 

Monty Hall problem, 300 
Moser, L. K., 501 
Multiplication of matrices, 195, 606 
Multiplication principle, 256 
Mu Torere, 501 

Mutually exclusive events, 305 


Nadler, M., 324 
NAND gate, 557 
n-ary relation, 165 
Natural language, 579 
Navratilova, M., 438 
Naylor, M., 208 
n-cube, 379, 398, 416 
Necessary condition, 23 
Negation of a proposition, 17 
Nested quantifiers, 50 
Network, 506 
cut in, 520 
finding a maximal flow in, 514 
flow in, 507 
matching, 524 
maximal flow in, 511 
pumping, 509 
sink in, 507 
source in, 507 
traffic flow, 509 
Newman, J. R., 433 
Next-state function, 569, 575, 591 
Nievergelt, J., 211, 502 
Nilsson, N. J., 502 
Nim, 493 
Niven, I., 252, 324 
n-level search, 495 
Node, 375 
Nonconstructive proof, 79 
Nondecreasing sequence, 132 
Nondeterministic finite-state 
automaton, 589, 591 
accepting state, 591 
equivalent, 592 
initial state, 591 
input symbol, 591 
next-state function, 591 
state, 591 
string accepted by, 592 


Nonincreasing sequence, 132 
Nonlinear recurrence relation, 341 
Non-regular language, 598 
Nonterminal symbol, 580, 584 
NOR gate, 563 
Not equal operator (—=), 621 
NOT gate, 533 
Not operator (pseudocode: —), 621 
NP-complete problem, 197 
n-queens problem, 323, 455 
nth root, 610 
n-tuple, 10 
Null set, 3 
Null string, 135 
Number system 
base, 224 
binary, 224 
decimal, 224 
hexadecimal, 224, 227 
octal, 224, 238 
Number theory, 214 
Nyhoff, L., 371 
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Octal number system, 224, 238 
Odd integer, 64 
Omega notation, 186 
One-to-one function, 118 
Onto function, 120 
Operand, 475 
Operations research, 506 
Operator, 475 

binary, 123 

difference, 100, 169 

intersection, 169 

join, 166 

modulus, 115 

precedence, 17 

projection, 166 

selection, 166 

unary, 123 

union, 169 
Ordered pair, 9, 83 
Ordered tree, 493 
Order of a function, 186 
Ore, O., 433, 502 
OR gate, 533 
Or operator (pseudocode: Vv), 621 
Outdegree of a vertex, 394 
Output, 173 

pseudocode functions, 624 
Output function, 569 


Output string, 570 
Output symbol, 569 


Pairwise disjoint sets, 6 
Palindrome, 265 
Parallel algorithm, 379 
Parallel circuit, 538 
Parallel computer, 379 
Parallel edges, 375 
Parameter, 623 
Parent of a vertex, 446 
Partial order, 146 
comparable elements in, 146 
incomparable elements in, 146 
Partition of a set, 9, 151, 281 
Pascal’s triangle, 315 
Path, 374, 384 
closed, 395 
crossing in, 428 
Hamiltonian, 405 
length, 376, 384 
representing, 576, 592 
simple, 387 
Path length 
external, 471 
internal, 471 
Pattern recognition, 308 
Pearl, J., 498 
Peitgen, H., 586 
Permutation, 269 
algorithm for 
generating, 295 
derangement, 352 
generalized, 283 
r-, 271 
rise/fall, 338 
Petersen graph, 420 
Pfleeger, C. P., 252 
Phrase-structure grammar, 580 
Pigeonhole principle, 319 
first form, 319 
second form, 320 
third form, 321 
Planar graph, 423 
face in, 423 
triangulation of, 428 
Planar map, 428 
Plane convex set, 99 
Pluznikov, P., 564 
Polish notation, 475 
Polygon, convex, 336 
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Polynomial evaluation algorithm, 372 
Polyomino, 94 
of order s, 94 
Population growth, 339, 344 
Posa, L., 401 
Postage, making, 249 
Postfix form of an expression, 475 
Postorder traversal algorithm, 474 
Power of a matrix, 607 
Power set, 5 
Precedence graph, 384 
Precedence of operators, 17 
Precision of an algorithm, 173 
Predicate, 36 
Prefix form of an expression, 475 
Premise, 31 
Preorder traversal algorithm, 472 
Preparata, F. P., 369, 371 
Prim’s algorithm, 459, 463 
Prime, 14, 215 
Mersenne, 79 
Principle of mathematical induction, 
89, 90 
print function, 624 
println function, 624 
Private key, 250 
Probability, 297, 298 
Bayes’ theorem, 308 
birthday problem, 303 
conditional, 306 
event, 297 
of an event, 302 
experiment, 297 
function, 301 
independent events, 307 
mutually exclusive events, 305 
sample space, 297 
Prodinger, H., 395 
Production, 580, 585 
Product notation, 134 
index in, 134 
lower limit in, 134 
upper limit in, 134 
Product of matrices, 606 
Projection operator, 166 
Proof, 62, 63 
by cases, 74 
common errors, 70 
constructive, 79 
by contradiction, 72, 87 
by contrapositive, 73 
direct, 64 
of equivalence, 76 
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exhaustive, 75 
existence, 79 
indirect, 72 
nonconstructive, 79 
resolution, 85 
subproof, 66 
Properly oriented edge, 512 
Proper subset, 5 
Properties of combinatorial 
circuits, 539 
Proposition, 14 
biconditional, 24 
conditional, 20 
conjunction of, 14 
disjunction of, 14 
exclusive-or, 16 
inclusive-or, 16 
logically equivalent, 25 
negation of, 17 
rules of inference for, 32 
truth table of, 15 
Propositional function, 36 
domain of discourse of, 36 
Prusinkiewicz, P., 586 
Pseudocode, 174, 620 
and operator (A), 621 
assignment operator (=), 620 
comment (//), 622 
concatenation operator 
(+), 624 
equals operator (==), 621 
for loop, 623 
function, 623 
function parameter, 623 
if else statement, 621 
if statement, 620 
not equal operator (==), 621 
not operator (—), 621 
or operator (V), 621 
output functions, 624 
print function, 624 
printin function, 624 
relational operator in, 621 
return statement, 624 
string, 624 
while loop, 623 
Pseudorandom numbers, 116 
linear congruential method for 
generating, 116 
Public key, 251 
Pumping network, 509 
Putahi, 501 
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Q (rational numbers), 2 
Qt (positive rational numbers), 3 
Q™ (negative rational numbers), 3 
Qre"’s (nonnegative rational 
numbers), 3 
Quadratic equation, 613 
quadratic formula, 614 
solution of, 613 
solving, 613 
Quadratic formula, 614 
Quadratic Koch islands, 588 
Quantified statements, rules of 
inference for, 46 
Quantifier, 36 
existential, 40 
nested, 50 
universal, 38 
Query, 166 
Quinn, M. J., 433, 502 
Quotient, 14, 214 
Quotient-remainder theorem, 
105, 118 


R (real numbers), 2 
R¢™ (positive real numbers), 3 
R™ (negative real numbers), 3 


R”’""*s (nonnegative real numbers), 3 


Randomized algorithms, 181 
Randomized Hamiltonian cycle 
algorithm, 402 
Range of a function, 113 
Rational number, 2 
r-combination, 272 
Read, R. C., 418 
Real number, 2 
completeness property of, 84 
greatest lower bound, 85 
least upper bound, 83 
lower bound, 85 
sign of, 82 
upper bound, 83 
Recurrence relation, 328 
inhomogeneous, 341, 349 
initial conditions for, 328 
linear homogeneous, 341, 349 
nonlinear, 341 
solving, 338 
Recursive algorithm, 204, 328 
base case for, 206 


Recursive function (pseudocode), 204 
Reflexive relation, 142 
Refutation complete, 88 
Regular grammar, 582 
Regular graph, 421 
Regular language, 583 
Reingold, E., 211, 324 
Relation, 141 
antisymmetric, 143 
binary, 141 
composition of, 147 
digraph of, 142 
equivalence, 151, 152, 277, 285, 
387, 540, 541, 577, 584 
inverse of, 147 
matrix of, 160 
n-ary, 165 
partial order, 146 
recurrence, 328 
reflexive, 142 
symmetric, 143 
total order, 146 
transitive, 144, 195 
transitive closure of, 157 
Relational database, 165, 166 
attribute in, 166 
difference operator, 169 
intersection operator, 169 
join operator, 166 
key in, 166 
projection operator, 166 
selection operator, 166 
union operator, 169 
Relational operator, 621 
Relative complement of a set, 6 
Repeated squaring, 233 
algorithm, 234 
used to compute 
a" mod z, 236 
Representation of graphs, 410 
Representations of integers, 224 
Resolution proof, 85 
by contradiction, 87 
correctness, 88 
refutation complete, 88 
Return statement, 624 
Reverse Polish notation, 475 
Right child, 465 
Right tromino, 94 
Ring model for parallel 
computation, 398 
Riordan, J., 313, 324 


Rise/fall permutation, 338 
Rivest, R. L., 251, 252 
Roberts, F. S., 324, 370 
Robinson, J. A., 85 
Robot walking 
algorithm, 207 
Rooted tree, 438 
isomorphic, 485 
isomorphism, 485 
Ross, S. M., 324 
Rozanov, Y. A., 324 
r-permutation, 271 
r-regular graph, 421 
RSA public-key cryptosystem, 250 
Rules of inference, 31, 46 
addition, 32 
conjunction, 32 
disjunctive syllogism, 32 
existential generalization, 46 
existential instantiation, 46 
hypothetical syllogism, 32 
law of detachment, 32 
modus ponens, 32 
modus tollens, 32 
for propositions, 32 
for quantified statements, 46 
simplification, 32 
universal generalization, 46 
universal instantiation, 46 


Saad, Y., 433 
Sabatini, G., 438 
Sample space, 297 
Saxena, N., 217 
Scalar product, 606 
Schneider, B., 297 
Schroder numbers, 336 
Schumer, P., 99 
Schwenk, A. S., 401 
Searching algorithms, 177 
binary search, 354 
breadth-first search, 453 
depth-first search, 454 
searching an unordered 
sequence, 195 
text search, 178 
Searching an unordered sequence 
algorithm, 195 
Searching the web, 18 
Sedol, L., 498 
Selection operator, 166 


Selection sort, 184 
Selection sort algorithm, 353 
Seles, M., 438 
Self-complementary graph, 422 
Sequence, 82, 130 
de Bruijn, 394 
decreasing, 132 
Fibonacci, 207, 241, 328, 344 
finite, 130 
increasing, 132 
index of, 130, 134 
infinite, 130 
Lucas, 338 
nondecreasing, 132 
nonincreasing, 132 
smoothing, 318 
subsequence, 132 
tribonacci, 212 
Sequential circuit, 532, 568 
Serial adder, 568 
circuit, 569 
finite-state machine, 571 
Serial algorithm, 379 
Serial computer, 379 
Series circuit, 538 
Series reduction in a graph, 424 
Set, 1,2 
absorption laws for, 8 
associative laws for, 8 
bound laws for, 8 
cardinality, 3 
Cartesian product of, 9 
collection of, 6 
commutative laws for, 8 
complement laws for, 8 
complement of, 6 
convex, 99 
De Morgan’s laws for, 8 
difference, 6 
disjoint, 6 
distributive laws for, 8 
empty, 3 
equal, 3 
family of, 6 
idempotent laws for, 8 
identity laws for, 8 
intersection of, 6, 9 
involution law for, 8 
null, 3 
pairwise disjoint, 6 
partition of, 9, 151, 281 
power, 5 
proper subset of, 5 
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relative complement of, 6 
set-builder notation, 2 
subset of, 4 
symmetric difference of, 13 
union of, 5, 8 
universal, 6 
universe, 6 
Venn diagram, 7 
void, 3 
0/1 laws for, 8 
Set-reset flip-flop circuit, 571 
Set-builder notation, 2 
Shamir, A., 251 
Shamos, M. I., 371 
Shannon, C. E., 532 
Sheehan, J., 420 
Shoe box principle, 319 
Shortest-path algorithm, 406 
Dijkstra’s, 406 
Shuffle algorithm, 182 
Sibling vertices, 446 
Sigler, L., 208 
Sigma notation, 134 
index in, 134 
lower limit in, 134 
upper limit in, 134 
Sign of a real number, 82 
Similarity graph, 378 
Simple cycle, 387 
Simple graph, 376 
complement of, 422 
self-complementary, 422 
Simple path, 387 
Simplex algorithm, 529 
Simplification, 32 
Single-elimination 
tournament, 438, 465 
Sink, 507 
Size of a matrix, 605 
Skiena, S., 501 
Slagle, J. R., 502 
Smith, A. R., 586 
Smoothing sequence, 318 
Solow, D., 109 
Solving a quadratic equation, 613 
Solving recurrence relations, 338 
iteration method, 338 
method for linear homogeneous 
recurrence, 342 
Solving the four-queens problem 
using backtracking 
algorithm, 455 
Sorting, 179 
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binary insertion sort, 504 
insertion sort, 179, 181 
insertion sort (recursive 
version), 364 
merge sort, 358 
problem, 480 
selection sort, 184, 353 
tournament sort, 483 
worst-case time for, 481 
Source, 507 
Spanning tree, 452 
breadth-first search for, 453 
depth-first search for, 454 
minimal, 459 
Square matrix, 608 
SR flip-flop circuit, 571 
*-set, 223 
Starting symbol, 580, 585 
State, 569, 575, 591 
accepting, 574, 575, 591 
initial, 569, 575, 591 
Stirling number 
of the first kind, 280, 338 
of the second kind, 281, 338 
Stoll, R. R., 58, 170 
Straight tromino, 98 
Strassen’s algorithm, 195 
String, 135 
accepted, 576, 592 
bit, 135 
concatenation, 135 
concatenation operator 
(pseudocode: +), 624 
derivable, 580, 585 
directly derivable, 580, 585 
input, 570 
length of, 135 
null, 135 
output, 570 
over X, 135 
in pseudocode, 624 
substring of, 135 
Strong form of mathematical 
induction, 102 
Subgraph, 386 
Subproof, 66 
Subsequence, 132 
Subset, 4 
proper, 5 
Subset-sum problem, 459 
Substring, 135 
Subtree, 446 
Sudkamp, T. A., 601 


Sudoku, 458 
Sufficient condition, 24 
Sullivan, M., 609 
Summation-by-parts 
formula, 139 
Sum notation, 134 
index in, 134 
lower limit in, 134 
upper limit in, 134 
Sum of matrices, 606 
Supersink, 509 
Supersource, 509 
Surjective function, 120 
Switching circuit, 538 
equivalent, 543 
Switching table, 538 
Symmetric difference of sets, 13 
Symmetric relation, 143 


T 


Tarjan, R. E., 334, 502, 529 

Task scheduling, 146 

Taubes, G., 252 

Telemarketing, 309 

Terminal symbol, 580, 584 

Terminal vertex, 446 

Testing for an accepted string 
algorithm, 577 

Testing whether an integer is prime 
algorithm, 216 

Testing whether two binary trees are 
isomorphic algorithm, 489 

Text search algorithm, 178 

Theorem, 63 

Theta notation, 186 

3D-septomino, 98 

Tiling, 94, 206 

Tiling a deficient board with 
trominoes algorithm, 206 

Time and space for algorithms, 181 

Total order, 146 

Tournament sort, 483 

Tower of Hanoi, 331, 340, 349 

Trace of an algorithm, 174 

Tractable problem, 196 

Traffic flow network, 509 

Transition diagram, 570 

Transitive closure of a relation, 157 

Transitive relation, 144, 195 

Transportation problem, 529 

Transport network, 506; see also 
Network 


Transpose of a matrix, 608 
Transposition, 27 
Traveling salesperson 
problem, 197, 377, 
398 
Tree, 438 
ancestor in, 446 
binary, 465 
binary search, 467 
branch vertex in, 446 
center for, 445 
child in, 446 
decision, 477 
descendant in, 446 
free, 438 
full m-ary, 471 
game, 493 
height of, 439 
hierarchical definition, 
441 
internal vertex in, 446 
isomorphic, 483 
isomorphism, 483 
leaf in, 446 
minimal spanning, 459 
ordered, 493 
parent in, 446 
rooted, 438 
sibling in, 446 
spanning, 452 
subtree of, 446 
terminal vertex in, 446 
Tree traversal, 471 
inorder, 473 
postorder, 474 
preorder, 472 
Triangulation of a planar 
graph, 428 
Tribonacci sequence, 212 
Tromino, 94 
right, 94 
straight, 98 
True by default, 22 
Truth table, 15 
T-tetromino, 98 
Tucker, A., 58, 313, 324, 370, 434, 
529 
Turing machine, 601 
Turing’s hypothesis, 601 
2’s complement, 564 
algorithm to find, 564 
2’s module, 563 
Type | grammar, 582 


Type 2 grammar, 582 
Type 3 grammar, 582 


Ullman, J. D., 170 
Unary operator, 123 
Undefined term, 63 
Undirected graph, 375; see also 
Graph 
Union of sets, 5, 8 
Union operator, 169 
Unique factorization theorem, 217 
Unit time delay, 568 
Universal generalization, 46 
Universal instantiation, 46 
Universally quantified 
statement, 38 

disproving, 68 
Universal modus tollens, 47 
Universal quantifier, 38 
Universal set, 6 
Universe, 6 
Unsolvable problem, 197 
Upper bound, 83 

least, 83 
Upper limit, 134 


Vv 


Vacuously true, 22 
Valid argument, 31 
Value of a flow, 508 


Variable 
bound, 39 
free, 39 
Venn diagram, 7 
Vertex, 142, 373, 375 
adjacent, 375 
ancestor of, 446 
branch, 446 
child of, 446 
cover, 477 
degree of, 388 
descendant of, 446 
distance between, 395 
eccentricity of, 445 
flow into, 507 
flow out of, 507 
incident, 375 
indegree of, 394 
internal, 446 
isolated, 376 
leaf, 446 
level of, 439 
outdegree of, 394 
parent of, 446 
sibling, 446 
terminal, 446 
Very large scale integration 
(VLSD, 95 
Vilenkin, N. Y., 324 
VLSI (very large scale 
integration), 95 
layout problem, 95 
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Vogan, E., 377 
Void set, 3 
von Koch snowflake, 585 


Wagner, R., 377 

Wagon, S., 395 

Ward, S. A., 564 

Web search, 18 

Weighted graph, 376 

Weight of an edge, 376 

Well-ordering property, 105 

West, D., 433 

While loop, 623 

Wilson, R. J., 433 

Winograd, S., 195 

Wong, D. F., 95 

Wood, D., 586, 601 

Worst-case time for an algorithm, 
181, 185, 191 

Worst-case time for sorting, 481 

Wos, L., 88 


Z (integers), 2 
Z* (positive integers), 3 
Z” (negative integers), 3 
zens (nonnegative integers), 3 
0/1 laws 
for Boolean algebras, 546 
for sets, 8 


List of Symbols 


LOGIC 
PVq_ porgq; page 14 P=Q Pand Qare logically equivalent; page 25 
PAq_ pandq; page 14 Vv for all; page 38 
=p not p; page 17 a there exists; page 40 
p— q if p, then q; pages 20-21 Fs therefore; page 31 


Pp <q pit and only if g; pages 24-25 


SET NOTATION 


{x1, tee Xn} 

{x | p@)} 

Z, Z, Zt, Tonnes 
Q, Q, Q*, Quonnes 


R, R, R', Rronnes 


xEx 
x€X 
X=Y 
|X| 


set consisting of the elements x), ..., x,3 page 2 

set consisting of those elements x satisfying property p(x); page 2 

sets of integers, negative integers, positive integers, nonnegative integers; pages 2-3 
sets of rational numbers, negative rational numbers, positive rational numbers, 
nonnegative rational numbers; pages 2-3 


sets of real numbers, negative real numbers, positive real numbers, 
nonnegative real numbers; pages 2-3 


x is an element of X; page 3 

x is not an element of X; page 3 

set equality (X and Y have the same elements); page 3 
cardinality of X (number of elements in X); page 3 
empty set; page 3 

X is a subset of Y; page 4 

X is a proper subset of Y; page 5 

power set of X (all subsets of X); page 5 

X union Y (all elements in X or Y); page 5 


union of X;,..., Xp (all elements that belong to at least one of X1, X2,..., Xn); page 9 


union of X;, X2,... (all elements that belong to at least one of X1, X2,...); page 9 


union of S (all elements that belong to at least one set in S); page 8 
X intersect Y (all elements in X and Y); pages 5-6 


intersection of X;,..., X,, (all elements that belong to every one of X,, X2,..., X,); page 9 


intersection of X,, X2,... (all elements that belong to every one of X,, X2,...); page 9 


intersection of S (all elements that belong to every set in S); page 9 
set difference (all elements in X but not in Y); page 6 

complement of X (all elements not in X); page 6 

ordered pair; page 9 

n-tuple; page 10 


XxY Cartesian product of X and Y [pairs (x, y) with x in X and y in Y]; page 9 
X, x X2 x +--+ x X, Cartesian product of X;, X2,..., X, (n-tuples with x; € X;); page 10 
XAY symmetric difference of X and Y; page 13 

RELATIONS 
xRy (x, y) is in R (x is related to y by the relation R); page 141 
[x] equivalence class containing x; page 153 
R-! inverse relation [all (y, x) with (x, y) in R]; page 147 


R> 0 R, composition of relations; page 147 


x<y  xRy; page 146 


FUNCTIONS 


value assigned to x; page 114 

function from X to Y; page 113 

composition of f and g; page 122 

inverse function [all (y, x) with (x, y) inf]; page 121 


f(r) = O(g(n)) |f(@™)| < Clg(@)| for n sufficiently large; page 186 
fr) = Q(g(n)) clg()| < |f(@)| for n sufficiently large; page 186 
fr) = O(g(a)) cle~™| < If@| < Clg(@)| for n sufficiently large; page 186 


COUNTING 


C(n, r) number of r-combinations of an n-element set (n!/[(n — r)!r!]); page 273 
P(n,r) number of r-permutations of an n-element set [n(n — 1) ---(n — r+ 1)]; page 271 


GRAPHS 
G=(V,E) graph G with vertex set V and edge set E; page 375 
(vy, w) edge; page 375 
5(v) degree of vertex v; page 388 
(Y1,--+5Vn) path from v; to v,; pages 384-385 
(V1,---, Vn); V1 = Vn cycle; page 387 
Kn complete graph on n vertices; page 380 
Kinn complete bipartite graph on m and n vertices; page 381 
w(i, j) weight of edge (i, j); page 405 
Fy flow in edge (i, j); page 507 
Ci capacity of edge (i, j); page 507 
(P, P) cut in a network; page 520 
PROBABILITY 


P(x) probability of outcome x; page 301 
P(E) probability of event E; page 302 
P(E|F) conditional probability of E given F [P(E NM F)/P(F)]; page 306 


List of Symbols 


BOOLEAN ALGEBRAS AND CIRCUITS 


xVy x ory (1 ifx ory is 1, 0 otherwise); page 533 

xAYy x and y (1 if x and y are 1, 0 otherwise); page 533 

x@®y exclusive-OR of x and y (0 if x = y, 1 otherwise); page 551 
x not x (O if x is 1, 1 if x is 0); page 533 

eey x NOR y (0 if x or y is 1, 1 otherwise); page 563 

xty x NAND y (0 if x and y are 1, 1 otherwise); page 557 


>) > OR gate; page 533 
“>. AND gate; page 533 
>— NOT gate (inverter); page 533 
=> NOR gate; page 563 
“>_>. NAND gate; page 557 


STRINGS, GRAMMARS, AND LANGUAGES 


Xr null string; page 135 

|s| length of the string s; page 135 

st concatenation of strings s and t (s followed by 1); page 135 

a” aa---a(na’s); page 135 

x* set of all strings over X; page 135 

Xt set of all nonnull strings over X; page 135 

a—>B production in a grammar; page 580 

a> Bp B is derivable from a; page 580 

a) >a. >--- >a, derivation of a, from a1; page 580 

L(G) language generated by the grammar G; page 581 

Si=T Backus normal form (BNF); page 581 

os f= 7; | To S::= 7), S := To; page 581 

Ac(A) set of strings accepted by A; page 576 
MATRICES 


(aj) matrix with entries aj; page 605 

A =B matrices A and B are equal (A and B are the same size and their corresponding entries are equal); page 605 
A+B matrix sum; page 606 

cA scalar product; page 606 

—-A  (—1)A; page 606 

A—B A+ (-—B); page 606 

AB matrix product; page 606 

A” matrix product AA ---A (nA’s); page 607 


MISCELLANEOUS 


Igx 

Inx 

m mod n 
Lx] 

[x] 

b\a 
bia 
gcd(a, b) 
Icm(a, b) 


n! 


logarithm to the base 2 of x; page 188 

natural logarithm of x (logarithm to the base e of x); page 201 
remainder when m is divided by n; page 115 

floor of x (greatest integer less than or equal to x); page 117 
ceiling of x (smallest integer greater than or equal to x); page 117 
b divides a; page 214 

b does not divide a; page 214 

greatest common divisor of a and b; page 218 

least common multiple of a and b; page 220 

n factorial [n(n — 1) ---2+1]; page 90 

nth Fibonacci number; page 207 

nth Catalan number; page 276 


Gn + Amn+1 Si a dn; page 133 


the sum of the elements in the set {a; | i € X}; page 134 


An *Am+1 +++ Ans page 133 


the product of the elements in the set {a; | i € X}; page 134 


ieX 
ALGORITHM NOTATION 
x=y assign the value y to x; page 620 
{} mark the beginning and end of a block of statements; page 621 
if (p) if p is true, execute action; page 620 
action 
if (p) if p is true, execute action/; if p is false, execute action2; page 621 
action 
else 
action2 
// a comment starts with // and continues to the end of the line; page 622 
return suspend execution and return to invoker; page 624 
return x suspend execution and return the value x to invoker; page 624 
while (p) if p is true, execute action and repeat this process; page 623 
action 


for var = init to limit execute action for values of var from init to limit; page 623 


action 


funct(p,,... 


print(s, . 


printin(s,.. 


» Dk) invoke function funct with arguments p;, ..., py; page 623 
ses) print the values s1,..., 5,3 page 624 
-5 Sn) print the values s1,..., 5, and then add a newline; page 624 


